{"id":617940,"date":"2019-10-28T12:28:14","date_gmt":"2019-10-28T19:28:14","guid":{"rendered":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/?post_type=msr-project&#038;p=617940"},"modified":"2022-04-21T15:55:10","modified_gmt":"2022-04-21T22:55:10","slug":"prometheus-microsoft-research","status":"publish","type":"msr-project","link":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/project\/prometheus-microsoft-research\/","title":{"rendered":"Prometheus"},"content":{"rendered":"<section class=\"mb-3 moray-highlight\">\n\t<div class=\"card-img-overlay mx-lg-0\">\n\t\t<div class=\"card-background  has-background-catalina-blue card-background--full-bleed\">\n\t\t\t\t\t<\/div>\n\t\t<!-- Foreground -->\n\t\t<div class=\"card-foreground d-flex mt-md-n5 my-lg-5 px-g px-lg-0\">\n\t\t\t<!-- Container -->\n\t\t\t<div class=\"container d-flex mt-md-n5 my-lg-5 align-self-center\">\n\t\t\t\t<!-- Card wrapper -->\n\t\t\t\t<div class=\"w-100 w-lg-col-5\">\n\t\t\t\t\t<!-- Card -->\n\t\t\t\t\t<div class=\"card material-md-card py-5 px-md-5\">\n\t\t\t\t\t\t<div class=\"card-body \">\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n<h1 id=\"prometheus\" class=\"h2\">Prometheus<\/h1>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n<p>Project Prometheus is building faster, more efficient datacenter systems by co-designing distributed systems with new network primitives. Prometheus takes advantage of new programmable hardware to accelerate applications. We are working across the entire system stack, from applications and distributed algorithms to network design and device architecture.<\/p>\n\n\n\n<p>In the last decade, the scale of distributed systems in the datacenter has grown dramatically. At the same time, single-node performance has increased substantially, thanks largely to the deployment of new types of accelerators. The result is that communication, coordination, and other classic distributed systems challenges have become the bottleneck in many systems. Our goal is to build a <em>distributed systems accelerator<\/em> that addresses this bottleneck and gives order-of-magnitude improvements in distributed system performance and efficiency.<\/p>\n\n\n\n<p>The enabling technologies are already here, in the form of programmable network cards and network switches, capable of running custom logic at terabits per second. We are using them in a new way, to support distributed systems rather than traditional network protocols. This requires co-design between distributed algorithms and new network functionality.<\/p>\n\n\n\n<p>Some examples of the benefits gained from this approach include:<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/newed.any0.dpdns.org\/en-us\/research\/publication\/harmonia-near-linear-scalability-for-replicated-storage-with-in-network-conflict-detection\/\">replicated storage systems<\/a> that scale with the number of replicas<\/li><li>up to a 35x speedup in <a href=\"https:\/\/newed.any0.dpdns.org\/en-us\/research\/publication\/eris-coordination-free-consistent-transactions-using-network-multi-sequencing\/\">transaction processing<\/a> performance<\/li><li>a reduction in <a href=\"https:\/\/newed.any0.dpdns.org\/en-us\/research\/publication\/scaling-distributed-machine-learning-with-in-network-aggregation\/\">DNN training time<\/a> by as much as 67%<\/li><li>a <a href=\"https:\/\/newed.any0.dpdns.org\/en-us\/research\/publication\/pegasus-load-aware-selective-replication-with-an-in-network-coherence-directory\/\">key-value store<\/a> that meets strict latency SLOs using 88% fewer machines<\/li><\/ul>\n\n\n","protected":false},"excerpt":{"rendered":"<p>Project Prometheus is building faster, more efficient datacenter systems by co-designing distributed systems with new network primitives. Prometheus takes advantage of new programmable hardware to accelerate applications.<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13547],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-617940","msr-project","type-msr-project","status-publish","hentry","msr-research-area-systems-and-networking","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"","related-publications":[735949,1054344,1054323,1053594,924795,896829,896814,826018,826009,781846,766291,475407,701242,701233,656973,656964,656949,579094,579076,579067,475590],"related-downloads":[],"related-videos":[],"related-groups":[144927],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Dan Ports","user_id":37404,"people_section":"Section name 0","alias":"dports"},{"type":"user_nicename","display_name":"Jacob Nelson","user_id":36275,"people_section":"Section name 0","alias":"jacnels"},{"type":"user_nicename","display_name":"Ricardo Bianchini","user_id":33393,"people_section":"Section name 0","alias":"ricardob"}],"msr_research_lab":[199565],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/617940","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":8,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/617940\/revisions"}],"predecessor-version":[{"id":896817,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/617940\/revisions\/896817"}],"wp:attachment":[{"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/media?parent=617940"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=617940"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=617940"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=617940"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/newed.any0.dpdns.org\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=617940"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}