Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications
Publication SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills Amey Agrawal, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Ramachandran Ramjee September 2023 Project
Publication Switchboard: Efficient Resource Management for Conferencing Services Rahul Bothra, Rohan Gandhi, Ranjita Bhagwan, Venkat Padmanabhan, Rui Liang, Steve Carlson, Vinayaka Kamath, Sreangsu Acharyya, Ken Sueda, Somesh Chaturmohta, Harsha Sharma SIGCOMM | September 2023 Project
Publication Improving Network Availability with Protective ReRoute David Wetherall, Abdul Kabbani, Van Jacobson, Jim Winget, Yuchung Cheng, Charles B. Morrey III, Uma Moravapalle, Phillipa Gill, Steven Knight, Amin Vahdat SIGCOMM 2023 | September 2023
Publication Teal: Learning-Accelerated Optimization of WAN Traffic Engineering Zhiying Xu, Francis Y. Yan, Rachee Singh, Justin T. Chiu, Alexander M. Rush, Minlan Yu ACM Special Interest Group on Data Communication Conference (SIGCOMM ’23) | September 2023
Publication Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures Yongji Wu, Danyang Zhuo, Matthew Lentz, Yao Lu VLDB | September 2023
Publication Resilient Baseband Processing in Virtualized RANs with Slingshot Nikita Lazarev, Tao Ji, Anuj Kalia, Daehyeok Kim, Ilias Marinos, Francis Y. Yan, Christina Delimitrou, Zhiru Zhang, Aditya Akella ACM Special Interest Group on Data Communication Conference (SIGCOMM ’23) | September 2023 Project
Publication PACE-LM: Prompting and Augmentation for Calibrated Confidence Estimation with GPT-4 in Cloud Incident Root Cause Analysis Dylan Zhang, Xuchao Zhang, Chetan Bansal, Pedro Las-Casas, Rodrigo Fonseca, Saravan Rajmohan September 2023
Publication Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina Zhuolong Yu, Bowen Su, Wei Bai, Shachar Raindel , Vladimir Braverman, Xin Jin SIGCOMM 2023 | September 2023 Project
Publication DBO: Fairness for Cloud-Hosted Financial Exchanges Eashan Gupta, Prateesh Goyal, Ilias Marinos, Chenxingyu Zhao, Radhika Mittal, Ranveer Chandra ACM SIGCOMM | September 2023
Publication An Adaptive and Robust Deep Learning Framework for THz Ultra-Massive MIMO Channel Estimation Wentao Yu, Yifei Shen, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled Ben Letaief IEEE Journal of Selected Topics in Signal Processing | September 2023 Project: Mean field asymptotics in machine learning and networking