Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications
Publication Towards Cloud Efficiency with Large-scale Workload Characterization A. Parayil, Jue Zhang, Xiaoting Qin, Íñigo Goiri, Chetan Bansal ICPE | May 2025
Publication Automated Service Design with Cerulean (Project Showcase) Vaastav Anand, Alok Kumbhare, Celine Irvene, Chetan Bansal, Gagan Somashekar, Jonathan Mace, Pedro Las-Casas, Ricardo Bianchini, Rodrigo Fonseca 2025 IEEE/ACM International Workshop on Cloud Intelligence & AIOps (AIOps) | May 2025, pp. 1-3
Publication Storage Class Memory is Dead, All Hail Managed-Retention Memory: Rethinking Memory for the AI Era Sergey Legtchenko, Ioan Stefanovici, Richard Black, Ant Rowstron, Junyi Liu, Paolo Costa, Burcu Canakci, Dushyanth Narayanan, Xingbo Wu The ACM SIGOPS 20th Workshop on Hot Topics in Operating Systems | May 2025
Publication Good things come in small packages: Should we build AI clusters with Lite-GPUs? Burcu Canakci, Junyi Liu, Xingbo Wu, Nathanael Cheriere, Paolo Costa, Sergey Legtchenko, Dushyanth Narayanan, Ant Rowstron The ACM SIGOPS 20th Workshop on Hot Topics in Operating Systems | May 2025
Publication Robust Optical Transceiver Manipulation in Cluttered Cable Environments Using 3D Scene Understanding and Planning Iason Sarantopoulos, Chenyu Liu, Bohong Weng, Sicheng Xu, Yizhong Zhang, Jiaolong Yang, Xin Tong, Fabian Otto, David Sweeney, Andromachi Chatzieleftheriou, Ant Rowstron 2025 IEEE International Conference on Robotics and Automation | May 2025
Publication Rollbaccine: Herd Immunity against Storage Rollback Attacks in TEEs David C. Y. Chu, Aditya Balasubramanian, Dee Bao, Natacha Crooks, Heidi Howard, Lucky E. Katahanas, Soujanya Ponnapalli May 2025
Publication CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang EuroSys 2025 | April 2025 EuroSys Best Paper Award
Publication TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms Jovan Stojkovic, Chaojie Zhang, Íñigo Goiri, Esha Choukse, Haoran Qiu, Rodrigo Fonseca, Josep Torrellas, Ricardo Bianchini ASPLOS | April 2025
Publication Towards Energy Efficient 5G vRAN Servers Anuj Kalia, Nikita Lazarev, Leyang Xue, Xenofon Foukas, Bozidar Radunovic, Francis Y. Yan NSDI | April 2025
Publication Enabling Silent Telemetry Data Transmission with InvisiFlow Yinda Zhang, Liangcheng Yu, Gianni Antichi, Ran Ben Basat, Vincent Liu NSDI 2025 | April 2025