Selected Publications

Full list of publications here.
(2024). Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference. In EMNLP ‘24.
(2024). Interpretable Analysis of Production GPU Clusters Monitoring Data via Association Rule Mining. In IPDPS ‘24.
(2023). Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service. In SC ‘23.
(2023). Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems. In SC ‘23.
(2023). Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale. In SoCC ‘23.
(2022). MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters. In SoCC ‘22.
(2022). AI-Enabling Workloads on Large-Scale GPU-Accelerated System: Characterization, Opportunities, and Implications. In HPCA ‘22.
(2022). Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models. In NAACL ‘22 Findings.
(2021). RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances. In SC ‘21.