Research Papers

  1. Yang Li, Changsheng Zhao, Hyungtak Lee, Ernie Chang, Yangyang Shi, Vikas Chandra.
    "Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications".
    in review at a conference, 2024.

     

  2. Yang Li*, Yuan Shangguan*, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, and Vikas Chandra.
    "Not All Weights Are Created Equal: Exploring Weight Sensitivity in Latency and Power Optimization for On-Device Speech Recognition”.
    in review at a conference, 2024.

     

  3. Ernie Chang, Matteo Paltenghi, Yang Li, Pin-Jie Lin, Changsheng Zhao, Patrick Huber, Zechun Liu, Rastislav Rabatin, Yangyang Shi, Vikas Chandra.
    "Scaling Parameter-Constrained Language Models with Quality Data".
    Conference on Empirical Methods in Natural Language Processing (EMNLP) Industry Track, 2024.

     

  4. Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra.
    "Target-Aware Language Modeling via Granular Data Sampling".
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

     

  5. Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Illias Leontiadis, Minsoo Rhu, Hsien-Hsin Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, and Edward Suh.
    "GPU-based Private Information Retrieval for On-Device Machine Learning Inference".
    ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024 (Spring round; acceptance rate: 16%).

     

  6. Jamin Seo, Yang Li, Debabrata Mohapatra, Liangzhen Lai, Hyoukjun Kwon, Tushar Krishna.
    "Memory Placement Policy Exploration for Dynamic Multi-model Multi-task ML Workloads”.
    in review at a conference, 2024.

     

  7. Yang Li*, Liangzhen Lai*, Yuan Shangguan, Forrest N. Iandola, Zhaoheng Ni, Ernie Chang, Yangyang Shi, and Vikas Chandra.
    "Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition".
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.

     

  8. Ernie Chang*, Pin-Jie Lin*, Yang Li, Sidd Srinivasan, Gael Le Lan, David Kant, Yangyang Shi, Forrest Iandola, and Vikas Chandra.
    "In-Context Prompt Editing for Conditional Audio Generation".
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.

     

  9. Y. Li, D. Wang, and José M. F. Moura.
    "GSA-Forecaster: Forecasting Graph-Based Time-Dependent Data with Graph Sequence Attention".
    in review at ACM Transactions on Knowledge Discovery from Data (TKDD), 2024.

     

  10. Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, and Michael L. Seltzer.
    "Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers".
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.

     

  11. Tong Shen, Yang Li, and José M. F. Moura.
    "Forecasting COVID-19 Dynamics: Clustering, Generalized Spatiotemporal Attention, and Impacts of Mobility and Geographic Proximity".
    IEEE International Conference on Data Engineering (ICDE), 2023 (acceptance rate: 31%).

     

  12. Y. Li and José M. F. Moura.
    "Forecaster: A Graph Transformer for Forecasting Spatial and Time-Dependent Data".
    European Conference on Artificial Intelligence (ECAI), 2020 (acceptance rate: 25%).

  13.  

  14. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, and Onur Mutlu.
    "A Scalable Priority-Aware Approach to Managing Data Center Server Power".
    IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2019 (acceptance rate: 22%).

    Technical report: "CapMaestro: exploiting power redundancy, data center-wide priorities, and stranded power for boosting data center performance," in IBM Research Report, 2018.

  15.  

  16. Cong Xu, Karthick Rajamani, Alexandre Ferreira, Wesley Felter, Juan Rubio, and Yang Li.
    "dCat: Dynamic Cache Management for Efficient, Performance-Sensitive Infrastructure-as-a-Service".
    ACM European Conference on Computer Systems (EuroSys), 2018 (acceptance rate: 16%).

  17.  

  18. Y. Li, Saugata Ghose, Jongmoo Choi, Jin Sun, Hui Wang, and Onur Mutlu.
    "Utility-based Hybrid Memory Management".
    IEEE International Conference on Cluster Computing (IEEE Cluster), 2017 (acceptance rate: 22%).

  19.  

  20. Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, and Onur Mutlu.
    "SizeCap: Efficiently Handling Power Surges for Fuel Cell Powered Data Centers".
    IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2016 (acceptance rate: 22%).

  21.  

  22. Zuochang Ye, Tianshi Wang, and Yang Li.
    "Domain-Alternated Optimization for Passive Macromodeling".
    IEEE Transactions on Very Large Scale Integration Systems (TVLSI), 2015.

  23.  

  24. Yang Li and David Z. Pan.
    "An Accurate Semi-Analytical Framework for Full-Chip TSV-induced Stress modeling".
    IEEE/ACM Design Automation Conference (DAC), 2013 (acceptance rate: 23%).

  25.  

  26. Zuochang Ye, Bichen Wu, Song Han, and Yang Li.
    "Time-Domain Segmentation based Massively Parallel Simulation for ADCs".
    IEEE/ACM Design Automation Conference (DAC), 2013 (acceptance rate: 23%).

  27.  

  28. Tianshi Wang, Yang Li, and Zuochang Ye.
    "Robust Passive Macro-Model Generation with Local Compensation".
    IEEE Transactions on Microwave Theory and Techniques (TMTT), 2012
    (see "Corrections to ‘Robust Passive Macro-Model Generation with Local Compensation'" in TMTT’13 for authorship information).

  29.  

  30. Zuochang Ye, Yang Li, Mingzhi Gao, and Zhiping Yu.
    "A Novel Framework for Passive Macromodeling".
    IEEE/ACM Design Automation Conference (DAC), 2011 (acceptance rate: 21%).