Hi! I am currently a second-year Ph.D. student in the Department of Computer Science and Engineering of Shanghai Jiao Tong University, advised by Prof. Jingwen Leng.
Before that, I obtained my master's degree from the School of Information Science and Technology (SIST), ShanghaiTech University, where I conducted research under the supervision of Prof. Xin Lou and Dr. Yuanfeng Wang. I received my B.E. degree from the College of Architecture & Environment, Sichuan University, in 2020.
My research interests include Computer Architecture, with a focus on LLM Quantization, GPGPU Micro-architecture, and Hardware/Software Co-Design.
News!
Publication
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type
Weiming Hu, Haoyan Zhang, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng
International Symposium on High-Performance Computer Architecture (HPCA 2025). To appear.
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
Cong Guo*, Jiaming Tang*, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu
Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA 2023)
Teaching
Education
2023.09 - Now, Ph.D. student in Computer Science, Department of Computer Science and Engineering, Shanghai Jiao Tong University.
2020.09 - 2023.06, M.S. in Computer Science, School of Information Science and Technology, ShanghaiTech University.
2016.09 - 2020.06, B.E. in Civil Engineering, College of Architecture & Environment, Sichuan University.