profile photo

Zhuofu Chen

Hi there! I am an undergraduate student at Tongji University, enrolled in Computer Science Elite Class 2021. Currently, I am fortunate to work with Prof. Zhihao Jia at CMU Catalyst as a research intern.

Previously, I also had a wonderful time as an undergraduate researcher advised by Prof. Xingda Wei and Prof. Rong Chen at SJTU IPADS. My research journey began at Tongji University, under the mentorship of Prof. Zhijun Ding, where I first found my aspiration to become a system 'watchman'.

I'm also looking for a Ph.D. position in the area of systems in 2025 Fall.

 ~  Email  |  CV  |  Github  |  Scholar  ~ 


Interests

My general purpose is to redesign next-generation datacenter/cloud operating systems to bridge the gap between evolving hardware and emerging needs of software, with a current focus on serving machine learning workloads and supporting compound AI systems.


Publications
(* indicates equal contribution)

AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding
Zikun Li*,  Zhuofu Chen*,  Remi Delacourt,  Gabriele Oliaro,  Zeyu Wang,  Qinghan Chen,  Shuhuai Lin,  April Yang,  Zhihao Zhang,  Zhuoming Chen,  Sean Lai,  Xupeng Miao,  Zhihao Jia 
under review. [paper] [code]
Large language model serving; SLO customization; Speculative decoding.

Characterizing Network Requirements for GPU API Remoting in AI Applications
Tianxia Wang*,  Zhuofu Chen*,  Xingda Wei,  Jinyu Gu,  Rong Chen,  Haibo Chen 
under review. [paper] [code]
GPU disaggregation; Transparent API remoting; Proxy Optimization.

TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Lijie Yang*,  Zhihao Zhang*,  Zhuofu Chen,  Zikun Li,  Zhihao Jia 
International Conference on Learning Representations (ICLR), 2025. [paper] [code]
Sparse attention; Efficient decoding.


Updated at Nov. 2024
This web is a modification to Rishab Khincha's