Zhuofu Chen

Hi there! I am a first-year CS Ph.D. student at Princeton advised by Prof. Ravi Netravali. I obtained my Bachelor's degree from Tongji University, enrolled in Computer Science Elite Class 2025.

Previously, I was fortunate to work with Prof. Zhihao Jia at CMU Catalyst as a research intern. I also had a wonderful internship at SJTU IPADS advised by Prof. Xingda Wei and Prof. Rong Chen. My academic journey began at Tongji University, under the mentorship of Prof. Zhijun Ding.

Feel free to drop me an e-mail if you would like to connect!

~ Email | CV | Github | Scholar ~

Interests

My general purpose is to redesign next-generation datacenter/cloud operating systems to bridge the gap between evolving hardware and emerging needs of software, with a current focus on serving machine learning workloads and supporting compound AI systems.

Publications

(* indicates equal contribution)

Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs
Rui Pan, Zhuofu Chen, Hongyi Liu, Arvind Krishnamurthy, Ravi Netravali
under review. [paper] [code]

Aragog: Just-in-Time Model Routing for Scalable Serving of Agentic Workflows
Yinwei Dai, Zhuofu Chen, Anand Iyer, Ravi Netravali
under review. [paper]

Kimi K2: Open Agentic Intelligence
Kimi Team (was part of the project while interning at Kimi in spring 2025)
[paper] [code]

AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding
Zikun Li*, Zhuofu Chen*, Remi Delacourt, Gabriele Oliaro, Zeyu Wang, Qinghan Chen, Shuhuai Lin, April Yang, Zhihao Zhang, Zhuoming Chen, Sean Lai, Xupeng Miao, Zhihao Jia
Proceedings of the European Conference on Computer Systems (EuroSys), 2026. [paper] [code]

Characterizing Network Requirements for GPU API Remoting in AI Applications
Tianxia Wang*, Zhuofu Chen*, Xingda Wei, Jinyu Gu, Rong Chen, Haibo Chen
under review. [paper] [code]

TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Lijie Yang*, Zhihao Zhang*, Zhuofu Chen, Zikun Li, Zhihao Jia
International Conference on Learning Representations (ICLR), 2025. [paper] [code]

Updated at Nov. 2025

This web is a modification to Rishab Khincha's