
Registered user since Tue 29 Jul 2025
I am a Senior Research Scientist at the Turbo Team, Together AI, supervised by Ben Athiwaratkun.
I am a Ph.D. at the School of Computer Science, Faculty of Engineering, The University of Sydney, supervised by Prof. Shuaiwen Leon Song. I have been fortunate to intern at Dolby, DeepSpeed Microsoft, Weixin Group Tencent, Microsoft (China), contributing to projects in building machine learning system. I was also a research associate at the School of Computer Science and Engineering, Sun Yat-sen University from 2019 to 2022, under the supervision of Prof. Dan Huang and Yutong Lu. I received my B.E. degree from the School of Computer Science and Engineering, Sun Yat-sen University in 2019.
My research spans the efficient machine learning and system, from model pertaining quality to efficient algorithms & system co-design that bridges emerging ML/LLM methods and real-world applications, improving both productivity (usable, robust stacks) and performance (throughput, memory, cost-efficiency). I work across academia and industry (Together AI), with an emphasis on LLM efficiency and scalable training infrastructure. My research is supported by the Together AI. Feel free to drop me an email if you have aligned interests. Currently, I am working on the following projects:
(* indicate the projects I am leading)
- Efficient ML algorithm:
- Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient (Efficient RL training) *
- CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention (Efficient inference) *
- SQUEEZE THINK: Multi-Model Orchestration for Efficient Recursive Self-Aggregation (Efficient Inference)
- Bio-Inspired LLM-Based Multiagent Systems (Efficient Agent Inference-time Training )
- Efficient ML System
- Tomni (RL training system) *
- Aurora: When RL Meets Adaptive Speculative Training: A Unified Training-Serving System (Speculator training system design)
- LCFS: Hierarchical Performance Isolation for Distributed LLM Serving with LLM Completely Fair Scheduler (Efficient Agent Fairness System)
- Model Related:
- CoderForge *
Contributions
2026
View general profile