I am a CS Ph.D. candidate at the University of Pennsylvania (defending 2026).
I am seeking full-time industry roles starting in 2026.
I build GPU scheduling, agentic RL post-training, and inference systems for large-scale LLMs. At Alibaba, I shipped Partial Overlapping, a high-priority feature in ROLL, for production RL training of models with hundreds of billions of parameters on thousands of GPUs. I also open-sourced RLix and contributed to the ROME technical report. My work spans vLLM, Megatron-LM, and Ray.
Please visit taoluo.net for more information.