About me
I am a PhD student in Computer Science at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor’s and master’s degrees from the Department of Automation at Tsinghua University and CSE, HKUST. My current research interests lie in Foundation Models for Agents, Trustworthy LLMs/VLMs, and Deep Reinforcement Learning. My long-term goal is to develop agent foundation models with strong perception, reasoning, and planning capabilities, enabling scalable and reliable autonomous systems.
Prior to my PhD, I was fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Noitom Robotics, formerly Tencent AI Lab), and Prof. Meng Fang (University of Liverpool).
News
- 🌟 (2026.2) We released GUI-Libra, a data-efficient post-training recipe for GUI agents that uses 81K open-source data samples to achieve strong performance in online environments. Check out our paper and code for more details!
- 🎉 (2026.1) BEAT and DROCO are accepted to ICLR 2026. Congrats to all co-authors!
- 🎉 (2025.11) MiCRo won the EMNLP 2025 Outstanding Paper Award. Huge congrats to the team!
- 🌟 (2025.11) Check out BEAT, our new paper on visual backdoor attacks against VLM-based embodied agents!
- 🌟 (2025.10) We released Embodied Reasoning Agent (ERA), a training recipe for VLM-based embodied agents with enhanced reasoning and grounding capability. Explore more on our project page.
- 🎉 (2025.9) GUI-Actor and ADG are accepted to NeurIPS 2025! MergeBench is accepted to the Datasets & Benchmarks Track! Congrats to all co-authors!
- 🎉 (2025.8) MiCRo is accepted to the EMNLP 2025 main conference with an award nomination!
- 🌟 (2025.6) We’ve released GUI-Actor, a novel GUI grounding model that combines an attention-based action head with a grounding verifier. Explore more on our project page!
- 💻 (2025.5) Starting my internship at Microsoft Research, Redmond.
- 🎉 (2025.5) EmbodiedBench is accepted to ICML 2025 as an oral paper! Thanks to my co-authors!
Selected Projects (Led or Co-Led)
Multimodal GUI Agent and Embodied Agent
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL. Preprint 2026. [code] [website]
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning. Preprint 2025. [code] [website]
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents. ICML 2025 (Oral). [code] [website]
GUI-Actor: Attention-based Grounding with Verifiable Action Head for GUI Agents. NeurIPS 2025. [code] [website]
Multimodal Math Reasoning
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models. ICLR 2025. [code] [website]
ML for LLMs
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs. NeurIPS 2024. [code]
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment. ICML 2024. [code]
Rethinking Diverse Human Preference Learning through Principal Component Analysis. ACL 2025 (Findings).
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning. EMNLP 2025 (Main, Outstanding Paper Award).
Robust Offline RL
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling. ICLR 2025.
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. ICLR 2024 (Spotlight). [code]
Corruption-Robust Offline Reinforcement Learning with General Function Approximation. NeurIPS 2023. [code]
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022 (Spotlight). [code]
Goal-conditioned RL
What Is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023. [code]
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022. [code]
Experiences
Research Intern at Microsoft Research, Redmond, 2025. Working on GUI agents and embodied AI.
Research Intern at Tencent AI Lab and Robotics X Lab, 2020-2022 (Multiple internship terms).
Machine Learning Intern at Meituan, 2019.
Services
Conference Reviewer: ICML, ICLR, NeurIPS (Top Reviewer in NeurIPS 2023), ACL/ARR, ICRA, AAMAS.
Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Neural Networks and Learning Systems (TNNLS), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.
Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST.
Hobbies
In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 min and a full marathon (42.195 km) in 3 h 36 min. Since starting my PhD, however, I haven’t had time for regular training, so I’ve let it slide. Hopefully I’ll get a chance to update my records once I graduate 🙂.