About me
I am a PhD student in Computer Science at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor’s and master’s degrees from the Department of Automation at Tsinghua University and CSE, HKUST. My current research interests lie in Foundation Models for Agents, Trustworthy LLMs/VLMs, and Deep Reinforcement Learning. My long-term goal is to develop agent foundation models with strong perception, reasoning, and planning capabilities, enabling scalable and reliable autonomous systems.
Prior to my PhD, I was fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Noitom Robotics, formerly Tencent AI Lab/Robotics X Lab), and Prof. Meng Fang (University of Liverpool).
News
- 🌟 (2026.2) We released GUI-Libra, a data-efficient post-training receipe for GUI agents that uses 81K open-source data to achieve strong performance on online environments. Check out our paper and code for more details!
- 🎉 (2026.1) BEAT and DROCO are accepted to ICLR 2026. Congrats to all co-authors!
- 🎉 (2025.11) MiCRo won the EMNLP 2025 Outstanding Paper Award. Huge congrats to the team!
- 🌟 (2025.11) Check out our new paper about visual backdoor attacks on VLM-based embodied agents BEAT!
- 🌟 (2025.10) We released Embodied Reasoning Agent (ERA), a training recipe for VLM-based embodied agents with enhanced reasoning and grounding capability. Explore more on our project page.
- 🎉 (2025.9) GUI-Actor and ADG are accepted to NeurIPS 2025! MergeBench is accepted to the Datasets & Benchmarks Track! Congrats to all co-authors!
- 🎉 (2025.8) MiCRo is accepted to EMNLP 2025 main conference with award nomination!
- 🌟 (2025.6) We’ve released GUI-Actor, a novel GUI grounding model that combines an attention-based action head with a grounding verifier. Explore more on our project page!
- 💻 (2025.5) Starting my internship at Microsoft Research, Redmond.
- 🎉 (2025.5) EmbodiedBench is accepted to ICML 2025 as an oral paper! Thanks to my co-authors!
Selected Projects (Led or Co-Led)
Multimodal GUI Agent and Embodied Agent
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL. Preprint 2026. [code] [website]
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning. Preprint 2025 [code] [website]
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents. ICML 2025 (Oral). [code] [website]
GUI-Actor: Attention-based Grounding with Verifiable Action Head for GUI Agents. NeurIPS 2025. [code] [website]
Multimodal Math Reasoning
- DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models. ICLR 2025. [code] [website]
ML for LLMs
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs. NeurIPS 2024. [code]
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment. ICML 2024. [code]
Rethinking Diverse Human Preference Learning through Principal Component Analysis. ACL 2025 (Findings).
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning. EMNLP 2025 (Main) Outstanding Paper Award.
Robust Offline RL
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling. ICLR 2025.
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. ICLR 2024. (Spotlight) [code]
Corruption-Robust Offline Reinforcement Learning with General Function Approximation. NeurIPS 2023. [code]
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022. (Spotlight) [code]
Goal-conditioned RL
What Is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?. ICML 2023. [code]
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022. [code]
Experiences
Research Intern at Microsoft Research, Redmond, 2025.
Research Intern at Tencent AI Lab and Robotics X Lab, 2020-2022 (Multiple internship terms).
Machine Learning Intern at Meituan, 2019.
Services
Conference Reviewer: ICML, ICLR, NeurIPS (Top Reviewer in NeurIPS 2023), ACL/ARR, ICRA, AAMAS.
Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Neural Networks and Learning Systems (TNNLS), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.
Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST
Hobbies
In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 min and a full marathon (42.195 km) in 3 h 36 min. However, since starting my PhD I haven’t had time for regular running training, so I’ve let it slide. Hopefully I’ll get a chance to update my record once I graduate🙂.