About me

I am a PhD student in Computer Science at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor’s and master’s degrees from the Department of Automation at Tsinghua University and CSE at HKUST, respectively. My current research interests lie in Foundation Models for Agents, Trustworthy LLMs/VLMs, and Deep Reinforcement Learning. My long-term goal is to develop agent foundation models with strong perception, reasoning, and planning capabilities, enabling scalable and reliable autonomous systems.

Prior to my PhD, I was fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Noitom Robotics, formerly Tencent AI Lab/Robotics X Lab), and Prof. Meng Fang (University of Liverpool).

News

  • 🌟 (2026.2) We released GUI-Libra, a data-efficient post-training recipe for GUI agents that uses 81K open-source data to achieve strong performance in online environments. Check out our paper and code for more details!
  • 🎉 (2026.1) BEAT and DROCO are accepted to ICLR 2026. Congrats to all co-authors!
  • 🎉 (2025.11) MiCRo won the EMNLP 2025 Outstanding Paper Award. Huge congrats to the team!
  • 🌟 (2025.11) Check out BEAT, our new paper on visual backdoor attacks against VLM-based embodied agents!
  • 🌟 (2025.10) We released Embodied Reasoning Agent (ERA), a training recipe for VLM-based embodied agents with enhanced reasoning and grounding capability. Explore more on our project page.
  • 🎉 (2025.9) GUI-Actor and ADG are accepted to NeurIPS 2025! MergeBench is accepted to the Datasets & Benchmarks Track! Congrats to all co-authors!
  • 🎉 (2025.8) MiCRo is accepted to the EMNLP 2025 main conference with an award nomination!
  • 🌟 (2025.6) We’ve released GUI-Actor, a novel GUI grounding model that combines an attention-based action head with a grounding verifier. Explore more on our project page!
  • 💻 (2025.5) Starting my internship at Microsoft Research, Redmond.
  • 🎉 (2025.5) EmbodiedBench is accepted to ICML 2025 as an oral paper! Thanks to my co-authors!

Selected Projects (Led or Co-Led)

Multimodal GUI Agent and Embodied Agent

Multimodal Math Reasoning

ML for LLMs

Robust Offline RL

Goal-conditioned RL


Experiences

  • Microsoft: Research Intern at Microsoft Research, Redmond, 2025.

  • Tencent: Research Intern at Tencent AI Lab and Robotics X Lab, 2020-2022 (multiple internship terms).

  • Meituan: Machine Learning Intern at Meituan, 2019.

Services

Conference Reviewer: ICML, ICLR, NeurIPS (Top Reviewer at NeurIPS 2023), ACL/ARR, ICRA, AAMAS.

Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Neural Networks and Learning Systems (TNNLS), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.

Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST.

Hobbies

In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 min and a full marathon (42.195 km) in 3 h 36 min. However, since starting my PhD I haven’t had time for regular running training, so I’ve let it slide. Hopefully I’ll get a chance to update my record once I graduate🙂.