About me
I’m a postgraduate student in the Department of Computer Science and Engineering (CSE) at the Hong Kong University of Science and Technology, supervised by Prof. Tong Zhang. I received my master’s and bachelor’s degrees from the Department of Automation at Tsinghua University. I am fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Tencent AI Lab), and Prof. Meng Fang (University of Liverpool). My research interests lie in deep reinforcement learning (RL), especially goal-conditioned RL, offline RL, model-based RL, and the application of RL algorithms to game AI and robotics.
Currently, I am researching ways to improve the robustness and generalization of deep reinforcement learning. Feel free to contact me by email if you are interested in discussing these topics or collaborating.
Publications
Preprint
Rui Yang, Xiaoman Pan, Feng Luo, Shuang Qiu, Han Zhong, Dong Yu, Jianshu Chen
Conference Papers
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. International Conference on Learning Representations (ICLR) 2024. $\color{red}{\text{(Spotlight)}}$
Rui Yang $^*$, Han Zhong $^*$, Jiawei Xu $^*$, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang.
Corruption-Robust Offline Reinforcement Learning with General Function Approximation. Neural Information Processing Systems (NeurIPS) 2023.
Chenlu Ye $^*$, Rui Yang $^*$, Quanquan Gu, Tong Zhang.
What Is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? International Conference on Machine Learning (ICML) 2023.
Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang.
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. Neural Information Processing Systems (NeurIPS) 2022. $\color{red}{\text{(Spotlight)}}$
Rui Yang $^*$, Chenjia Bai $^*$, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han.
Exploiting Reward Shifting in Value-Based Deep RL. Neural Information Processing Systems (NeurIPS) 2022.
Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou.
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. International Conference on Learning Representations (ICLR), 2022.
Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang.
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization. International Conference on Learning Representations (ICLR), 2021.
Lanqing Li, Rui Yang, Dijun Luo.
Workshop Papers
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models. NeurIPS 2023 GCRL Workshop. $\color{red}{\text{(Spotlight)}}$
Mianchu Wang $^*$, Rui Yang $^*$, Xi Chen, Meng Fang.
MHER: Model-based Hindsight Experience Replay. NeurIPS 2021 Workshop DeepRL.
Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li.
Journal Papers
Efficient Multi-Goal Reinforcement Learning via Value Consistency Prioritization. Journal of Artificial Intelligence Research (JAIR), 2023.
Jiawei Xu $^*$, Shuxing Li $^*$, Rui Yang, Chun Yuan, Lei Han.
A survey on sparse reward algorithms in reinforcement learning-theory and experiment. CAAI Transactions on Intelligent Systems (in Chinese), 2020.
Rui Yang, Jiangpeng Yan, Xiu Li.
Education
- 2022.09 - present, postgraduate student, Department of Computer Science and Engineering, the Hong Kong University of Science and Technology.
- 2019.09 - 2022.07, Master, Department of Automation, Tsinghua University.
- 2015.09 - 2019.07, Bachelor, Department of Automation, Tsinghua University.
Experience
Tencent Robotics X (Oct. 2020 – Jun. 2021)
Researched efficient goal-conditioned RL algorithms.
Tencent AI Lab (Jun. 2020 – Sep. 2020)
Researched multi-step HER and offline meta-RL.
Meituan Financial Service Group (Jul. 2019 – Sep. 2019)
Controlled the advertising delivery speed of the Meituan app using a PI-control pacing algorithm.
Service
Conference Reviewer: ICML (2022, 2024), ICLR (2024), NeurIPS (2022, 2023 $\color{red}{\text{Top Reviewer}}$), ICRA (2023), AAMAS (2024).
Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning.
Others
In my leisure time, I enjoy sports such as running, table tennis, and swimming. I used to be an amateur long-distance runner at Tsinghua University. I have finished a half marathon (21.0975 km) in 1h30min and a marathon (42.195 km) in 3h36min.