Guoxi Zhang

prof_pic.jpg

Beijing Institute for General Artificial Intelligence,

2 Yiheyuan Road, Haidian,

Beijing, China 100089.

I am a research scientist at the Beijing Institute for General Artificial Intelligence. Previously, I received my PhD from the Graduate School of Informatics at Kyoto University, where I was fortunate to work with Prof. Hisashi Kashima.

My research lies at the intersection of reinforcement learning, embodied AI, and robotics. I am interested in building learning systems that acquire robust and generalizable behaviors from offline data, human feedback, visual observations, and autonomous interaction.

My recent work focuses on reward learning, vision-language-guided robot learning, neuro-symbolic reinforcement learning, and reliable long-horizon memory for embodied agents.

Research Interests
Reinforcement learning, embodied AI, robot learning, reward learning, vision-language models, neuro-symbolic learning

News

May 9, 2026 Our paper, SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning, has been accepted by ICML 2026.
Mar 2, 2026 Our paper, MVR: Multi-view Video Reward Shaping for Reinforcement Learning, has been accepted by ICLR 2026.

Selected Publications

  1. SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning
    Lirui Luo, Guoxi Zhang, Hongming Xu, Cong Fang, and Qing Li
    In Forty-Third International Conference on Machine Learning, 2026
  2. MVR: Multi-view Video Reward Shaping for Reinforcement Learning
    Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, and Qing Li
    In The Fourteenth International Conference on Learning Representations, 2026
  3. SYNERGAI: Perception Alignment for Human-Robot Collaboration
    Yixin Chen, Guoxi Zhang, Yaowei Zhang, Hongming Xu, Peiyuan Zhi, Qing Li, and Siyuan Huang
    In 2025 IEEE International Conference on Robotics and Automation (ICRA), 2025
  4. End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations  Spotlight (top 3.5%)
    Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, and Qing Li
    In Proceedings of the Forty-First International Conference on Machine Learning, 2024