Guoxi Zhang
Beijing Institute for General Artificial Intelligence,
2 Yiheyuan Road, Haidian,
Beijing, China 100089.
I am a research scientist at the Beijing Institute for General Artificial Intelligence. Previously, I received my PhD from the Graduate School of Informatics at Kyoto University, where I was fortunate to work with Prof. Hisashi Kashima.
My research lies at the intersection of reinforcement learning, embodied AI, and robotics. I am interested in building learning systems that acquire robust and generalizable behaviors from offline data, human feedback, visual observations, and autonomous interaction.
My recent work focuses on reward learning, vision-language-guided robot learning, neuro-symbolic reinforcement learning, and reliable long-horizon memory for embodied agents.
Research Interests
Reinforcement learning, embodied AI, robot learning, reward learning, vision-language models, neuro-symbolic learning
News
| May 9, 2026 | Our paper, SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning, has been accepted by ICML 2026. |
|---|---|
| Mar 2, 2026 | Our paper, MVR: Multi-view Video Reward Shaping for Reinforcement Learning, has been accepted by ICLR 2026. |



