Guoxi Zhang


Beijing Institute for General Artificial Intelligence,

2 Yiheyuan Road, Haidian,

Beijing, China 100089.

I am a senior research engineer at the Beijing Institute for General Artificial Intelligence. Previously, I was a PhD student at the Graduate School of Informatics, Kyoto University, where I had the good fortune to work with Prof. Hisashi Kashima.

I am interested in leveraging human guidance for reinforcement learning. In particular, I have been working on preference-based reinforcement learning, which aims to train agents that comply with human preferences.