Guoxi Zhang
Beijing Institute for General Artificial Intelligence,
2 Yiheyuan Road, Haidian,
Beijing, China 100089.
I am senior research engineer at Beijing Institute for General Artificial Intelligence. Previously, I was a PhD student in Graduate School of Informatics, Kyoto university, where I have the fortune to work with Prof. Hisashi Kashima.
I am interested in leveraging human guidance for reinforcement learning. In particular, I have been working on preference-based reinforcement learning, which aims at training agents that comply with human preferences.