Guoxi Zhang

Graduate School of Informatics, Kyoto University
Yoshidahonmachi, Sakyo Ward,
Kyoto, Japan 606-8501
I am a PhD student in the Graduate School of Informatics, Kyoto University, where I am fortunate to work with Prof. Hisashi Kashima. I am interested in leveraging human guidance to train reinforcement learning agents. In particular, I have been working on preference-based reinforcement learning, which aims to train agents that comply with human preferences.
I am actively seeking researcher roles starting in October 2023.