Guoxi Zhang


Graduate School of Informatics, Kyoto University

Yoshidahonmachi, Sakyo Ward,

Kyoto, Japan 606-8501

I am a PhD student in the Graduate School of Informatics, Kyoto University, where I have the good fortune to work with Prof. Hisashi Kashima. I am interested in leveraging human guidance to train reinforcement learning agents. In particular, I have been working on preference-based reinforcement learning, which aims to train agents that comply with human preferences.

I am actively seeking researcher roles starting in Oct. 2023.