Global ETD Search

About
Global ETD Search is a free service that helps researchers find electronic theses and dissertations (ETDs). It is provided by the Networked Digital Library of Theses and Dissertations (NDLTD).
Our metadata is collected from universities around the world. If you manage a university, consortium, or national archive and want it to be added, see the NDLTD website for details.

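1	Offline Reinforcement Learning from Imperfect Human Guidance / 不完全な人間の誘導からのオフライン強化学習
Zhang, Guoxi
24 July 2023
Kyoto University / New system, course-based doctorate / Doctor of Informatics / Degree No. Kō 24856 / Informatics Doctorate (情博) No. 838 / 新制||情||140 (Main Library) / Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University / (Chief examiner) Professor Hisashi Kashima, Professor Tatsuya Kawahara, Professor Jun Morimoto / Conferred under Article 4, Paragraph 1 of the Degree Regulations / Doctor of Informatics / Kyoto University / DFAM
Offline Reinforcement Learning; Preference-based Reinforcement Learning; Human-in-the-loop Reinforcement Learning
007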