Global ETD Search

Model-based active learning in hierarchical policies

Hierarchical task decompositions play an essential role in the design of complex simulation and decision systems, such as those that arise in video games. Game designers find it natural to adopt a divide-and-conquer philosophy of specifying hierarchical policies, where decision modules can be constructed somewhat independently. Choosing the parameters of these modules manually is typically lengthy and tedious. The hierarchical reinforcement learning (HRL) field has produced elegant ways of decomposing policies and value functions using semi-Markov decision processes. However, there is still a lack of demonstrations in larger nonlinear systems with discrete and continuous variables. To narrow this gap between industrial practices and academic ideas, we address the problem of designing efficient algorithms to facilitate the deployment of HRL ideas in more realistic settings. In particular, we propose Bayesian active learning methods to learn the relevant aspects of either policies or value functions by focusing on the most relevant parts of the parameter and state spaces, respectively. To demonstrate the scalability of our solution, we have applied it to The Open Racing Car Simulator (TORCS), a 3D game engine that implements complex vehicle dynamics. The environment is a large topological map roughly based on downtown Vancouver, British Columbia. Higher-level abstract tasks are also learned in this process using a model-based extension of the MAXQ algorithm. Our solution demonstrates how HRL can be scaled to large applications with complex, discrete and continuous nonlinear dynamics.

Science, Faculty of / Computer Science, Department of / Graduate

http://hdl.handle.net/2429/737

Hierarchical Reinforcement Learning

Decision Theory

Bayesian Active Learning

Robotics

Identifier	oai:union.ndltd.org:UBC/oai:circle.library.ubc.ca:2429/737
Date	05 1900
Creators	Cora, Vlad M.
Publisher	University of British Columbia
Source Sets	University of British Columbia
Language	English
Detected Language	English
Type	Text, Thesis/Dissertation
Format	1153699 bytes, application/pdf
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International, http://creativecommons.org/licenses/by-nc-nd/4.0/
