Global ETD Search

Return to search

Action selection in modular reinforcement learning

Modular reinforcement learning is an approach to resolve the curse of dimensionality problem in traditional reinforcement learning. We design and implement a modular reinforcement learning algorithm, which is based on three major components: Markov decision process decomposition, module training, and global action selection. We define and formalize module class and module instance concepts in decomposition step. Under our framework of decomposition, we train each modules efficiently using SARSA($\lambda$) algorithm. Then we design, implement, test, and compare three action selection algorithms based on different heuristics: Module Combination, Module Selection, and Module Voting. For last two algorithms, we propose a method to calculate module weights efficiently, by using standard deviation of Q-values of each module. We show that Module Combination and Module Voting algorithms produce satisfactory performance in our test domain. / text

http://hdl.handle.net/2152/25916

Modular reinforcement learning

Action selection

Module weight

Identifer	oai:union.ndltd.org:UTEXAS/oai:repositories.lib.utexas.edu:2152/25916
Date	16 September 2014
Creators	Zhang, Ruohan
Source Sets	University of Texas
Language	English
Detected Language	English
Type	Thesis
Format	application/pdf

Page generated in 0.0023 seconds

Action selection in modular reinforcement learning

Description

Links & Downloads

Tags

Additional Fields