Global ETD Search

Return to search

Automated discovery of options in reinforcement learning

AI planning benefits greatly from the use of temporally-extended or macro-actions. Macro-actions allow for faster and more efficient planning as well as the reuse of knowledge from previous solutions. In recent years, a significant amount of research has been devoted to incorporating macro-actions in learned controllers, particularly in the context of Reinforcement Learning. One general approach is the use of options (temporally-extended actions) in Reinforcement Learning [22]. While the properties of options are well understood, it is not clear how to find new options automatically. In this thesis we propose two new algorithms for discovering options and compare them to one algorithm from the literature. We also contribute a new algorithm for learning with options which improves on the performance of two widely used learning algorithms. Extensive experiments are used to demonstrate the effectiveness of the proposed algorithms.

http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=80881

Artificial Intelligence.

Computer Science.

Identifer	oai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:QMM.80881
Date	January 2004
Creators	Stolle, Martin
Contributors	Precup, Doina (advisor)
Publisher	McGill University
Source Sets	Library and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada
Language	English
Detected Language	English
Type	Electronic Thesis or Dissertation
Format	application/pdf
Coverage	Master of Science (School of Computer Science.)
Rights	All items in eScholarship@McGill are protected by copyright with all rights reserved unless otherwise indicated.
Relation	alephsysno: 002085355, proquestno: AAIMQ98746, Theses scanned by UMI/ProQuest.

Page generated in 0.0018 seconds

Automated discovery of options in reinforcement learning

Description

Links & Downloads

Tags

Additional Fields