Global ETD Search

Return to search

Scaling solutions to Markov Decision Problems

The Markov Decision Problem (MDP) is a widely applied mathematical model useful for describing a wide array of real world decision problems ranging from navigation to scheduling to robotics. Existing methods for solving MDPs scale poorly when applied to large domains where there are many components and factors to consider.

In this dissertation, I study the use of non-tabular representations and human input as scaling techniques. I will show that the joint approach has desirable optimality and convergence guarantees, and demonstrates several orders of magnitude speedup over conventional tabular methods. Empirical studies of speedup were performed using several domains including a clone of the classic video game, Super Mario Bros. In the course of this work, I will address several issues including: how approximate representations can be used without losing convergence and optimality properties, how human input can be solicited to maximize speedup and user engagement, and how that input should be used so as to insulate against possible errors.

http://hdl.handle.net/1853/42906

Reinforcement learning

Machine learning

Planning

Artificial intelligence

Markov decision processes

Markov processes

Mathematical models

Dynamic programming

Identifer	oai:union.ndltd.org:GATECH/oai:smartech.gatech.edu:1853/42906
Date	14 November 2011
Creators	Zang, Peng
Publisher	Georgia Institute of Technology
Source Sets	Georgia Tech Electronic Thesis and Dissertation Archive
Detected Language	English
Type	Dissertation

Page generated in 0.0021 seconds

Scaling solutions to Markov Decision Problems

Description

Links & Downloads

Tags

Additional Fields