Global ETD Search

Return to search

Coping with the Curse of Dimensionality by Combining Linear Programming and Reinforcement Learning

Reinforcement learning techniques offer a very powerful method of finding solutions in unpredictable problem environments where human supervision is not possible. However, in many real world situations, the state space needed to represent the solutions becomes so large that using these methods becomes infeasible. Often the vast majority of these states are not valuable in finding the optimal solution. This work introduces a novel method of using linear programming to identify and represent the small area of the state space that is most likely to lead to a near-optimal solution, significantly reducing the memory requirements and time needed to arrive at a solution. An empirical study is provided to show the validity of this method with respect to a specific problem in vehicle dispatching. This study demonstrates that, in problems that are too large for a traditional reinforcement learning agent, this new approach yields solutions that are a significant improvement over other nonlearning methods. In addition, this new method is shown to be robust to changing conditions both during training and execution. Finally, some areas of future work are outlined to introduce how this new approach might be applied to additional problems and environments.

linear programming

machine learning for scheduling

reinforcement learning

scheduling under uncertainty

vehicle dispatching

Computer Engineering

Operational Research

Identifer	oai:union.ndltd.org:UTAHS/oai:digitalcommons.usu.edu:etd-1555
Date	01 May 2010
Creators	Burton, Scott H.
Publisher	DigitalCommons@USU
Source Sets	Utah State University
Detected Language	English
Type	text
Format	application/pdf
Source	All Graduate Theses and Dissertations
Rights	Copyright for this work is held by the author. Transmission or reproduction of materials protected by copyright beyond that allowed by fair use requires the written permission of the copyright owners. Works not in the public domain cannot be commercially exploited without permission of the copyright owner. Responsibility for any use rests exclusively with the user. For more information contact Andrew Wesolek (andrew.wesolek@usu.edu).

Page generated in 0.0021 seconds

Coping with the Curse of Dimensionality by Combining Linear Programming and Reinforcement Learning

Description

Links & Downloads

Tags

Additional Fields