
Utilizing Trajectory Optimization in the Training of Neural Network Controllers

Applying reinforcement learning to control systems makes it possible to develop elegant and efficient control laws through machine learning. Coupled with the representational power of neural networks, reinforcement learning algorithms can learn complex policies that are difficult to emulate with traditional control system design approaches. In this thesis, three model-free reinforcement learning algorithms, Monte Carlo Control, REINFORCE with baseline, and Guided Policy Search, are compared in simulated, continuous action-space environments. The results show that Guided Policy Search learns a desired control policy much faster than the other algorithms: up to three times faster on the inverted pendulum system, and up to nearly fifteen times faster on the cartpole system.
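
As a rough illustration of one of the algorithms named in the abstract, the following is a minimal REINFORCE-with-baseline sketch for a continuous action space, assuming a linear-Gaussian policy and a hand-rolled inverted pendulum. The dynamics constants, torque limit, feature choice, and learning rates are illustrative assumptions, not the thesis implementation.

import numpy as np

rng = np.random.default_rng(0)

def pendulum_step(state, torque, dt=0.05, g=9.81, m=1.0, l=1.0):
    # One Euler step of a torque-driven pendulum; theta = 0 is upright.
    theta, theta_dot = state
    torque = float(np.clip(torque, -2.0, 2.0))
    theta_ddot = (g / l) * np.sin(theta) + torque / (m * l ** 2)
    theta_dot += theta_ddot * dt
    theta += theta_dot * dt
    reward = -(theta ** 2 + 0.1 * theta_dot ** 2 + 0.001 * torque ** 2)
    return np.array([theta, theta_dot]), reward

def features(state):
    # State features shared by the policy mean and the value baseline.
    theta, theta_dot = state
    return np.array([theta, theta_dot, 1.0])

# Gaussian policy a ~ N(w_pi . phi(s), sigma^2); linear baseline V(s) = w_v . phi(s).
w_pi = np.zeros(3)
w_v = np.zeros(3)
sigma, gamma = 0.5, 0.99
alpha_pi, alpha_v = 1e-4, 1e-3

for episode in range(2000):
    state = np.array([0.1 * (2.0 * rng.random() - 1.0), 0.0])  # start near upright
    trajectory = []
    for t in range(200):
        phi = features(state)
        mean = float(w_pi @ phi)
        action = mean + sigma * rng.standard_normal()
        state, reward = pendulum_step(state, action)
        trajectory.append((phi, action, mean, reward))

    # Monte Carlo returns accumulated backward, then policy-gradient updates with
    # the learned value baseline subtracted from each return.
    G = 0.0
    for phi, action, mean, reward in reversed(trajectory):
        G = reward + gamma * G
        advantage = G - float(w_v @ phi)
        # Score function of a Gaussian policy: (a - mu) / sigma^2 * phi(s).
        w_pi += alpha_pi * advantage * (action - mean) / sigma ** 2 * phi
        w_v += alpha_v * advantage * phi

    if episode % 200 == 0:
        print(f"episode {episode:4d}  return {sum(r for *_, r in trajectory):9.2f}")

The printed per-episode return should rise toward zero as the policy learns to hold the pendulum upright; a neural network policy would replace the linear feature map with a learned representation, as described in the thesis.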

Identifier: oai:union.ndltd.org:CALPOLY/oai:digitalcommons.calpoly.edu:theses-3517
Date: 01 September 2019
Creators: Kimball, Nicholas
Publisher: DigitalCommons@CalPoly
Source Sets: California Polytechnic State University
Detected Language: English
Type: text
Format: application/pdf
Source: Master's Theses
