Global ETD Search

CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA / CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING

Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1
Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5)
Previous issue date: 2012-09-28 / Coordenação de Aperfeiçoamento de Pessoal de Nível Superior / The union of methodologies for optimal control and dynamics programming has
stimulated the development of algorithms for realization of discrete control
systems of the type linear quadratic regulator (DLQR). The methodology is based
on reinforcement learning methods based on temporal differences and
approximate dynamic programming. The proposed method combines the
approach of the value function by method RLS (recursive least squares) and
approximate policy iteration schemes heuristic dynamic programming (HDP). The
approach is directed to the assessment of convergence of the solution DLQR and
the heuristic weighting matrices 􀜳 and 􀜴 of the utility function associated with
DLQR. The investigation of convergence properties related to consistency,
persistent excitation and polarization of the RLS estimator is performed. The
methodology involved in a project achievements online DLQR controllers and is
evaluated in a fourth order multivariable dynamic system. / A união das metodologias de controle ótimo e de programação dinâmica tem
impulsionado o desenvolvimento de algoritmos para realizações de sistemas de
controle discreto do tipo regulador linear quadrático (DLQR). A metodologia
utilizada neste trabalho é fundamentada sobre métodos de aprendizagem por
reforço baseados em diferenças temporais e programação dinâmica aproximada.
O método proposto combina a aproximação da função valor através do método
RLS (mínimos quadrados recursivos) e iteração de política aproximada em
esquemas de programação dinâmica heurística (HDP). A abordagem é orientada
para a avaliação da convergência da solução DLQR e para a sintonia heurística
das matrizes de ponderação 􀜳 e 􀜴da função de utilidade associada ao DLQR. É
realizada a investigação das propriedades de convergência relacionadas à
consistência, excitação persistente e polarização do estimador RLS. A
metodologia contempla realizações de projetos de forma online de controladores
DLQR e é avaliada em um sistema dinâmico multivariável de quarta ordem.

Programação Dinâmica Heurística

Controle Multivariável

Controle Ótimo

Regulador Quadrático Linear Discreto

Mínimos Quadrados Recursivos

Controle Digital

Heuristic Dynamic Programming

Multivariable Control

Optimal Control

Discrete Linear Quadratic Regulator

Recursive Least Squares

Digital Control

Identifer	oai:union.ndltd.org:IBICT/oai:tede2:tede/494
Date	28 September 2012
Creators	Maciel, Allan James Ferreira
Contributors	Fonseca Neto, João Viana da, Serra, Ginalber Luiz de Oliveira
Publisher	Universidade Federal do Maranhão, PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET, UFMA, BR, Engenharia
Source Sets	IBICT Brazilian ETDs
Language	Portuguese
Detected Language	Portuguese
Type	info:eu-repo/semantics/publishedVersion, info:eu-repo/semantics/masterThesis
Format	application/pdf
Source	reponame:Biblioteca Digital de Teses e Dissertações da UFMA, instname:Universidade Federal do Maranhão, instacron:UFMA
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0018 seconds

CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA / CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING

Description

Links & Downloads

Tags

Additional Fields