• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 23
  • 2
  • 2
  • 1
  • 1
  • Tagged with
  • 34
  • 34
  • 17
  • 10
  • 8
  • 7
  • 6
  • 5
  • 5
  • 4
  • 4
  • 4
  • 4
  • 4
  • 3
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
21

Online Adaptive Model-Free MIMO Control of Lighter-Than-Air Dirigible Airship

Boase, Derek 22 January 2024 (has links)
With the recent advances in the field of unmanned aerial vehicles, many applications have been identified. In tasks that require high-payload-to-weight ratios, flight times in the order of days, reduced noise and/or hovering capabilities, lighter-than-air vehicles present themselves as a competitive platform compared to fixed-wing and rotor based vehicles. The limiting factor in their widespread use in autonomous applications comes from the complexity of the control task. The so-called airships are highly-susceptible to aerodynamic forces and pose complex nonlinear system dynamics that complicate their modeling and control. Model-free control lends itself well as a solution to this type of problem, as it derives its control policies using input-output data, and can therefore learn complex dynamics and handle uncertain or unknown parameters and disturbances. In this work, two multi-input multi-output algorithms are presented on the basis of optimal control theory. Leveraging results from reinforcement learning, a single layer, partially connected neural network is formulated as a value function appropriator in accordance with Weierstrass higher-order approximation theorem. The so-called critic-network is updated using gradient descent methods on the mean-squared error of the temporal difference equation. In the single-network controller, the control policy is formulated as a closed form equation that is parameterized on the weights of the critic-network. A second controller is proposed that uses a second single-layer partially connected neural network, the actor-network, to calculate the control action. The actor-network is also updated using gradient descent on the squared error of the temporal difference equation. The controllers are employed in a highly realistic simulation airship model in nominal conditions and in the presence of external disturbances in the form of turbulent wind. To verify the validity and test the sensitivity of the algorithms to design parameters (the initialization of certain terms), ablation studies are carried out with multiple initial parameters. Both of the proposed algorithms are able to track the desired waypoints in both the nominal and disturbed flight tests. Furthermore, the performance of the controllers is compared to a modern, state-of-the-art multi-input multi-output controller. The two proposed controllers outperform the comparison controller in all but one flight test, with up to four fold reduction in the integral absolute error and integral time absolute error metrics. On top of the quantitative improvements seen in the proposed controllers, both controllers demonstrate a reduction in system oscillation and actuator chattering with respect to the comparison algorithm.
22

Nussbaum gain based iterative learning control for a class of multi-input multi-output nonlinear systems.

Jiang, Ping, Chen, H. January 2005 (has links)
Yes / An adaptive iterative learning control(ILC) approach is proposed for a class of multi-input multi-output (MIMO) uncertain nonlinear systems without prior knowledge about system control gain matrices. The Nussbaum-type gain and the positive definite discrete matrix kernel are proposed for dealing with selection of the unknown control gain and learning of the repeatable uncertainties, respectively. Asymptotic convergence for a trajectory tracking within a finite time interval is achieved through repetitive tracking. Simulations are carried out to show the validity of the proposed control method.
23

A universal iterative learning stabilizer for a class of MIMO systems.

Jiang, Ping, Chen, H., Bamforth, C.A. January 2006 (has links)
No / Design of iterative learning control (ILC) often requires some prior knowledge about a system's control matrix. In some applications, such as uncalibrated visual servoing, this kind of knowledge may be unavailable so that a stable learning control cannot always be achieved. In this paper, a universal ILC is proposed for a class of multi-input multi-output (MIMO) uncertain nonlinear systems with no prior knowledge about the system control gain matrix. It consists of a gain matrix selector from the unmixing set and a learned compensator in a form of the positive definite discrete matrix kernel, corresponding to rough gain matrix probing and refined uncertainty compensating, respectively. Asymptotic convergence for a trajectory tracking within a finite time interval is achieved through repetitive tracking. Simulations and experiments of uncalibrated visual servoing are carried out in order to verify the validity of the proposed control method.
24

[en] LEARNING CONTROL OF HIGH FREQUENCY SERVO: HYDRAULIC SYSTEMS / [pt] CONTROLE POR APRENDIZADO DE SISTEMAS SERVO: HIDRÁULICOS DE ALTA FREQÜÊNCIA

JUAN GERARDO CASTILLO ALVA 28 October 2008 (has links)
[pt] Sistemas hidráulicos são usados onde se requerem forças e torques relativamente altos, alta velocidade de resposta para o início, parada e reversão da velocidade. Eles são usados em sistemas industriais, em robótica, simuladores de movimento, plantas automatizadas, exploração de minérios, prensas, e especialmente em sistemas de testes de fadiga de materiais. As máquinas de testes de fadiga baseadas em sistemas servo-hidráulicos têm como propósito fazer ensaios nos materiais para prever a vida útil em serviço. Os ensaios de fadiga são quase sempre independentes da freqüência de trabalho. Para uma dada resistência do material e magnitudes das tensões alternadas e médias aplicadas, a vida à fadiga depende essencialmente do número de ciclos de carga aplicados ao material testado. Por esse motivo, trabalhar com a máquina de ensaios de materiais a uma freqüência elevada traz vantagens de redução de tempo e custo dos ensaios, sem interferir nos resultados. A aplicação da carga pode ser repetida milhões de vezes, em freqüências típicas de até cem vezes por segundo para metais. Para se atingirem estas freqüências, relativamente altas para um teste de fadiga, é necessário um sistema de controle eficiente. Nesta dissertação, técnicas de controle por aprendizado são desenvolvidas e aplicadas a uma máquina de ensaios de materiais, permitindo a aplicação de carregamentos de amplitude variável em alta freqüência. A metodologia proposta consiste em fazer um controle do tipo bang-bang, restringindo à servo-válvula do sistema a trabalhar sempre nos seus limites extremos de operação, i.e., procurando mantê-la sempre completamente aberta em uma ou outra direção. Devido à dinâmica do sistema, os pontos de reversão devem ficar antes dos picos e vales de força ou tensão desejada. O instante de reversão é um parâmetro que depende de diversos fatores, como a amplitude e carga média da solicitação, e também é influenciado por zonas mortas causadas, e.g., por folgas na fixação dos corpos de prova. Para que a servo-válvula trabalhe no limite de seu funcionamento, o algoritmo de aprendizado obtém os instantes ótimos para as reversões, associados a variáveis adimensionais com valores entre 0 e 1, armazenados em tabelas específicas para cada tipo de carregamento. A lei de aprendizado preenche e atualiza constantemente os valores das tabelas durante a execução dos testes, melhorando a resposta do sistema a cada evento. Apresentam-se a modelagem dinâmica de uma máquina servohidráulica e de sua malha de controle, e simulações comparando o controle PID com o controle por aprendizado proposto. A validação experimental é feita em uma máquina servo-hidráulica de ensaios de fadiga. Para este fim, um software de controle em tempo real foi especialmente desenvolvido e implementado em um sistema computacional CompactRIO. Os resultados demonstram a eficiência da metodologia proposta. / [en] Hydraulic systems are used where relatively high forces and torques are required, or when high response speeds are necessary. They are used in industrial systems, robotics, movement simulators, automated plants, ore exploration, presses, and especially in fatigue testing systems. Fatigue tests are usually performed on servo-hydraulic systems, in order to predict the behavior of materials and their life in service. Fatigue tests are almost always independent of the loading frequency. For a given material and magnitudes of alternate and mean stresses, the fatigue life depends essentially on the number of applied load cycles on the tested material. For this reason, working with the material testing machine at high frequencies brings the advantages of reduction in time and cost, without altering the results. The application of the load can be repeated millions of times, in frequencies of up to one hundred times per second for metals, or even more. To achieve such frequencies, relatively high for a fatigue test, it is necessary to use an efficient control system. In this thesis, learning control techniques are developed and applied to a materials testing machine, allowing the application of constant or variable amplitude loads in high frequency. The proposed methodology consists of implementing a bang-bang type control, restricting the system servo-valve to always work at its extreme limits of operation, i.e., always keeping it completely open in one or the other direction. Due to the system dynamics, the reversion instant must happen before achieving the peaks and valleys of desired force (or stress, strain, etc.). The reversion instant is a parameter that depends on several factors, such as the alternate and mean loading components. It is also influenced by dead zones caused, e.g., by the slack in the mounting between a CTS specimen and the machine pins. As the servo-valve works in its limits of operation, the learning algorithm tries to obtain the optimal instants for the reversions, associating them to a non dimensional variable with values between 0 and 1, stored in specific tables. The learning law constantly updates the values of the table during the execution of the tests, improving the system response. In this work, the dynamic modeling of a servo-hydraulic machine is presented, together with its control scheme. Simulations are performed to compare results from PID and learning controls. The experimental validation is made using a servohydraulic testing machine. For this purpose, real time control software is developed and implemented in a CompactRIO computational system. The results demonstrate the efficiency of the proposed methodology.
25

Auto-Calibration and Control Applied to Electro-Hydraulic Poppet Valves

Opdenbosch, Patrick 12 November 2007 (has links)
Modern control design is sometimes accompanied by the challenge of dealing with nonlinear systems or plants. In some situations, due to the complexity of the plant and the unavailability of suitable models, the controls engineer opts for developing control schemes based on look-up tables. These tables, typically populated with the steady state inverse input-output characteristics of the plant, are used to compensate the plant via open-loop or closed-loop to solve the control problem. In an effort to present a new alternative, a general theoretical framework for online auto-calibration and control of general nonlinear systems is developed in this dissertation. This technique simultaneously learns the inverse input-state mapping (i.e. the calibration mapping) of the plant while forcing its state to follow a prescribed desired trajectory. The main requirements for the successful application of the novel control law are knowledge of the order of the plant and some generic data to initialize the inverse mapping. This last requirement can be easily fulfilled by using steady-state data or the equilibrium points of the plant. In this approach, the inverse mapping is learned from the current and past states. The learning is accomplished in a composite manner by employing input and state errors. The map is used simultaneously in the feedforward path to control the plant. The performance of the plant subject to this novel controller is validated through simulations and experimental data. The new control method is applied to a novel Electro-Hydraulic Poppet Valve (EHPV). These valves are used in a Wheatstone bridge arrangement for motion control of hydraulic actuators. This is preferred over the conventional use of spool valves due to the energy savings potential. It is shown in this dissertation that this method improves the value of using these types of valves for motion control in hydraulics. This is due to the combination of self-learning (auto-calibration) and better performance for a more efficient operation of hydraulic equipment. Additionally, it is shown that the auto-calibration of the valves can be used for health monitoring of the same, which consequently improves their reliability and expedites maintenance downtime.
26

Iterative Evaluation and Control Methods for Disturbance Suppression on a High Precision Motion Servo

Thunberg, Claes, Kastensson, Klara January 2023 (has links)
Moore’s law states that the number of transistors in an Integrated Circuit (IC) doubles every two years. Ever-increasing performance in mask writing machinery is therefore required being the first step in the manufacturing process. Many factors affect the quality of the end product, with the motion control system playing an important role. This thesis analyzes the performance of the motion controller for the positioning system in a mask writer application. The target motion in the X-axis in the mask writer is by design highly repetitive and predictable. As of today a feedforward-feedback controller is used, tuned for low deviation during writing. In this thesis it is found that the motion control can be improved by exploiting the repetitive nature of the motion task. Two iterative methods are explored, Iterative Feedback Tuning (IFT) and Iterative Learning Control (ILC). IFT is implemented as a parameter optimizing method for the existing Proportional-Integral-Derivative (PID) controller. Given suboptimal initial parameters the algorithm converges to a global minimum using a cost function to minimize total deviation and constraints on the maximum deviation. With the optimized parameter settings an improvement of a 31 % decrease in total deviation is seen compared to the default setting. ILC is implemented as a replacement to the current controller in an exposure motion. With the use of saved data from previous iterations the control signal is updated and refined to better suit the target motion. ILC is a promising method within high precision motion control by virtue of not needing a model of the system and its ability to suppress reoccurring disturbances. The algorithm achieves an improvement of a 94% decrease in total deviation during writing compared to the current controller. However, with this implementation long term stability is not guaranteed. A stable implementation of the algorithm tested on a test rig achieves an improvement of a 79.8% decrease in deviation during writing compared to the current feedforward-feedback controller. Additionally, correlations between parameter values of the current feedback controller and servo characteristics are analyzed to aid in the manual tuning process. Tuning the PID controller for fast rise time decreases the total deviation during writing. The derivative gain in the controller should be high to decrease the overshoot caused by the aggressive controller. This will induce some oscillations into the system, however not at the cost of performance as a result of the smooth motion during writing.
27

Sensor Fusion and Control Applied to Industrial Manipulators

Axelsson, Patrik January 2014 (has links)
One of the main tasks for an industrial robot is to move the end-effector in a predefined path with a specified velocity and acceleration. Different applications have different requirements of the performance. For some applications it is essential that the tracking error is extremely small, whereas other applications require a time optimal tracking. Independent of the application, the controller is a crucial part of the robot system. The most common controller configuration uses only measurements of the motor angular positions and velocities, instead of the position and velocity of the end-effector. The development of new cost optimised robots has introduced unwanted flexibilities in the joints and the links. The consequence is that it is no longer possible to get the desired performance and robustness by only measuring the motor angular positions.  This thesis investigates if it is possible to estimate the end-effector position using Bayesian estimation methods for state estimation, here represented by the extended Kalman filter and the particle filter. The arm-side information is provided by an accelerometer mounted at the end-effector. The measurements consist of the motor angular positions and the acceleration of the end-effector. In a simulation study on a realistic flexible industrial robot, the angular position performance is shown to be close to the fundamental Cramér-Rao lower bound. The methods are also verified in experiments on an ABB IRB4600 robot, where the dynamic performance of the position for the end-effector is significantly improved. There is no significant difference in performance between the different methods. Instead, execution time, model complexities and implementation issues have to be considered when choosing the method. The estimation performance depends strongly on the tuning of the filters and the accuracy of the models that are used. Therefore, a method for estimating the process noise covariance matrix is proposed. Moreover, sampling methods are analysed and a low-complexity analytical solution for the continuous-time update in the Kalman filter, that does not involve oversampling, is proposed.  The thesis also investigates two types of control problems. First, the norm-optimal iterative learning control (ILC) algorithm for linear systems is extended to an estimation-based norm-optimal ILC algorithm where the controlled variables are not directly available as measurements. The algorithm can also be applied to non-linear systems. The objective function in the optimisation problem is modified to incorporate not only the mean value of the estimated variable, but also information about the uncertainty of the estimate. Second, H∞ controllers are designed and analysed on a linear four-mass flexible joint model. It is shown that the control performance can be increased, without adding new measurements, compared to previous controllers. Measuring the end-effector acceleration increases the control performance even more. A non-linear model has to be used to describe the behaviour of a real flexible joint. An H∞-synthesis method for control of a flexible joint, with non-linear spring characteristic, is therefore proposed. / En av de viktigaste uppgifterna för en industrirobot är att förflytta verktyget i en fördefinierad bana med en specificerad hastighet och acceleration. Exempel på användningsområden för en industrirobot är bland annat bågsvetsning eller limning. För dessa typer av applikationer är det viktigt att banföljningsfelet är extremt litet, men även hastighetsprofilen måste följas så att det till exempel inte appliceras för mycket eller för lite lim. Andra användningsområden kan vara punktsvetsning av bilkarosser och paketering av olika varor. För dess applikationer är banföljningen inte det viktiga, istället kan till exempel en tidsoptimal banföljning krävas eller att svängningarna vid en inbromsning minimeras. Oberoende av applikationen är regulatorn en avgörande del av robotsystemet. Den vanligaste regulatorkonfigurationen använder bara mätningar av motorernas vinkelpositioner och -hastigheter, istället för positionen och hastigheten för verktyget, som är det man egentligen vill styra.  En del av utvecklingsarbetet för nya generationers robotar är att reducera kostnaden men samtidigt förbättra prestandan. Ett sätt att minska kostnaden kan till exempel vara att minska dimensionerna på länkarna eller köpa in billigare växellådor. Den här utvecklingen av kostnadsoptimerade robotar har infört oönskade flexibiliteter i leder och länkar. Det är därför inte längre möjligt att få den önskade prestandan och robustheten genom att bara mäta motorernas vinkelpositioner och -hastigheter. Istället krävs det omfattande matematiska modeller som beskriver dessa oönskade flexibiliteter. Dessa modeller kräver mycket arbete att dels ta fram men även för att identifiera parametrarna. Det finns automatiska metoder för att beräkna modellparametrarna men oftast krävs det en manuell justering för att få bra prestanda.  Den här avhandlingen undersöker möjligheterna att beräkna verktygspositionen med hjälp av bayesianska metoder för tillståndsskattning. De bayesianska skattningsmetoderna beräknar tillstånden för ett system iterativt. Med hjälp av en matematisk modell över systemet predikteras vad tillståndet ska vara vid nästa tidpunkt. Efter att mätningar av systemet vid den nya tidpunkten har genomförts justeras skattningen med hjälp av dessa mätningar. De metoder som har använts i avhandlingen är det så kallade extended Kalman filtret samt partikelfiltret.  Informationen på armsidan av växellådan ges av en accelerometer som är monterad på verktyget. Med hjälp av accelerationen för verktyget och motorernas vinkelpositioner kan en skattning av verktygspositionen beräknas. I en simuleringsstudie för en realistisk vek robot har det visats att skattningsprestandan ligger nära den teoretiska undre gränsen, känd som Raooch mätstörningar som påverkar roboten. För att underlätta trimningen så har en metod för att skatta processbrusets kovariansmatris föreslagits. En annan viktig del som påverkar prestandan är modellerna som används i filtren. Modellerna för en industrirobot är vanligtvis framtagna i kontinuerlig tid medan filtren använder modeller i diskret tid. För att minska felen som uppkommer då de tidskontinuerliga modellerna överförs till diskret tid har olika samplingsmetoder studerats. Vanligtvis används enkla metoder för att diskretisera vilket innebär problem med prestanda och stabilitet. För att hantera dessa problem införs översampling vilket innebär att tidsuppdateringen sker med en mycket kortare sampeltid än vad mätuppdateringen gör. För att undvika översampling kan det motsvarande tidskontinuerliga filtret användas för att prediktera tillstånden vid nästa diskreta tidpunkt. En analytisk lösning med låg beräkningskomplexitet till detta problem har föreslagits.  Vidare innehåller avhandlingen två typer av reglerproblem relaterade till industrirobotar. För det första har den så kallade norm-optimala iterative learning control styrlagen utökats till att hantera fallet då en skattning av den önskade reglerstorheten används istället för en mätning. Med hjälp av skattningen av systemets tillståndsvektor kan metoden nu även användas till olinjära system vilket inte är fallet med standardformuleringen. Den föreslagna metoden utökar målfunktionen i optimeringsproblemet till att innehålla inte bara väntevärdet av den skattade reglerstorheten utan även skattningsfelets kovariansmatris. Det innebär att om skattningsfelet är stort vid en viss tidpunkt ska den skattade reglerstorheten vid den tidpunkten inte påverka resultatet mycket eftersom det finns en stor osäkerhet i var den sanna reglerstorheten befinner sig.  För det andra har design och analys av H∞-regulatorer för en linjär modell av en vek robotled, som beskrivs med fyra massor, genomförts. Det visar sig att reglerprestandan kan förbättras, utan att lägga till fler mätningar än motorns vinkelposition, jämfört med tidigare utvärderade regulatorer. Genom att mäta verktygets acceleration kan prestandan förbättras ännu mer. Modellen över leden är i själva verket olinjär. För att hantera detta har en H∞-syntesmetod föreslagits som kan hantera olinjäriteten i modellen. / Vinnova Excellence Center LINK-SIC
28

Développement d'une commande à modèle partiel appris : analyse théorique et étude pratique / Development of a control law based on learned sparse model : theorical analysis and practical study

Nguyen, Huu Phuc 16 December 2016 (has links)
En théorie de la commande, un modèle du système est généralement utilisé pour construire la loi de commande et assurer ses performances. Les équations mathématiques qui représentent le système à contrôler sont utilisées pour assurer que le contrôleur associé va stabiliser la boucle fermée. Mais, en pratique, le système réel s’écarte du comportement théorique modélisé. Des non-linéarités ou des dynamiques rapides peuvent être négligées, les paramètres sont parfois difficiles à estimer, des perturbations non maitrisables restent non modélisées. L’approche proposée dans ce travail repose en partie sur la connaissance du système à piloter par l’utilisation d’un modèle analytique mais aussi sur l’utilisation de données expérimentales hors ligne ou en ligne. A chaque pas de temps la valeur de la commande qui amène au mieux le système vers un objectif choisi a priori, est le résultat d’un algorithme qui minimise une fonction de coût ou maximise une récompense. Au centre de la technique développée, il y a l’utilisation d’un modèle numérique de comportement du système qui se présente sous la forme d’une fonction de prédiction tabulée ayant en entrée un n-uplet de l’espace joint entrées/état ou entrées/sorties du système. Cette base de connaissance permet l’extraction d’une sous-partie de l’ensemble des possibilités des valeurs prédites à partir d’une sous-partie du vecteur d’entrée de la table. Par exemple, pour une valeur de l’état, on pourra obtenir toutes les possibilités d’états futurs à un pas de temps, fonction des valeurs applicables de commande. Basé sur des travaux antérieurs ayant montré la viabilité du concept en entrées/état, de nouveaux développements ont été proposés. Le modèle de prédiction est initialisé en utilisant au mieux la connaissance a priori du système. Il est ensuite amélioré par un algorithme d’apprentissage simple basé sur l’erreur entre données mesurées et données prédites. Deux approches sont utilisées : la première est basée sur le modèle d’état (comme dans les travaux antérieurs mais appliquée à des systèmes plus complexes), la deuxième est basée sur un modèle entrée-sortie. La valeur de commande qui permet de rapprocher au mieux la sortie prédite dans l’ensemble des possibilités atteignables de la sortie ou de l’état désiré, est trouvée par un algorithme d’optimisation. Afin de valider les différents éléments proposés, cette commande a été mise en œuvre sur différentes applications. Une expérimentation réelle sur un quadricoptère et des essais réels de suivi de trajectoire sur un véhicule électrique du laboratoire montrent sacapacité et son efficacité sur des systèmes complexes et rapides. D’autres résultats en simulation permettent d’élargir l’étude de ses performances. Dans le cadre d’un projet partenarial, l’algorithme a également montré sa capacité à servir d’estimateur d’état dans la reconstruction de la vitesse mécanique d’une machine asynchrone à partir des signaux électriques. Pour cela, la vitesse mécanique a été considérée comme l’entrée du système. / In classical control theory, the control law is generally built, based on the theoretical model of the system. That means that the mathematical equations representing the system dynamics are used to stabilize the closed loop. But in practice, the actual system differs from the theory, for example, the nonlinearity, the varied parameters and the unknown disturbances of the system. The proposed approach in this work is based on the knowledge of the plant system by using not only the analytical model but also the experimental data. The input values stabilizing the system on open loop, that minimize a cost function, for example, the distance between the desired output and the predicted output, or maximize a reward function are calculated by an optimal algorithm. The key idea of this approach is to use a numerical behavior model of the system as a prediction function on the joint state and input spaces or input-output spaces to find the controller’s output. To do this, a new non-linear control concept is proposed, based on an existing controller that uses a prediction map built on the state-space. The prediction model is initialized by using the best knowledge a priori of the system. It is then improved by using a learning algorithm based on the sensors’ data. Two types of prediction map are employed: the first one is based on the state-space model; the second one is represented by an input-output model. The output of the controller, that minimizes the error between the predicted output from the prediction model and the desired output, will be found using optimal algorithm. The application of the proposed controller has been made on various systems. Some real experiments for quadricopter, some actual tests for the electrical vehicle Zoé show its ability and efficiency to complex and fast systems. Other the results in simulation are tested in order to investigate and study the performance of the proposed controller. This approach is also used to estimate the rotor speed of the induction machine by considering the rotor speed as the input of the system.
29

Robust Iterative Learning Control for Linear and Hybrid Systems with Applications to Automotive Control

Mishra, Kirti D. January 2020 (has links)
No description available.
30

Learning model predictive control with application to quadcopter trajectory tracking

Maji, Abhishek January 2020 (has links)
In thiswork, we develop a learning model predictive controller (LMPC) for energy-optimaltracking of periodic trajectories for a quadcopter. The main advantage of this controller isthat it is “reference-free”. Moreover, the controller is able to improve its performance overiterations by incorporating learning from the previous iterations. The proposed learningmodel predictive controller aims to learn the “best” energy-optimal trajectory over timeby learning a terminal constraint set and a terminal cost from the history data of previousiterations. We have shown howto recursively construct terminal constraint set and terminalcost as a convex hull and a convex piece-wise linear approximation of state and inputtrajectories of previous iterations, respectively. These steps allow us to formulate theonline planning problem for the controller as a convex optimization program, therebyavoiding the complex combinatorial optimization problems that alternative formulationsin the literature need to solve. The data-driven terminal constraint set and terminal costnot only ensure recursive feasibility and stability of LMPC but also guarantee convergenceto the neighbourhood of the optimal performance at steady state. Our LMPC formulationincludes linear time-varying system dynamics which is also learnt from stored state andinput trajectories of previous iterations.To show the performance of LMPC, a quadcopter trajectory learning problem in thevertical plane is simulated in MATLAB/SIMULINK. This particular trajectory learningproblem involves non-convex state constraints, which makes the resulting optimal controlproblem difficult to solve. A tangent cut method is implemented to approximate the nonconvexconstraints by convex ones, which allows the optimal control problem to be solvedby efficient convex optimization solvers. Simulation results illustrate the effectiveness ofthe proposed control strategy. / Vi utvecklar en lärande modell-prediktiv regulator för energi-optimalt följande av periodiskatrajektorier för en quadkopter. Den huvudsakliga fördelen med denna regulator äratt den är “referensfri”. Dessutom så klarar regulatorn att förbättra sin prestanda medtiden genom att inkorporera inlärning från föregående iterationer. Syftet med den föreslagnalärande modell-prediktiva regulatorn är att över en viss tid lära sig den “bästa”energioptimala trajektorian genom att lära sig den terminala bivillkorsmängden och denterminala kostnaden från historiskt data från tidigare iterationer. Vi har visat hur man kanrekursivt konstruera terminala bivillkorsmängder och terminala kostnader som konvexahöljen respektive konvexa styckvis linjära approximationer av tillstånds- och insignalstrajektoriernafrån tidigare iterationer. Dessa steg gör det möjligt att formulera onlineplaneringsproblemet för regulatorn som ett konvext optimeringsproblem och på så visundvika de komplexa kombinatoriska optimeringsproblemen som ofta krävs för alternativametoder som kan hittas andra publikationer. Den datadrivna terminala bivillkorsmängdenoch terminala kostnaden garanterar inte bara rekursiv tillåtenhet och stabilitet av LMPC,utan även konvergens till en omgivning av den optimala prestandan efter att ha uppnåttjämvikt. Vår LMPC-formulering innehåller linjär och tidsvarierande systemdynamik, somockså lärs från lagrade tillstånds- och insignalstrajektorier från tidigare iterationer.För att visa prestandan av LMPC så simuleras iMATLAB/SIMULINK ett problem ominlärning av quadkopter-trajektorier i det vertikala planet. Just det trajektorieinlärningsproblemetinnehåller icke-konvexa tillståndsbivillkor, vilket gör det resulterande optimeringsproblemetsvårt att lösa. En tangentsnitt-metod är implementerad för att approximera deicke-konvexa bivillkoren med hjälp av konvexa bivillkor, vilket möjliggör lösningen avdet optimala regleringsproblemet med effektiva lösare för konvexa optimeringsproblem.Simuleringsresultaten visar effektivitet av den föreslagna regleringsmetoden.

Page generated in 0.0395 seconds