Global ETD Search

1	Řízení pohybu rotačního inverzního kyvadla / Control of a rotational inverted pendulum Bednář, Ladislav January 2020 The goal of this work is the modelling and design of an inverted pendulum prototype. The work presents a mathematical model of a rotary pendulum, a model of BLDC motors, and a 3D model of the pendulum prototype. It covers the design of a state-space controller and swing-up control for the inverted pendulum. The dynamics obtained from the mathematical model are used to create a 3D dynamic model of the pendulum with the Simscape toolbox. The BLDC motors are driven using vector control, with the algorithm implemented on the CompactRIO platform. Finally, hardware containing an STMicroelectronics microcontroller is developed that is capable of replacing the CompactRIO platform.
2	Benchmarking Deep Reinforcement Learning on Continuous Control Tasks: A Comparison of Neural Network Architectures and Environment Designs / Prestandajämförelse av djup förstärkningsinlärning för kontinuerliga system : En jämförelse av neurala nätverksarkitekturer och miljödesigner Sahlin, Daniel January 2022 Deep Reinforcement Learning (RL) has received much attention in recent years. This thesis investigates how reward functions, environment termination conditions, Neural Network (NN) architectures, and the type of deep RL algorithm affect performance on continuous control tasks. To this end, the Furuta pendulum swing-up task is adopted as the primary benchmark, since it offers low input and state dimensionality without being trivial. Focusing on model-free algorithms, the results indicate that DDPG, an actor-critic algorithm, performs significantly better than the other algorithms. They also suggest that larger NN architectures may benefit performance in some instances. Comparing reward functions, Potential Based Reward Shaping (PBRS) applied to a sparse reward signal shows promising results compared to a reward function from previous work, and combining PBRS with large negative rewards for terminations due to unwanted behavior seems to improve performance for some algorithms. However, although designs such as PBRS can improve performance, they are shown not to be necessary for adequate performance, and the same applies to environment terminations upon unwanted behavior. Applying a DDPG agent trained in a simulator to a physical Furuta pendulum results in performance that closely resembles what is observed in the simulator for certain training seeds. The results and test suite of this thesis are available on GitHub and should hopefully help inspire future research in environment design and NN architectures for deep RL. Specifically, future work may investigate whether extensive parameter tuning alters the results.
Keywords: Deep learning; Reinforcement learning; Reward functions; Neural networks; Furuta pendulum. Subject: Computer Sciences.
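As a rough illustration of the potential-based reward shaping (PBRS) named in the abstract above, the sketch below adds the standard shaping term gamma * Phi(s') - Phi(s) to a sparse reward. The potential function, the tolerance, the discount factor, and all function names are assumptions chosen for illustration; the thesis's actual definitions are not given in this listing.

```python
import math

GAMMA = 0.99  # discount factor (assumed value, not taken from the thesis)


def wrap_angle(theta):
    """Map an angle in radians to the interval [-pi, pi)."""
    return (theta + math.pi) % (2 * math.pi) - math.pi


def potential(theta):
    """Hypothetical potential: 0 when upright (theta = 0), -1 when hanging down."""
    return -abs(wrap_angle(theta)) / math.pi


def sparse_reward(theta, tol=0.1):
    """Sparse signal: 1 only when the pendulum is within tol radians of upright."""
    return 1.0 if abs(wrap_angle(theta)) < tol else 0.0


def shaped_reward(theta, theta_next, gamma=GAMMA):
    """Sparse reward plus the PBRS term gamma * Phi(s') - Phi(s)."""
    return (sparse_reward(theta_next)
            + gamma * potential(theta_next)
            - potential(theta))
```

With this choice of potential, a step that moves the pendulum toward upright earns a positive shaping bonus and a step away from upright earns a negative one, even while the sparse reward itself is still zero.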
3	Návrh řízení rotačního inverzního kyvadla / Control Design of the Rotation Inverted Pendulum Cejpek, Zdeněk January 2019 The aim of this thesis is to build a simulation model of a rotary (Furuta) pendulum and to design suitable controllers. It describes the assembly of a nonlinear simulation model using Matlab–Simulink and its Simscape–SimMechanics library. It then discusses a linear discrete-time model obtained from the system response using the least squares method. This linear model serves as an approximation of the system for designing two linear discrete state-space controllers with a summator (discrete integrator). These controllers are supported by a simple swing-up controller and by logic that manages their cooperation.
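The least-squares identification step mentioned in the abstract above can be sketched on a deliberately simplified case. The example assumes a scalar first-order model x[k+1] = a*x[k] + b*u[k] and noise-free data; the thesis identifies a multi-state model of the pendulum, so the model order, function names, and test signal here are all hypothetical.

```python
def fit_first_order(x, u):
    """Least-squares fit of x[k+1] = a*x[k] + b*u[k]; returns (a, b)."""
    # Accumulate the entries of the 2x2 normal equations.
    sxx = sxu = suu = sxy = suy = 0.0
    for k in range(len(x) - 1):
        sxx += x[k] * x[k]
        sxu += x[k] * u[k]
        suu += u[k] * u[k]
        sxy += x[k] * x[k + 1]
        suy += u[k] * x[k + 1]
    det = sxx * suu - sxu * sxu
    if abs(det) < 1e-12:
        raise ValueError("input does not excite the system enough to identify it")
    # Solve the normal equations with Cramer's rule.
    a = (sxy * suu - suy * sxu) / det
    b = (suy * sxx - sxy * sxu) / det
    return a, b


def simulate(a, b, u, x0=0.0):
    """Generate the response of the assumed model to the input sequence u."""
    x = [x0]
    for uk in u:
        x.append(a * x[-1] + b * uk)
    return x


# Example: recover known parameters from a simulated response.
u_demo = [1.0, -1.0, 0.5, 0.0, 2.0, -0.5, 1.0, 0.3]
x_demo = simulate(0.9, 0.2, u_demo)
a_hat, b_hat = fit_first_order(x_demo, u_demo)
```

For a model with several states, the same idea is usually expressed as a matrix least-squares problem and solved with a numerical library rather than by hand.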
