Global ETD Search

11	Causal Models over Infinite Graphs and their Application to the Sensorimotor Loop / Kausale Modelle über unendlichen Grafen und deren Anwendung auf die sensomotorische Schleife - stochastische Aspekte und gradientenbasierte optimale Steuerung Bernigau, Holger 27 April 2015 (has links) (PDF) Motivation and background The enormous amount of capabilities that every human learns throughout his life, is probably among the most remarkable and fascinating aspects of life. Learning has therefore drawn lots of interest from scientists working in very different fields like philosophy, biology, sociology, educational sciences, computer sciences and mathematics. This thesis focuses on the information theoretical and mathematical aspects of learning. We are interested in the learning process of an agent (which can be for example a human, an animal, a robot, an economical institution or a state) that interacts with its environment. Common models for this interaction are Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs). Learning is then considered to be the maximization of the expectation of a predefined reward function. In order to formulate general principles (like a formal definition of curiosity-driven learning or avoidance of unpleasant situation) in a rigorous way, it might be desirable to have a theoretical framework for the optimization of more complex functionals of the underlying process law. This might include the entropy of certain sensor values or their mutual information. An optimization of the latter quantity (also known as predictive information) has been investigated intensively both theoretically and experimentally using computer simulations by N. Ay, R. Der, K Zahedi and G. Martius. In this thesis, we develop a mathematical theory for learning in the sensorimotor loop beyond expected reward maximization. Approaches and results This thesis covers four different topics related to the theory of learning in the sensorimotor loop. First of all, we need to specify the model of an agent interacting with the environment, either with learning or without learning. This interaction naturally results in complex causal dependencies. Since we are interested in asymptotic properties of learning algorithms, it is necessary to consider infinite time horizons. It turns out that the well-understood theory of causal networks known from the machine learning literature is not powerful enough for our purpose. Therefore we extend important theorems on causal networks to infinite graphs and general state spaces using analytical methods from measure theoretic probability theory and the theory of discrete time stochastic processes. Furthermore, we prove a generalization of the strong Markov property from Markov processes to infinite causal networks. Secondly, we develop a new idea for a projected stochastic constraint optimization algorithm. Generally a discrete gradient ascent algorithm can be used to generate an iterative sequence that converges to the stationary points of a given optimization problem. Whenever the optimization takes place over a compact subset of a vector space, it is possible that the iterative sequence leaves the constraint set. One possibility to cope with this problem is to project all points to the constraint set using Euclidean best-approximation. The latter is sometimes difficult to calculate. A concrete example is an optimization over the unit ball in a matrix space equipped with operator norm. Our idea consists of a back-projection using quasi-projectors different from the Euclidean best-approximation. In the matrix example, there is another canonical way to force the iterative sequence to stay in the constraint set: Whenever a point leaves the unit ball, it is divided by its norm. For a given target function, this procedure might introduce spurious stationary points on the boundary. We show that this problem can be circumvented by using a gradient that is tailored to the quasi-projector used for back-projection. We state a general technical compatibility condition between a quasi-projector and a metric used for gradient ascent, prove convergence of stochastic iterative sequences and provide an appropriate metric for the unit-ball example. Thirdly, a class of learning problems in the sensorimotor loop is defined and motivated. This class of problems is more general than the usual expected reward maximization and is illustrated by numerous examples (like expected reward maximization, maximization of the predictive information, maximization of the entropy and minimization of the variance of a given reward function). We also provide stationarity conditions together with appropriate gradient formulas. Last but not least, we prove convergence of a stochastic optimization algorithm (as considered in the second topic) applied to a general learning problem (as considered in the third topic). It is shown that the learning algorithm converges to the set of stationary points. Among others, the proof covers the convergence of an improved version of an algorithm for the maximization of the predictive information as proposed by N. Ay, R. Der and K. Zahedi. We also investigate an application to a linear Gaussian dynamic, where the policies are encoded by the unit-ball in a space of matrices equipped with operator norm. Wahrscheinlichkeitstheorie kausale Modelle grafische Modelle Starke Markov Eigenschaft optimale Steuerung mengenwertige Analysis Informationstheorie stochastische Optimierung Informationstheorie prediktive Information bestärkendes Lernen Markov-Entscheidungsproblem sensomotorische Schleife stochastische Konvergenz probability theory causal models graphical models strong Markov property optimal control set-valued analysis stochastic optimization information theory predictive information reinforcement learning sensorimotor loop stochastic convergence ddc:500
12	Causal Models over Infinite Graphs and their Application to the Sensorimotor Loop: Causal Models over Infinite Graphs and their Application to theSensorimotor Loop: General Stochastic Aspects and GradientMethods for Optimal Control Bernigau, Holger 04 July 2015 (has links) Motivation and background The enormous amount of capabilities that every human learns throughout his life, is probably among the most remarkable and fascinating aspects of life. Learning has therefore drawn lots of interest from scientists working in very different fields like philosophy, biology, sociology, educational sciences, computer sciences and mathematics. This thesis focuses on the information theoretical and mathematical aspects of learning. We are interested in the learning process of an agent (which can be for example a human, an animal, a robot, an economical institution or a state) that interacts with its environment. Common models for this interaction are Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs). Learning is then considered to be the maximization of the expectation of a predefined reward function. In order to formulate general principles (like a formal definition of curiosity-driven learning or avoidance of unpleasant situation) in a rigorous way, it might be desirable to have a theoretical framework for the optimization of more complex functionals of the underlying process law. This might include the entropy of certain sensor values or their mutual information. An optimization of the latter quantity (also known as predictive information) has been investigated intensively both theoretically and experimentally using computer simulations by N. Ay, R. Der, K Zahedi and G. Martius. In this thesis, we develop a mathematical theory for learning in the sensorimotor loop beyond expected reward maximization. Approaches and results This thesis covers four different topics related to the theory of learning in the sensorimotor loop. First of all, we need to specify the model of an agent interacting with the environment, either with learning or without learning. This interaction naturally results in complex causal dependencies. Since we are interested in asymptotic properties of learning algorithms, it is necessary to consider infinite time horizons. It turns out that the well-understood theory of causal networks known from the machine learning literature is not powerful enough for our purpose. Therefore we extend important theorems on causal networks to infinite graphs and general state spaces using analytical methods from measure theoretic probability theory and the theory of discrete time stochastic processes. Furthermore, we prove a generalization of the strong Markov property from Markov processes to infinite causal networks. Secondly, we develop a new idea for a projected stochastic constraint optimization algorithm. Generally a discrete gradient ascent algorithm can be used to generate an iterative sequence that converges to the stationary points of a given optimization problem. Whenever the optimization takes place over a compact subset of a vector space, it is possible that the iterative sequence leaves the constraint set. One possibility to cope with this problem is to project all points to the constraint set using Euclidean best-approximation. The latter is sometimes difficult to calculate. A concrete example is an optimization over the unit ball in a matrix space equipped with operator norm. Our idea consists of a back-projection using quasi-projectors different from the Euclidean best-approximation. In the matrix example, there is another canonical way to force the iterative sequence to stay in the constraint set: Whenever a point leaves the unit ball, it is divided by its norm. For a given target function, this procedure might introduce spurious stationary points on the boundary. We show that this problem can be circumvented by using a gradient that is tailored to the quasi-projector used for back-projection. We state a general technical compatibility condition between a quasi-projector and a metric used for gradient ascent, prove convergence of stochastic iterative sequences and provide an appropriate metric for the unit-ball example. Thirdly, a class of learning problems in the sensorimotor loop is defined and motivated. This class of problems is more general than the usual expected reward maximization and is illustrated by numerous examples (like expected reward maximization, maximization of the predictive information, maximization of the entropy and minimization of the variance of a given reward function). We also provide stationarity conditions together with appropriate gradient formulas. Last but not least, we prove convergence of a stochastic optimization algorithm (as considered in the second topic) applied to a general learning problem (as considered in the third topic). It is shown that the learning algorithm converges to the set of stationary points. Among others, the proof covers the convergence of an improved version of an algorithm for the maximization of the predictive information as proposed by N. Ay, R. Der and K. Zahedi. We also investigate an application to a linear Gaussian dynamic, where the policies are encoded by the unit-ball in a space of matrices equipped with operator norm. info:eu-repo/classification/ddc/500 ddc:500
13	Search for heavy Higgs bosons A/H decaying to a top-quark pair in pp collisions at \sqrt(s)=8 TeV with the ATLAS detector Stănescu-Bellu, Mădălina 30 April 2021 (has links) In dieser Dissertation wird die Suche nach schweren neutralen pseudoskalaren A und skalaren H Higgs-Bosonen vorgestellt, die in gg-Fusionen erzeugt werden, und in ein Top-Antitop-Quark-Paar zerfallen. Gesucht wurde im vollständigen Datensatz von Proton–Proton-Kollisionen bei einer Schwerpunktsenergie von 8 TeV die vom ATLAS-Detektor am Large Hadron Collider aufgezeichnet wurde und einer integrierten Luminosität von 20.3 fb−1 entspricht. Der Signalprozess und der Haupthintergrund aus der Top-Quark-Paar-Produktion über starke gg-Fusionen-Prozesse, interferieren heftig, was zu einer Verzerrung des reinen Breit-Wigner-Resonanzpeak in eine Peak-Dip-Struktur führt. Diese Analyse ist die erste am LHC, die die Interferenz zwischen Signal und Hintergrundprozessen vollständig berücksichtigt. Die Suche stützt sich auf die statistische Analyse des invarianten Top-Quark-Paar-Massenspektrum, welches aus Ereignissen mit einem Elektron oder Myon mit hohem Transversalimpuls, einer hohen fehlenden Transversalenergie von dem nicht detektierten Neutrino und mindestens vier Jets rekonstruiert wird. In den Daten wird keine signifikante Abweichung vom erwarteten Standardmodell-Hintergrund beobachtet. Die Ausschließungsgrenzen wurden abgeleitet im Kontext des Typ II Two-Higgs-Doublet Model, für Higgs-Bosonen mit einer Masse von 500 und 750 GeV und mit niedrigerem tan(\beta)-Parameter, bei der tan(\beta) das Verhältnis der Vakuumerwartungswerte der beiden Higgs-Dublett-Felder ist. Diese Parameterregionen sind weitgehend unerforscht in Untersuchungen von beliebigen Endzuständen. / In this thesis a search is presented for heavy neutral pseudoscalar A and scalar H Higgs bosons, produced in gg fusion and decaying into a top-antitop quark pair. The search is conducted on the full proton-proton collisions dataset recorded by the ATLAS detector at the Large Hadron Collider at a centre-of-mass collision energy of 8 TeV and corresponding to an integrated luminosity of 20.3 fb−1. The signal process and the main background from top quark pair production via the gg fusion strong process, interfere heavily, distorting the signal shape from the pure Breit-Wigner resonance peak to a peak-dip structure. This analysis is the first one at the LHC that fully takes into account the interference between a signal and the background processes. The search relies on the statistical analysis of the top quark pair invariant mass spectrum, which is reconstructed in signal candidate events with a high-transverse momentum electron or muon, large missing transverse energy from the undetected neutrino and at least four jets. No significant deviation from the expected SM background is observed in data. Exclusion limits are derived in the context of the type-II Two-Higgs-Doublet Model, for Higgs boson masses of 500 and 750 GeV and in the low tan(\beta) parameter region, where tan(\beta) is the ratio of the vacuum expectation values of the two Higgs doublet fields. These parameter regions have been largely unexplored by searches in any final state. Physik LHC ATLAS Higgs 8TeV Skalar Pseudoskalar A H schwer Interferenz Top-Quark Top-Quark-Paar ttbar Top-Schleife Breit-Wigner Peak-Dip-Struktur semileptonisch 2HDM BSM physics LHC ATLAS Higgs 8TeV scalar pseudoscalar A H heavy interference top-quark top-quark-pair ttbar top-loop Breit-Wigner peak-dip semileptonic 2HDM BSM 530 Physik ddc:530
14	Modelling closed-loop receptive fields: On the formation and utility of receptive fields in closed-loop behavioural systems / Entwicklung rezeptiver Felder in autonom handelnden, rückgekoppelten Systemen Kulvicius, Tomas 20 April 2010 (has links) No description available. 500 Naturwissenschaften allgemein Mathematics and Computer Science Computational Neuroscience Rückgekoppelte Systeme Wahrnehmungs-Handlungs-Schleife Visuelle Rezeptive Felder Temporal-Sequence-Learning-Algorithmus Autonom Fahrender Roboter Entropie Energie Eingang/Ausgangs-Verhältnis Optimale Agenten Ortsfelder Q-Lernalgorithmus Eigenmarkierung Zielgerichtete Orientierung Computational Neuroscience Closed-Loop Learning Systems Sensorimotor Loop Visual Receptive Fields Temporal Sequence Learning Driving Robot Entropy Input/Output Ratio Energy Optimal Agents Place Fields Place Field Remapping Q-learning Self-marking Goal Directed Navigation 30.03 RA 000: Allgemeine Naturwissenschaften

Page generated in 0.0423 seconds