Global ETD Search

1	A leader-follower partially observed Markov game Chang, Yanling 07 January 2016 (has links) The intent of this dissertation is to generate a set of non-dominated finite-memory policies from which one of two agents (the leader) can select a most preferred policy to control a dynamic system that is also affected by the control decisions of the other agent (the follower). The problem is described by an infinite horizon total discounted reward, partially observed Markov game (POMG). Each agent’s policy assumes that the agent knows its current and recent state values, its recent actions, and the current and recent possibly inaccurate observations of the other agent’s state. For each candidate finite-memory leader policy, we assume the follower, fully aware of the leader policy, determines a policy that optimizes the follower’s criterion. The leader-follower assumption allows the POMG to be transformed into a specially structured, partially observed Markov decision process that we use to determine the follower’s best response policy for a given leader policy. We then present a value determination procedure to evaluate the performance of the leader for a given leader policy, based on which non-dominated set of leader polices can be selected by existing heuristic approaches. We then analyze how the value of the leader’s criterion changes due to changes in the leader’s quality of observation of the follower. We give conditions that insure improved observation quality will improve the leader’s value function, assuming that changes in the observation quality do not cause the follower to change its policy. We show that discontinuities in the value of the leader’ criterion, as a function of observation quality, can occur when the change of observation quality is significant enough for the follower to change its policy. We present conditions that determine when a discontinuity may occur and conditions that guarantee a discontinuity will not degrade the leader’s performance. This framework has been used to develop a dynamic risk analysis approach for U.S. food supply chains and to compare and create supply chain designs and sequential control strategies for risk mitigation. Risk analysis Markov decision process Real-time decision making Value of information
2	Simulation-optimization in real-time decision making Zhang, Xuemei January 1997 (has links) No description available. Real-Time Simulation Real-Time Decision Making Tabu Search Short-Term Memory Component
3	An Electric Field Approach : A Strategy for Sony Four-Legged Robot Soccer Johansson, John January 2001 (has links) Using physical analogies when solving computational problems is not uncommon. The Electric Field Approach is such an analogy using potential fields for describing the situation in a robotic soccer match and for proposing the next step in optimizing the situation. The approach was developed and implemented during the summer of 2000 and was later used and tested during the fourth Robot Soccer World Cup in Melbourne, Australia. The results of the games prove that the approach is applicable in this narrow but interesting domain, combining artificial intelligence and robotics. The theory of the approach is general and can also be applied on various other domains. This is not the first potential field approach, but the ability of both handling navigation and manipulation of the environment is unique. robocup potential fields real-time decision-making legged robot AIBO Computer Sciences Datavetenskap (datalogi) Software Engineering Programvaruteknik

1

Page generated in 0.0715 seconds