Global ETD Search

1	Reglerentwurf zur dezentralen Online-Steuerung von Lichtsignalanlagen in Straßennetzwerken Lämmer, Stefan 05 November 2007 (has links) (PDF) Die Dissertationsschrift widmet sich einer systemtheoretischen Untersuchung zur verkehrsabhängigen Steuerung von Lichtsignalanlagen in Straßennetzwerken. Aus einem mathematischen Modell für den Verkehrsablauf auf Knotenzufahrten wird ein Verfahren abgeleitet, mit dem sich Umschaltzeitpunkte und Phasenwechsel flexibel an das tatsächliche Verkehrsgeschehen anpassen lassen. Der Ansatzpunkt ist, die einzelnen Knotenpunkte des Netzwerks lokal zu optimieren. Eine &quot;Grüne Welle&quot; soll sich von selbst einstellen, und zwar genau dann, wenn dadurch lokal Wartezeiten eingespart werden. Indem die lokale Optimierung in ein lokales Stabilisierungsverfahren eingebettet wird, können Instabilitäten aufgrund netzwerkweiter Rückkopplungen ausgeschlossen werden. Das vorgestellte Verfahren setzt sich aus drei Teilen zusammen: (i) einem lokalen Prognoseverfahren zur Bewertung von Schaltzuständen und Phasenübergängen bezüglich zukünftig entstehender Wartezeiten, (ii) einem lokalen Optimierungsverfahren, das jeder Phase einen dynamischen Prioritätsindex zuweist und die Phase mit höchster Priorität zur Bedienung auswählt und (iii) einem lokalen Stabilisierungsverfahren, das zum Einhalten einer mittleren und einer maximalen Bedienperiode korrigierend in die lokale Optimierung eingreift. Indem die Knotenpunkte ausschließlich über die Verkehrsströme gekoppelt sind, ergeben sich die Umschaltzeitpunkte unmittelbar aus den Ankunftszeitpunkten der Fahrzeuge selbst. Die Phasenwechsel stellen sich somit von selbst bedarfsgerecht ein. Simulationsergebnisse machen deutlich, dass sich aufgrund der höheren Flexibilität sowohl die Wartezeiten als auch der Kraftstoffverbrauch senken lassen. Lichtsignalsteuerung Dezentralisierung Verkehrsregelung Ampel Ampelsteuerung Straßenverkehr Instabilität Stabilität Wartezeiten Straßenverkehrstechnik Stau Grüne Welle Prognoseverfahren Optimierungsverfahren Stabilisierungsverfahren traffic light control self organization decentralization stabilization queueing theory hybrid dynamical systems optimization ddc:620 rvk:ZO 4620
2	Reglerentwurf zur dezentralen Online-Steuerung von Lichtsignalanlagen in Straßennetzwerken Lämmer, Stefan 18 September 2007 (has links) Die Dissertationsschrift widmet sich einer systemtheoretischen Untersuchung zur verkehrsabhängigen Steuerung von Lichtsignalanlagen in Straßennetzwerken. Aus einem mathematischen Modell für den Verkehrsablauf auf Knotenzufahrten wird ein Verfahren abgeleitet, mit dem sich Umschaltzeitpunkte und Phasenwechsel flexibel an das tatsächliche Verkehrsgeschehen anpassen lassen. Der Ansatzpunkt ist, die einzelnen Knotenpunkte des Netzwerks lokal zu optimieren. Eine &quot;Grüne Welle&quot; soll sich von selbst einstellen, und zwar genau dann, wenn dadurch lokal Wartezeiten eingespart werden. Indem die lokale Optimierung in ein lokales Stabilisierungsverfahren eingebettet wird, können Instabilitäten aufgrund netzwerkweiter Rückkopplungen ausgeschlossen werden. Das vorgestellte Verfahren setzt sich aus drei Teilen zusammen: (i) einem lokalen Prognoseverfahren zur Bewertung von Schaltzuständen und Phasenübergängen bezüglich zukünftig entstehender Wartezeiten, (ii) einem lokalen Optimierungsverfahren, das jeder Phase einen dynamischen Prioritätsindex zuweist und die Phase mit höchster Priorität zur Bedienung auswählt und (iii) einem lokalen Stabilisierungsverfahren, das zum Einhalten einer mittleren und einer maximalen Bedienperiode korrigierend in die lokale Optimierung eingreift. Indem die Knotenpunkte ausschließlich über die Verkehrsströme gekoppelt sind, ergeben sich die Umschaltzeitpunkte unmittelbar aus den Ankunftszeitpunkten der Fahrzeuge selbst. Die Phasenwechsel stellen sich somit von selbst bedarfsgerecht ein. Simulationsergebnisse machen deutlich, dass sich aufgrund der höheren Flexibilität sowohl die Wartezeiten als auch der Kraftstoffverbrauch senken lassen. info:eu-repo/classification/ddc/620 ddc:620
3	A Deep Reinforcement Learning Approach for Dynamic Traffic Light Control with Transit Signal Priority Nousch, Tobias, Zhou, Runhao, Adam, Django, Hirrle, Angelika, Wang, Meng 23 June 2023 (has links) Traffic light control (TLC) with transit signal priority (TSP) is an effective way to deal with urban congestion and travel delay. The growing amount of available connected vehicle data offers opportunities for signal control with transit priority, but the conventional control algorithms fall short in fully exploiting those datasets. This paper proposes a novel approach for dynamic TLC with TSP at an urban intersection. We propose a deep reinforcement learning based framework JenaRL to deal with the complex real-world intersections. The optimisation focuses on TSP while balancing the delay of all vehicles. A two-layer state space is defined to capture the real-time traffic information, i.e. vehicle position, type and incoming lane. The discrete action space includes the optimal phase and phase duration based on the real-time traffic situation. An intersection in the inner city of Jena is constructed in an open-source microscopic traffic simulator SUMO. A time-varying traffic demand of motorised individual traffic (MIT), the current TLC controller of the city, as well as the original timetables of the public transport (PT) are implemented in simulation to construct a realistic traffic environment. The results of the simulation with the proposed framework indicate a significant enhancement in the performance of traffic light controller by reducing the delay of all vehicles, and especially minimising the loss time of PT. info:eu-repo/classification/ddc/360 ddc:360
4	Prioritization of an Automated Shuttle for V2X Public Transport at a Signalized Intersection – A Real-life Demonstration Halbach, Maik, Wesemeyer, Daniel, Merk, Lukas, Lauermann, Jan, Heß, Daniel, Kaul, Robert 23 June 2023 (has links) Public transport prioritization is used at signalized intersections to reduce travel times and increase the attractiveness of public transport. In the future, analog communication technologies for public transport prioritization are soon to be replaced by the promising vehicle-to-everything (V2X) technology. This abstract presents a holistic approach using V2X communication in public transport prioritization for an automated vehicle. In order to take full advantage of the V2X technology, this means to V2X-enable the traffic infrastructure and change the way of communication as well as the traffic light control. The approach was implemented and tested under real-life conditions at the research intersection Tostmannplatz in Braunschweig. info:eu-repo/classification/ddc/360 ddc:360
5	Resource Allocation for Sequential Decision Making Under Uncertainaty : Studies in Vehicular Traffic Control, Service Systems, Sensor Networks and Mechanism Design Prashanth, L A January 2013 (has links) (PDF) A fundamental question in a sequential decision making setting under uncertainty is “how to allocate resources amongst competing entities so as to maximize the rewards accumulated in the long run?”. The resources allocated may be either abstract quantities such as time or concrete quantities such as manpower. The sequential decision making setting involves one or more agents interacting with an environment to procure rewards at every time instant and the goal is to find an optimal policy for choosing actions. Most of these problems involve multiple (infinite) stages and the objective function is usually a long-run performance objective. The problem is further complicated by the uncertainties in the sys-tem, for instance, the stochastic noise and partial observability in a single-agent setting or private information of the agents in a multi-agent setting. The dimensionality of the problem also plays an important role in the solution methodology adopted. Most of the real-world problems involve high-dimensional state and action spaces and an important design aspect of the solution is the choice of knowledge representation. The aim of this thesis is to answer important resource allocation related questions in different real-world application contexts and in the process contribute novel algorithms to the theory as well. The resource allocation algorithms considered include those from stochastic optimization, stochastic control and reinforcement learning. A number of new algorithms are developed as well. The application contexts selected encompass both single and multi-agent systems, abstract and concrete resources and contain high-dimensional state and control spaces. The empirical results from the various studies performed indicate that the algorithms presented here perform significantly better than those previously proposed in the literature. Further, the algorithms presented here are also shown to theoretically converge, hence guaranteeing optimal performance. We now briefly describe the various studies conducted here to investigate problems of resource allocation under uncertainties of different kinds: Vehicular Traffic Control The aim here is to optimize the ‘green time’ resource of the individual lanes in road networks that maximizes a certain long-term performance objective. We develop several reinforcement learning based algorithms for solving this problem. In the infinite horizon discounted Markov decision process setting, a Q-learning based traffic light control (TLC) algorithm that incorporates feature based representations and function approximation to handle large road networks is proposed, see Prashanth and Bhatnagar [2011b]. This TLC algorithm works with coarse information, obtained via graded thresholds, about the congestion level on the lanes of the road network. However, the graded threshold values used in the above Q-learning based TLC algorithm as well as several other graded threshold-based TLC algorithms that we propose, may not be optimal for all traffic conditions. We therefore also develop a new algorithm based on SPSA to tune the associated thresholds to the ‘optimal’ values (Prashanth and Bhatnagar [2012]). Our thresh-old tuning algorithm is online, incremental with proven convergence to the optimal values of thresholds. Further, we also study average cost traffic signal control and develop two novel reinforcement learning based TLC algorithms with function approximation (Prashanth and Bhatnagar [2011c]). Lastly, we also develop a feature adaptation method for ‘optimal’ feature selection (Bhatnagar et al. [2012a]). This algorithm adapts the features in a way as to converge to an optimal set of features, which can then be used in the algorithm. Service Systems The aim here is to optimize the ‘workforce’, the critical resource of any service system. However, adapting the staffing levels to the workloads in such systems is nontrivial as the queue stability and aggregate service level agreement (SLA) constraints have to be complied with. We formulate this problem as a constrained hidden Markov process with a (discrete) worker parameter and propose simultaneous perturbation based simulation optimization algorithms for this purpose. The algorithms include both first order as well as second order methods and incorporate SPSA based gradient estimates in the primal, with dual ascent for the Lagrange multipliers. All the algorithms that we propose are online, incremental and are easy to implement. Further, they involve a certain generalized smooth projection operator, which is essential to project the continuous-valued worker parameter updates obtained from the SASOC algorithms onto the discrete set. We validate our algorithms on five real-life service systems and compare their performance with a state-of-the-art optimization tool-kit OptQuest. Being ��times faster than OptQuest, our scheme is particularly suitable for adaptive labor staffing. Also, we observe that it guarantees convergence and ﬁnds better solutions than OptQuest in many cases. Wireless Sensor Networks The aim here is to allocate the ‘sleep time’ (resource) of the individual sensors in an intrusion detection application such that the energy consumption from the sensors is reduced, while keeping the tracking error to a minimum. We model this sleep–wake scheduling problem as a partially-observed Markov decision process (POMDP) and propose novel RL-based algorithms -with both long-run discounted and average cost objectives -for solving this problem. All our algorithms incorporate function approximation and feature-based representations to handle the curse of dimensionality. Further, the feature selection scheme used in each of the proposed algorithms intelligently manages the energy cost and tracking cost factors, which in turn, assists the search for the optimal sleeping policy. The results from the simulation experiments suggest that our proposed algorithms perform better than a recently proposed algorithm from Fuemmeler and Veeravalli [2008], Fuemmeler et al. [2011]. Mechanism Design The setting here is of multiple self-interested agents with limited capacities, attempting to maximize their individual utilities, which often comes at the expense of the group’s utility. The aim of the resource allocator here then is to efficiently allocate the resource (which is being contended for, by the agents) and also maximize the social welfare via the ‘right’ transfer of payments. In other words, the problem is to find an incentive compatible transfer scheme following a socially efficient allocation. We present two novel mechanisms with progressively realistic assumptions about agent types aimed at economic scenarios where agents have limited capacities. For the simplest case where agent types consist of a unit cost of production and a capacity that does not change with time, we provide an enhancement to the static mechanism of Dash et al. [2007] that effectively deters misreport of the capacity type element by an agent to receive an allocation beyond its capacity, which thereby damages other agents. Our model incorporates an agent’s preference to harm other agents through a additive factor in the utility function of an agent and the mechanism we propose achieves strategy proofness by means of a novel penalty scheme. Next, we consider a dynamic setting where agent types evolve and the individual agents here again have a preference to harm others via capacity misreports. We show via a counterexample that the dynamic pivot mechanism of Bergemann and Valimaki [2010] cannot be directly applied in our setting with capacity-limited alim¨agents. We propose an enhancement to the mechanism of Bergemann and V¨alim¨aki [2010] that ensures truth telling w.r.t. capacity type element through a variable penalty scheme (in the spirit of the static mechanism). We show that each of our mechanisms is ex-post incentive compatible, ex-post individually rational, and socially efficient Vehicular Traffic Control Service Systems Sensor Networks Mechanism Design Traffic Signal Control - Q-Learning Traffic Signal Control Signal Control - Threshold Tuning Traffic Light Control Algorithm Adaptive Labor Staffing Sleep-Wake Scheduling Algorithms Reinforcement Learning Vehicular Control Graded Signal Control Adaptive Sleep–wake Control Computer Science

1

Page generated in 0.0797 seconds