481

Self-concept, self-reinforcement, and private speech

Southmayd, Stephen E. January 1975 (has links)
No description available.
482

Extinction of a free operant response in pre-school children following partial and regular schedules of primary and potential secondary reinforcement.

Fort, Jane Geraldine 01 January 1960 (has links) (PDF)
No description available.
483

On-policy Object Goal Navigation with Exploration Bonuses

Maia, Eric 15 August 2023 (has links)
Machine learning developments have helped overcome a wide range of problems, including robotic motion, autonomous navigation, and natural language processing. Of note are the advances of reinforcement learning in the area of object goal navigation, the task of autonomously traveling to target objects with minimal a priori knowledge of the environment. Given the sparse placement of goals in unknown scenes, exploration is essential for reaching remote objects of interest that are not immediately visible to autonomous agents. Reward sparsity, a central problem in reinforcement learning, is acute in object goal navigation, as a positive reward is only obtained when the target is found at the end of an agent's trajectory. As such, this work explores object goal navigation and the challenges it presents, along with the relevant reinforcement learning techniques applied to the task. An ablation study of the baseline approach for the RoboTHOR 2021 object goal navigation challenge is presented and used to guide the development of an on-policy agent that is computationally less expensive and achieves greater success in unseen environments. Then, original object goal navigation reward schemes that aggregate episodic and long-term novelty bonuses are proposed; they obtain success rates comparable to the respective object goal navigation benchmark at a fraction of the training interactions with the environment.
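The reward scheme described above aggregates an episodic novelty bonus with a long-term one. A minimal count-based sketch of that aggregation follows; the state abstraction, scale factors, and inverse-square-root form are illustrative assumptions, not the thesis's actual formulation.

```python
from collections import defaultdict

class NoveltyBonusReward:
    """Illustrative reward shaping: adds an episodic and a long-term (lifetime)
    novelty bonus, both count-based here for simplicity, to the sparse task reward."""

    def __init__(self, episodic_scale=0.05, lifetime_scale=0.01):
        self.episodic_scale = episodic_scale      # weight of the within-episode bonus
        self.lifetime_scale = lifetime_scale      # weight of the across-training bonus
        self.lifetime_counts = defaultdict(int)   # persists across episodes
        self.episodic_counts = defaultdict(int)   # reset at the start of every episode

    def reset_episode(self):
        self.episodic_counts.clear()

    def __call__(self, state_key, task_reward):
        # state_key is a hashable abstraction of the observation,
        # e.g. a discretized agent pose in the scene (an assumption for this sketch).
        self.episodic_counts[state_key] += 1
        self.lifetime_counts[state_key] += 1
        episodic_bonus = self.episodic_scale / self.episodic_counts[state_key] ** 0.5
        lifetime_bonus = self.lifetime_scale / self.lifetime_counts[state_key] ** 0.5
        # Aggregate the sparse task reward with the two novelty bonuses.
        return task_reward + episodic_bonus + lifetime_bonus
```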
484

Reinforcement and Cut Growth in Swollen and Unswollen Filled Rubber Compounds

Chai, Xiaoli 12 May 2008 (has links)
No description available.
485

Robot Navigation in Cluttered Environments with Deep Reinforcement Learning

Weideman, Ryan 01 June 2019 (has links) (PDF)
The application of robotics in cluttered and dynamic environments presents a wealth of challenges. This thesis proposes a deep reinforcement learning based system that determines collision-free robot navigation velocities directly from a sequence of depth images and a desired direction of travel. The system is designed such that a real robot could be placed in an unmapped, cluttered environment and navigate in a desired direction with no prior knowledge. Deep Q-learning, coupled with the innovations of double Q-learning and dueling Q-networks (D3QN), is applied. Two modifications of this architecture are presented that incorporate heading information the reinforcement learning agent can use to learn to navigate to target locations while avoiding obstacles. The performance of these two extensions of the D3QN architecture is evaluated in simulation in simple and complex environments with a variety of common obstacles. Results show that both modifications enable the agent to successfully navigate to target locations, reaching 88% and 67% of goals in a cluttered environment, respectively.
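As a rough illustration of the architecture named above, the sketch below shows a dueling Q-network over stacked depth images with heading information concatenated to the flattened features, plus a double Q-learning target. The input size, layer shapes, and the way the heading is injected are assumptions for illustration, not the thesis's exact design.

```python
import torch
import torch.nn as nn

class DuelingQNetwork(nn.Module):
    """Dueling Q-network over stacked depth images; the desired heading is
    appended to the flattened convolutional features (one possible injection point)."""

    def __init__(self, in_channels, num_actions, heading_dim=2):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        feat_dim = 64 * 7 * 7 + heading_dim  # assumes 84x84 depth input
        self.value = nn.Sequential(nn.Linear(feat_dim, 512), nn.ReLU(), nn.Linear(512, 1))
        self.advantage = nn.Sequential(nn.Linear(feat_dim, 512), nn.ReLU(), nn.Linear(512, num_actions))

    def forward(self, depth_stack, heading):
        feats = torch.cat([self.conv(depth_stack), heading], dim=1)
        value = self.value(feats)
        adv = self.advantage(feats)
        # Dueling aggregation: Q(s, a) = V(s) + (A(s, a) - mean_a A(s, a))
        return value + adv - adv.mean(dim=1, keepdim=True)

def double_q_target(online_net, target_net, reward, next_depth, next_heading, done, gamma=0.99):
    """Double Q-learning target: the online net selects the next action,
    the target net evaluates it."""
    with torch.no_grad():
        best_actions = online_net(next_depth, next_heading).argmax(dim=1, keepdim=True)
        next_q = target_net(next_depth, next_heading).gather(1, best_actions).squeeze(1)
        return reward + gamma * (1.0 - done) * next_q
```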
486

A constitutive equation for carbon black filled elastomers

Oswal, Ravinder Kumar. January 1980 (has links)
Thesis: M.S., Massachusetts Institute of Technology, Department of Chemical Engineering, 1980. Includes bibliographical references. By Ravinder Kumar Oswal.
487

Influencing Exploration in Actor-Critic Reinforcement Learning Algorithms

Gough, Andrew R 01 June 2018 (has links) (PDF)
Reinforcement Learning (RL) is a subset of machine learning primarily concerned with goal-directed learning and optimal decision making. RL agents learn from a reward signal discovered through trial and error in complex, uncertain environments, with the goal of maximizing cumulative reward. RL approaches need to scale up as they are applied to more complex environments with extremely large state spaces. Inefficient exploration methods cannot sufficiently explore complex environments in a reasonable amount of time; optimal policies go unrealized, and RL agents fail to solve the environment. This thesis proposes a novel variant of the Advantage Actor-Critic (A2C) algorithm. The variant is validated against two state-of-the-art RL algorithms, Deep Q-Network (DQN) and A2C, across six Atari 2600 games of varying difficulty. The experimental results are competitive with the state of the art and achieve lower variance and faster learning. Additionally, the thesis introduces a metric to objectively quantify the difficulty of any Markovian environment with respect to the exploratory capacity of RL agents.
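For reference, a minimal sketch of the standard A2C objective is given below; the entropy coefficient is one common knob for influencing exploration. This reflects the textbook formulation, not the specific variant proposed in the thesis.

```python
import torch
import torch.nn.functional as F

def a2c_loss(log_probs, values, returns, entropies, value_coef=0.5, entropy_coef=0.01):
    """Standard A2C objective: policy gradient weighted by the advantage,
    a value-regression term, and an entropy bonus that encourages exploration."""
    advantages = returns - values.detach()          # A(s, a) = R - V(s)
    policy_loss = -(log_probs * advantages).mean()  # actor term
    value_loss = F.mse_loss(values, returns)        # critic term
    entropy_bonus = entropies.mean()                # higher entropy -> more exploration
    return policy_loss + value_coef * value_loss - entropy_coef * entropy_bonus
```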
488

Machine Translation For Machines

Tebbifakhr, Amirhossein 25 October 2021 (has links)
Traditionally, Machine Translation (MT) systems are developed by targeting fluency (i.e. output grammaticality) and adequacy (i.e. semantic equivalence with the source text), criteria that reflect the needs of human end-users. However, recent advancements in Natural Language Processing (NLP) and the introduction of NLP tools in commercial services have opened new opportunities for MT. A particularly relevant one is the application of NLP technologies in low-resource language settings, for which the paucity of training data reduces the possibility of training reliable services. In this specific condition, MT can come into play by enabling the so-called “translation-based” workarounds. The idea is simple: first, input texts in the low-resource language are translated into a resource-rich target language; then, the machine-translated text is processed by well-trained NLP tools in the target language; finally, the output of these downstream components is projected back to the source language. This results in a new scenario, in which the end-user of MT technology is no longer a human but another machine. We hypothesize that current MT training approaches are not optimal for this setting, in which the objective is to maximize the performance of a downstream tool fed with machine-translated text rather than human comprehension. Under this hypothesis, this thesis introduces a new research paradigm, which we named “MT for machines”, addressing a number of questions that arise from this novel view of the MT problem. Are there different quality criteria for humans and machines? What makes a good translation from the machine standpoint? What are the trade-offs between the two notions of quality? How to pursue machine-oriented objectives? How to serve different downstream components with a single MT system? How to exploit knowledge transfer to operate in different language settings with a single MT system? Elaborating on these questions, this thesis: i) introduces a novel and challenging MT paradigm, ii) proposes an effective method based on Reinforcement Learning and analyzes its possible variants, iii) extends the proposed method to multitask and multilingual settings so as to serve different downstream applications and languages with a single MT system, iv) studies the trade-off between machine-oriented and human-oriented criteria, and v) discusses the successful application of the approach in two real-world scenarios.
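To make the machine-oriented objective concrete, here is a minimal REINFORCE-style sketch in which the reward is the score a downstream NLP component assigns to the sampled translation rather than a human-oriented metric. The `mt_model.sample` and `downstream_reward_fn` interfaces are hypothetical placeholders, not the thesis's actual implementation.

```python
import torch

def machine_oriented_reinforce_step(mt_model, downstream_reward_fn, src_batch, optimizer):
    """One policy-gradient step where the MT system is rewarded by a downstream
    component's score on its sampled output (e.g. confidence in the correct label)."""
    # Hypothetical interface: sampled translations plus per-token log-probabilities
    # of shape (batch, seq_len).
    samples, log_probs = mt_model.sample(src_batch)
    with torch.no_grad():
        rewards = downstream_reward_fn(samples)   # one scalar reward per sentence
        baseline = rewards.mean()                 # simple baseline for variance reduction
    # REINFORCE loss: scale sentence log-probability by the centered reward.
    loss = -((rewards - baseline) * log_probs.sum(dim=1)).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item(), rewards.mean().item()
```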
489

The effects of positive and negative reinforcement on a learning task in hospitalized patients

Lancaster, Gary Robert 01 January 1968 (has links) (PDF)
A number of writers have suggested that, in comparison to normals, schizophrenics are less responsive to positive rewards or reinforcers (e.g., Hunt & Cofer, 1944) and overly sensitive to punishment or social censure (Fromm-Reichmann, 1954). Garmezy & Rodnick (1957) have proposed that schizophrenics are highly sensitive to any censure or disapproval arising from their interpersonal contacts. They further argue that such intolerable levels of anxiety are aroused that schizophrenics are much more strongly motivated than normals to reduce that anxiety by acting to avoid or escape the censorious aspects of the situation.
490

Evolutionary Optimization of Decision Trees for Interpretable Reinforcement Learning

Custode, Leonardo Lucio 27 April 2023 (has links)
While Artificial Intelligence (AI) is making giant steps, it is also raising concerns about its trustworthiness, because widely used black-box models cannot be fully understood by humans. One of the ways to improve humans' trust in AI is to use interpretable AI models, i.e., models that can be thoroughly understood by humans, and thus trusted. However, interpretable AI models are not typically used in practice, as they are thought to perform worse than black-box models. This is especially evident in Reinforcement Learning, where relatively little work addresses the problem of performing Reinforcement Learning with interpretable models. In this thesis, we address this gap, proposing methods for Interpretable Reinforcement Learning. For this purpose, we optimize Decision Trees by combining Reinforcement Learning with Evolutionary Computation techniques, which allows us to overcome some of the challenges tied to optimizing Decision Trees in Reinforcement Learning scenarios. The experimental results show that these approaches achieve scores competitive with the state of the art while being far easier to interpret. Finally, we show the practical importance of Interpretable AI by digging into the inner workings of the solutions obtained.
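As a toy illustration of combining Decision Trees with Evolutionary Computation for RL, the sketch below evolves tiny decision-tree policies whose fitness is the episode return in a Gymnasium-style environment. The tree representation, mutation operator, and selection scheme are simplified assumptions, not the method proposed in the thesis.

```python
import copy
import random

class TreeNode:
    """A tiny decision-tree policy: internal nodes test one observation feature
    against a threshold; leaves store a discrete action."""
    def __init__(self, feature=None, threshold=None, left=None, right=None, action=None):
        self.feature, self.threshold = feature, threshold
        self.left, self.right, self.action = left, right, action

    def act(self, obs):
        if self.action is not None:                       # leaf node
            return self.action
        branch = self.left if obs[self.feature] < self.threshold else self.right
        return branch.act(obs)

def episode_return(policy, env, max_steps=500):
    """Fitness = undiscounted return of one episode (assumes a Gymnasium-style env)."""
    obs, _ = env.reset()
    total = 0.0
    for _ in range(max_steps):
        obs, reward, terminated, truncated, _ = env.step(policy.act(obs))
        total += reward
        if terminated or truncated:
            break
    return total

def mutate(tree, rate):
    """Randomly perturb thresholds and, at leaves, actions (assumes 2 discrete actions)."""
    if random.random() < rate:
        if tree.action is not None:
            tree.action = random.randrange(2)
        else:
            tree.threshold += random.uniform(-0.1, 0.1)
    if tree.left:
        mutate(tree.left, rate)
    if tree.right:
        mutate(tree.right, rate)
    return tree

def evolve(population, env, n_generations=50, mutation_rate=0.2):
    """Simple (mu + lambda)-style loop: evaluate, keep the best half, mutate copies."""
    for _ in range(n_generations):
        scored = sorted(population, key=lambda p: episode_return(p, env), reverse=True)
        parents = scored[: len(scored) // 2]
        children = [mutate(copy.deepcopy(random.choice(parents)), mutation_rate) for _ in parents]
        population = parents + children
    return population[0]   # best individual from the final selection
```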
