• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 1172
  • 341
  • 195
  • 154
  • 74
  • 51
  • 42
  • 24
  • 22
  • 21
  • 17
  • 17
  • 17
  • 17
  • 17
  • Tagged with
  • 2632
  • 985
  • 555
  • 366
  • 271
  • 228
  • 227
  • 219
  • 210
  • 202
  • 200
  • 178
  • 152
  • 147
  • 137
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.

Interdependencies in multiple fixed ratio reinforcement schedules

Sergio, Joseph P. January 1980 (has links)
Schuster (1959) and Keehn (1963), working with complex schedules of reinforcement, suggested that post-reinforcement pause (PRP) length was a function of the ratio requirements of the schedules' components. More specifically, Schuster indicated that a contrast effect may be the result of changes in schedule components from higher to lower requirements, or vice-versa. The present study attempted to determine the relationship between PRP length and response rate, and component requirements of a multiple fixed ratio schedule. Schedule interactions were examined when behavior was maintained at a given pair of schedules for a prolonged time, and when both schedules were studied at various values.In general, the results of the present study lend support to the position that component requirements of a multiple FR schedule do not interact, as demonstrated by the PRP, to cause a contrast effect to emerge. This study also indicates that an inverse relationship appears to exist between response rate and FR ratio size.

Autonomous inter-task transfer in reinforcement learning domains

Taylor, Matthew Edmund 07 September 2012 (has links)
Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. While these methods have had experimental successes and have been shown to exhibit some desirable properties in theory, the basic learning algorithms have often been found slow in practice. Therefore, much of the current RL research focuses on speeding up learning by taking advantage of domain knowledge, or by better utilizing agents’ experience. The ambitious goal of transfer learning, when applied to RL tasks, is to accelerate learning on some target task after training on a different, but related, source task. This dissertation demonstrates that transfer learning methods can successfully improve learning in RL tasks via experience from previously learned tasks. Transfer learning can increase RL’s applicability to difficult tasks by allowing agents to generalize their experience across learning problems. This dissertation presents inter-task mappings, the first transfer mechanism in this area to successfully enable transfer between tasks with different state variables and actions. Inter-task mappings have subsequently been used by a number of transfer researchers. A set of six transfer learning algorithms are then introduced. While these transfer methods differ in terms of what base RL algorithms they are compatible with, what type of knowledge they transfer, and what their strengths are, all utilize the same inter-task mapping mechanism. These transfer methods can all successfully use mappings constructed by a human from domain knowledge, but there may be situations in which domain knowledge is unavailable, or insufficient, to describe how two given tasks are related. We therefore also study how inter-task mappings can be learned autonomously by leveraging existing machine learning algorithms. Our methods use classification and regression techniques to successfully discover similarities between data gathered in pairs of tasks, culminating in what is currently one of the most robust mapping-learning algorithms for RL transfer. Combining transfer methods with these similarity-learning algorithms allows us to empirically demonstrate the plausibility of autonomous transfer. We fully implement these methods in four domains (each with different salient characteristics), show that transfer can significantly improve an agent’s ability to learn in each domain, and explore the limits of transfer’s applicability. / text

Rullarmering - Ett rationellt sätt att armera

Bertilsson, Thomas, Ekstrand, Mattias January 2008 (has links)
This is a degree project performed at Halmstad University in Sweden on the subject carpet of reinforcement. This project is developed in association with Celsa Steel Service in Halmstad. The purpose of this report is to investigate how much time that can be saved if carpet of reinforcement is used instead of the traditional way to work with reinforcement and see what kind of parameters that is important between the choices. We are also going to see how this product affects the situation for reinforcement workers from an ergonomic point of view. Carpet of reinforcement is made by straight reinforcement’s bars that are welded together with steel bands. These rolls are made at a factory with a machine that welds together the bars in determined centre distance, with desired dimensions after the construction engineers’ blueprints. The big advantage is that the work at the construction site becomes much easier. Also the number of reinforcement workers and the total time of the project can be reduced. All this together saves money. The soft parameters like work environment are also affected in a good way. The rolls are heavy and have to be lifted with a construction crane which saves the reinforcement iron workers body. The traditional work with reinforcement involves a lot of heavy lifting and trying work positions for the workers. Therefore is it fairly usual with wear injuries in the trade. We think that carpets of reinforcement may reduce the amount of wear injuries. Most of the workers that we interviewed seem to like the product. There are two different products that Celsa Steel Service provides the trade with. Bamtec is the oldest of these products and also the most common and appreciated. In connection with building of bridges it is forbidden to weld together the reinforcement because of the risk of fatique fractares in welded zones. Because of this, Bamtec can not be used in bridges. For use in bridges a new product called SpinMaster, without welding, has been developed. Carpet of reinforcement works best at large simple foundation slabs but can also be used in walls.

Response number under a fixed-interval schedule of reinforcement

Gentry, George David 08 1900 (has links)
No description available.

Complex schedules of reinforcement : response-reinforcer temporal relationships, the role of brief stimuli, and unit schedule performance as an operant in pigeons

Bradford, Linda DiAnne 08 1900 (has links)
No description available.

Two-key concurrent responding : choice and delays of reinforcement at constant relative immediacy of reinforcement

Gentry, George David 08 1900 (has links)
No description available.

Distribution of trials and the partial reinforcement effect after minimal acquisition

Young, Brian Jeffrey January 1970 (has links)
Both Frustration Theory and Sequential Effects Theory appear to account with some plausibility for the Partial Reinforcement Effect (PRE) and accompanying phenomena. On the basis of Frustration Theory, however, a PRE should not occur after only minimal acquisition training. On this point, the experimental evidence is conflicting. On the basis of analogous studies employing extended acquisition, trial distribution may be suggested as a factor, among others, which might affect the occurrence of the PRE. All previous studies showing evidence for a PRE after minimal acquisition employed intertrial intervals (ITI's) in acquisition and extinction of no more than 15 minutes. The problem which this study was concerned with was therefore whether a PRE could be obtained after only minimal acquisition when trials were distributed. Accordingly, it was hypothesized that a PRE would occur after a sequence of only five trials under a partial reinforcement (PRF) schedule in an 'RNRNR' sequence, compared with performance following continuous reinforcement (CRF), irrespective of whether trials were highly distributed or highly massed. A factorial 2x2 design was used in which four groups of Ss were trained under two conditions of reward schedule (PRF or CRF), and two conditions of ITI both in acquisition and extinction. The method, apparatus and procedure of a previous study (McCain, 1969) which showed evidence for a PRE after only minimal acquisition were replicated as far as possible, except for changes required by the design or on theoretical grounds. Evidence was found for a PRE under both conditions of ITI in terms of differences in extinction trends, confirming the hypothesis tested. Trial distribution was found to effect the mean running speeds shown by PRF and CRF groups such that greater resistance to extinction was shown by the PRF group in comparison to the CRF group when the ITI was 30 sec., while running measures did not show this effect when the ITI was 24 hours. This latter lack of a significant difference appeared to be mainly due to a reduction in PRF running speeds when the ITI was 24 hours rather than an increase in CRF running speeds. / Arts, Faculty of / Psychology, Department of / Graduate

Effects of Reinforcement History on Stimulus Control Relations

Reyes, Fredy 12 1900 (has links)
Ray (1969) conducted an experiment on multiple stimulus-response relations and selective attention. Ray's (1969) results suggested that stimulus-response relations function as behavioral units. McIlvane and Dube (1996) indicated that if stimulus-response relations are behavioral units the effects of environmental variables on stimulus-response relations should be similar to the effects of environmental variables on single response topographies. This experiment analyzed the effects of reinforcement history on the probability of stimulus-response relations with differing reinforcement histories. In separate conditions random-ratio schedules of reinforcement were contingent on each of four discriminated responses. To assess the effects of reinforcement, during test conditions stimuli controlling different topographies were present concurrently in composite form. Results show that reinforcement history affects the probability of each response topography and that the association between response topographies and their controlling stimuli tends to remain constant throughout variations in reinforcement probability.

Reinforced concrete beams with steel plates for shear

Abdullah, Mohd Sabri January 1993 (has links)
No description available.

Analysis of a cooperative response

Vogler, Roger Ernest, 1938- January 1966 (has links)
No description available.

Page generated in 0.1298 seconds