<p>This thesis is testing out the group of experts regime in the context of reinforcement learning with the aim of reducing the search space used in reinforcement learning. Having tested different abstracion levels with this approach, it is the hyphothesis that using this approach to reduce the search space is best done on a high abstraction level. All though reinforcement learning has many advantages in certain settings, and is a preferred tehcnique in many different contexts, it still has its challenges. This architecture does not solve these, but suggests a way of dealing with the curse of dimentionality, the scaling problem within reinforcement learning systems.</p>
Identifer | oai:union.ndltd.org:UPSALLA/oai:DiVA.org:ntnu-8703 |
Date | January 2007 |
Creators | Anderson, Tore Rune |
Publisher | Norwegian University of Science and Technology, Department of Computer and Information Science, Institutt for datateknikk og informasjonsvitenskap |
Source Sets | DiVA Archive at Upsalla University |
Language | English |
Detected Language | English |
Type | Student thesis, text |
Page generated in 0.0125 seconds