Global ETD Search

Return to search

Hraní nedeterministických her s učením / Playing of Nondeterministic Games with Learning

The thesis is dedicated to the study and implementation of methods used for learning from the course of playing. The chosen game for this thesis is Backgammon. The algorithm used for training neural networks is called the temporal difference learning with use of eligible traces. This algorithm is also known as TD(lambda). The theoretical part describes algorithms for playing games without learning, introduction to reinforcement learning, temporal difference learning and introduction to artificial neural networks. The practical part deals with application of combination of neural networks and TD(lambda) algorithms.

http://www.nusl.cz/ntk/nusl-237050

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:237050
Date	January 2011
Creators	Bukovský, Marek
Contributors	Rozman, Jaroslav, Zbořil, František
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0025 seconds

Hraní nedeterministických her s učením / Playing of Nondeterministic Games with Learning

Description

Links & Downloads

Tags

Additional Fields