Global ETD Search

1	Induction of environment and goal models by an adaptive agent in deterministic environment / Adaptyvaus agento aplinkos ir tikslo modelių indukcija deterministinėje aplinkoje Kapočiūtė-Dzikienė, Jurgita 01 February 2011 (has links) If the laws of percepts change, which are described by deterministic Markov decision process, exist in the observable or partially observable environment, then the agent by interacting with the environment and having no initial knowledge is capable to discover those laws using the methods of logical and constructive induction (i.e. is capable to learn environment and goal models); it is capable to learn predicting precisely its own consequences of actions and apply this learned knowledge in order to achieve its own goals in the new unseen situations. Adaptive agent proposed in this dissertation differs from other similar works presented in the literature in three novel potentials that enables this agent to solve the problem of knowledge transferability from one environment into another, when the same laws are valid for the environments; to solve the problem of goal percepts generalization; and to solve the problem of perceptual aliasing in the partially observable environment. During the investigations it was discovered that adaptive agent using created environment model solves the tasks of knowledge transferability into new environments better than other alternative agents (based on Q-learning and ADP methods); using created goal model solves the goal percepts generalization tasks by correctly approximating the reward function and predicting its values in the new environments; solves the problem of perceptual aliasing by transforming the deterministic nth order Markov... [to full text] / Jei stebimoje ar iš dalies stebimoje aplinkoje galioja būsenų kaitos dėsniai, nusakomi deterministiniu Markovo sprendimo procesu, tai agentas, sąveikaudamas su aplinka ir neturėdamas jokių pradinių žinių, gali šiuos dėsnius atrasti loginės ir konstrukcinės indukcijos metodais (išmokti aplinkos ir tikslo modelius), gali išmokti tiksliai prognozuoti savo veiksmų pasekmes ir taikyti šias žinias, kad greičiau pasiektų savo tikslus naujose nematytose situacijose. Disertacijoje siūlomas adaptyvus agentas nuo literatūroje pristatomų panašių darbų skiriasi trimis naujomis galimybėmis, nes: geba spręsti vienoje aplinkoje išmoktų žinių perkeliamumo į naujas aplinkas problemą, kai aplinkoms galioja tie patys dėsniai; tikslo stebėjimų apibendrinimo problemą; stebėjimų daugiareikšmiškumo problemą dalinai stebimoje aplinkoje. Tyrimų metu nustatyta, kad adaptyvus agentas, naudodamas sukurtą aplinkos modelį, žinių perkeliamumo į naujas aplinkas uždavinius sprendžia geriau nei kiti alternatyvūs agentai (grindžiami Q-mokymu ir ADP metodais); tikslo stebėjimų apibendrinimo uždavinius, naudodamas sukurtą tikslo modelį, sprendžia teisingai aproksimuodamas atlygio funkciją ir prognozuodamas pastiprinimo reikšmes naujose aplinkose; stebėjimų daugiareikšmiškumo problemą sprendžia pertvarkydamas deterministinį n-tos eilės Markovo sprendimo procesą į 1-os eilės ir jam sukurdamas aplinkos modelį, atitinkantį baigtinį Muro automatą. Informatics Artifitial intelligence Reinforcement learning Planning Dirbtinis intelektas Mokymas pastiprinimu Planavimas
2	Adaptyvaus agento aplinkos ir tikslo modelių indukcija deterministinėje aplinkoje / Induction of environment and goal models by an adaptive agent in deterministic environment Kapočiūtė-Dzikienė, Jurgita 01 February 2011 (has links) Jei stebimoje ar iš dalies stebimoje aplinkoje galioja būsenų kaitos dėsniai, nusakomi deterministiniu Markovo sprendimo procesu, tai agentas, sąveikaudamas su aplinka ir neturėdamas jokių pradinių žinių, gali šiuos dėsnius atrasti loginės ir konstrukcinės indukcijos metodais (išmokti aplinkos ir tikslo modelius), gali išmokti tiksliai prognozuoti savo veiksmų pasekmes ir taikyti šias žinias, kad greičiau pasiektų savo tikslus naujose nematytose situacijose. Disertacijoje siūlomas adaptyvus agentas nuo literatūroje pristatomų panašių darbų skiriasi trimis naujomis galimybėmis, nes: geba spręsti vienoje aplinkoje išmoktų žinių perkeliamumo į naujas aplinkas problemą, kai aplinkoms galioja tie patys dėsniai; tikslo stebėjimų apibendrinimo problemą; stebėjimų daugiareikšmiškumo problemą dalinai stebimoje aplinkoje. Tyrimų metu nustatyta, kad adaptyvus agentas, naudodamas sukurtą aplinkos modelį, žinių perkeliamumo į naujas aplinkas uždavinius sprendžia geriau nei kiti alternatyvūs agentai (grindžiami Q-mokymu ir ADP metodais); tikslo stebėjimų apibendrinimo uždavinius, naudodamas sukurtą tikslo modelį, sprendžia teisingai aproksimuodamas atlygio funkciją ir prognozuodamas pastiprinimo reikšmes naujose aplinkose; stebėjimų daugiareikšmiškumo problemą sprendžia pertvarkydamas deterministinį n-tos eilės Markovo sprendimo procesą į 1-os eilės ir jam sukurdamas aplinkos modelį, atitinkantį baigtinį Muro automatą. / If the laws of percepts change, which are described by deterministic Markov decision process, exist in the observable or partially observable environment, then the agent by interacting with the environment and having no initial knowledge is capable to discover those laws using the methods of logical and constructive induction (i.e. is capable to learn environment and goal models); it is capable to learn predicting precisely its own consequences of actions and apply this learned knowledge in order to achieve its own goals in the new unseen situations. Adaptive agent proposed in this dissertation differs from other similar works presented in the literature in three novel potentials that enables this agent to solve the problem of knowledge transferability from one environment into another, when the same laws are valid for the environments; to solve the problem of goal percepts generalization; and to solve the problem of perceptual aliasing in the partially observable environment. During the investigations it was discovered that adaptive agent using created environment model solves the tasks of knowledge transferability into new environments better than other alternative agents (based on Q-learning and ADP methods); using created goal model solves the goal percepts generalization tasks by correctly approximating the reward function and predicting its values in the new environments; solves the problem of perceptual aliasing by transforming the deterministic nth order Markov... [to full text] Informatics Adaptyvi elgsena Mokymas pastiprinimu Planavimas Adaptive behavior Reinforcement learning Planning
3	Mokymo pastiprinimu metodų pritaikymas kompiuterinio žaidimo personažui interaktyvioje 2D/3D grafikos sistemoje / An adaptation of reinforcement learning methods to the computer game character in interactive 2D/3D graphic system Miliukas, Vaclovas 15 June 2011 (has links) Šiame darbe buvo pasiūlyti dirbtinio intelekto pritaikymo būdai bei metodai, kompiuterinio žaidimo personažui, dviejų personažų koviniam žaidimui. Pasiūlytais būdais bei metodais buvo bandoma pritaikyti dirbtinį intelektą kompiuterinio žaidimo personažui, šio tipo žaidimui. Buvo tiriama ar pasliūlyti būdai ir metodai tinka šiai kompiuterinių žaidimų klasei. Taip pat buvo tiriamas agento mokymosi greitis, nuo ko jis priklauso bei ieškomi faktoriai, kurie lemia tokio tipo žaidimo personažams sparčiau mokintis ir naudingiau naudotis įgytomis žiniomis. Atrastus ir ištirtus faktorius buvo stengiamasi suderinti taip, kad mokomas personažas balansuotų tarp greitai besimokančio ir tinkamai besinaudojančio išmoktomis žiniomis. / In this work was offered an adjustments of techniques and methods of artificial intelligence for computer player, in two-players combat games. There was tried to adapt the artificial intelligence to the computer player, with those proposed techniques and methods in this type of games. Also was investigated whether those proposed techniques and methods suits for this class of computer games. In this work also was analyzed agent learning speed, on what depends it and explored the factors that determines the rapid learning speed and efficient use of agent knowledge. Discovered and investigated factors were fitted to the rapid learning speed and efficient use of agent knowledge. Informatics Kompiuterinis žaidimas Žaidimo personažas Dirbtinis intelektas Mokymas pastiprinimu Agento mokymosi greitis Computer game Game character Artificial intelligence Reinforcement learning Agent learning speed

1

Page generated in 0.0405 seconds