Global ETD Search

Return to search

Planning, Acting, and Learning in Incomplete Domains

The engineering of complete planning domain descriptions is often very costly because of human error or lack of domain knowledge. Learning complete domain descriptions is also very challenging because many features are irrelevant to achieving the goals and data may be scarce. Given incomplete knowledge of their actions, agents can ignore the incompleteness, plan around it, ask questions of a domain expert, or learn through trial and error.
Our agent Goalie learns about the preconditions and effects of its incompletely-specified actions by monitoring the environment state. In conjunction with the plan failure explanations generated by its planner DeFault, Goalie diagnoses past and future action failures. DeFault computes failure explanations for each action and state in the plan and counts the number of incomplete domain interpretations wherein failure will occur. The questionasking strategies employed by our extended Goalie agent using these conjunctive normal form-based plan failure explanations are goal-directed and attempt to approach always successful execution while asking the fewest questions possible. In sum, Goalie: i) interleaves acting, planning, and question-asking; ii) synthesizes plans that avoid execution failure due to ignorance of the domain model; iii) uses these plans to identify relevant (goal-directed) questions; iv) passively learns about the domain model during execution to improve later replanning attempts; v) and employs various targeted (goal-directed) strategies to ask questions (actively learn).
Our planner DeFault is the first reason about a domain's incompleteness to avoid potential plan failure. We show that DeFault performs best by counting prime implicants (failure diagnoses) rather than propositional models. Further, we show that by reasoning about incompleteness in planning (as opposed to ignoring it), Goalie fails and replans less often, and executes fewer actions. Finally, we show that goal-directed knowledge acquisition - prioritizing questions based on plan failure diagnoses - leads to fewer questions, lower overall planning and replanning time, and higher success rates than approaches that naively ask many questions or learn by trial and error.

Identifer	oai:union.ndltd.org:UTAHS/oai:digitalcommons.usu.edu:etd-2182
Date	01 May 2012
Creators	Weber, Christopher H.
Publisher	DigitalCommons@USU
Source Sets	Utah State University
Detected Language	English
Type	text
Format	application/pdf
Source	All Graduate Theses and Dissertations
Rights	Copyright for this work is held by the author. Transmission or reproduction of materials protected by copyright beyond that allowed by fair use requires the written permission of the copyright owners. Works not in the public domain cannot be commercially exploited without permission of the copyright owner. Responsibility for any use rests exclusively with the user. For more information contact Andrew Wesolek (andrew.wesolek@usu.edu).

Page generated in 0.002 seconds

Planning, Acting, and Learning in Incomplete Domains

Description

Links & Downloads

Tags

Additional Fields