Regression-based Planning with Sensing Actions and Incomplete Information

Regression-based Planning with Sensing Actions and Incomplete Information

[Comparison With Other Contingent Planners] [Comparison With Non-deterministic Planners] [Download]

Comparison With Other Contingent Planners

General Description:

We present a domain-independent regression contingent planner, namely CPR (Conditional Planning based on Regression), that is capable of generating conditional plans and conformant plans in the presence of sensing actions and incomplete information. CPR's search algorithm (influenced by the best-first search strategy) employs a state-based regression formulation [TBZS04] that is in turn based on the 0-approximation semantics discussed in [SB01].

A planning problem considered for CPR is a 4-tuple P = (A,O,I,G) where A is a set of fluents, O is a set of actions, I and G are set of fluent literals that encode what is known in the initial state and what is desired of a goal state, respectively. Note that, the current version of CPR also supports multi-valued domains. Below we illustrate the performance of CPRin comparison with several related contingent planners in the literature.

Please contact me if you have any questions or comments related to this stuff.

Test Results:

Domains:

GridMineSweep:

There is a grid of size N*N. There are mines at some positions on the grid. There is a robot, also on the grid, which can travel North, South, West, and East one step at a time. It can sense whether a mine safety lock is on or off at some predefined positions on the grid, it can turn the lock to make the lock become on or off alternatively, and disarm the mine at some specific locations. The goal is to make the grid have only disarmed mines without causing explosion. Initially, we know the robot's position, positions that the robot can disarm bomb and sense bombs' lock; but we don't know which mines are not disarmed. This domain may generates many possible plans that have unbalanced tree shape with many non-sensing actions and sensing actions.

Mail-man:

A modified version of the logistics domain used in IPC'02. A variant of the logistics domain is also discussed in [BKS04]. Basically, a central post office must direct their mail men to travel to several locations and cities to collect mails from mail-collection boxes. The transportation is either by car (in a local city) or flight (between cities). The mail men do not know a location has mail or has no mail. So one has to go there and check, then collect if available. Initially, we may know some vehicles whereabouts. This is somehow similar to the GridMineSweep but with more types of traveling.

Results:

Domain	CPR		YKA	MBP	POND
Domain	Max	Sum	YKA	MBP	POND
GridMineSweep
5-1	131	133	140	-	300
5-2	1344	1085	310	7400	800
5-3	41184	13752	-	-	12760
10-1	2276	384	6240	30	1360
10-2	4278	15865	394690	-	6460
10-3	-	510130	-	-	89760

MailMan
m11	238	194	4680	40	710
m12	3984	997	11580	40	1760
m13	11518	1256	19090	40	1630
m14	-	12518	23650	40	2010
m21	260	254	-	820	5900
m22	451	260	-	840	6030
m23	3472	1230	-	820	13440
m24	-	14838	-	840	69140

Results for CPR using mutex and Max and Sum heuristics, YKA, MBP, POND (using LUG_RP heuristics): Total time in milliseconds. Note that: (i) Both YKA and MBP do not handle sensing actions with preconditions, we therefore only use observable fluents instead; (ii) MBP running option: -v 0 -explicit_dfs_forward -no_conformant_at_start -no_plan_check. The symbol {-} in the table denotes that we either stop the planner or the planner kills itself after 15 minutes of running.

Downloads:

CPR planner Domains

CPR uses a Graphical User Interface for easy interaction with the planner.

The command line for Windows is: java -cp CPR.jar zeroaprx.CPR
To run in a Linux/Unix environment, please use with the flag -linux: java -cp CPR.jar zeroaprx.CPR -linux

Comparison With Non-deterministic Planners

Results of running CPR and MBP (with the corresponding 3-valued non-deterministic domains)

Domain	CPR		MBP
Domain	Max	Sum	MBP
GridMineSweep
5-1	121	121	40
5-2	1098	1070	13540
5-3	49152	15489	-
10-1	2435	410	190
10-2	4813	17483	-
10-3	-	590790	-

MailMan
m11	299	246	6250
m12	4573	1810	6250
m13	17099	1539	6080
m14	-	12732	6220
m21	345	318	-
m22	275	273	-
m23	3576	1268	-
m24	-	15414	-

Results for CPR using mutex and Max and Sum heuristics, and MBP: Total time in milliseconds. MBP running option: -v 0 -explicit_dfs_forward -no_conformant_at_start -no_plan_check. The symbol {-} in the table denotes that we either stop the planner or the planner kills itself after 15 minutes of running.

** Note: The paper by Cimatti et al [CPRT03] discusses such planners (QBFPLAN by Rintanen, GPT by Bonet and Geffner, SIMPLAN by Kabanza et al, and SGP by Weld et al). Note that, among these non-deterministic planners, MBP is more efficient than QBFPLAN (pages 34 & 35 [CPRT03]) and SGP, MBP is also faster than GPT - a probabilistic planner. SIMPLAN of Kabanza is a domain-dependent planner and thus we don't consider it as ours is a domain-independent planner. Nevertheless, MBP is also faster than SIMPLAN (page 37 [CPRT03]). We therefore only compare our planner's performance with that of MBP.

Downloads:

MBP Domains

Contact Information:

Please contact me if you have any questions or comments related to this stuff.

References

[BKS04] D. Bryce, S. Kambhampati and D. Smith. Planning Graph Heuristics for Belief Space Search. ASU CSE TR. Submitted for journal publication.

[CPRT03] Cimatti, A., Pistore, M., Roveri M., Traverso P. Weak, strong, and strong cyclic planning via symbolic model checking.
Artificial Intelligence Journal 147(1-2): 35-84 (2003)

[SB01] C. Baral and T. Son. Formalizing sensing actions -- a transition function based approach. In Artificial Intelligence Journal, 125 (1-2), pgs 19-91, Jan 2001.

[TBZS04] Le-Chi Tuan, C. Baral, Xin Zhang, Tran Son. Regression With Respect to Sensing Actions and Partial States. AAAI'04.