Belief Point-based POMDP Solution for Policy Tree Pruning

ZHENG Hongyan; WU Bo; FENG Yanpeng; MENG Xianjun

doi:10.3724/SP.J.1219.2013.00053

ZHENG Hongyan, WU Bo, FENG Yanpeng, MENG Xianjun. Belief Point-based POMDP Solution for Policy Tree Pruning[J]. INFORMATION AND CONTROL, 2013, 42(1): 53-57. DOI: 10.3724/SP.J.1219.2013.00053

Abstract

Large-scale partially observable Markov decision processes (POMDPs) suffer from exponential growth of the policy tree and from the difficulty of finding witness points (WPs). Based on the piecewise linearity and convexity of the value function, a belief point-based algorithm is proposed for incremental pruning of policy trees and for value iteration. While policy trees are being generated, the algorithm uses boundary points for non-destructive pruning and intermediate points for destructive pruning. It also uses real-time belief states to compute an approximately optimal solution. Comparative experiments show that the proposed algorithm converges quickly and achieves high reward in less time.
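The idea of pruning at belief points can be illustrated with a small sketch. It is a generic point-based pruning example and not the authors' procedure. It assumes each policy tree is summarized by a value vector with one entry per state, so that its value at a belief is a dot product, which is what piecewise linearity and convexity of the value function permit. The function names, the two-state vectors, and the choice of belief points are all hypothetical.

```python
def value(alpha, belief):
    """Value of one policy tree's vector at a belief (dot product)."""
    return sum(a * b for a, b in zip(alpha, belief))


def prune_at_belief_points(alphas, beliefs):
    """Keep only the vectors that are maximal at some belief point.

    Vectors that win at none of the given points are discarded, so the
    result depends on which belief points are supplied.
    """
    keep = []
    for b in beliefs:
        best = max(range(len(alphas)), key=lambda i: value(alphas[i], b))
        if best not in keep:
            keep.append(best)
    return [alphas[i] for i in sorted(keep)]


# Hypothetical two-state example: four candidate vectors.
alphas = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.6), (0.2, 0.2)]
corners = [(1.0, 0.0), (0.0, 1.0)]   # corners of the belief simplex
interior = [(0.5, 0.5)]              # one interior belief point

kept_corners = prune_at_belief_points(alphas, corners)
kept_all = prune_at_belief_points(alphas, corners + interior)
print(kept_corners)
print(kept_all)
```

In this toy case the vector (0.2, 0.2) is never maximal and is dropped either way, while (0.6, 0.6) survives only when the interior point is included. This shows how the set of belief points determines whether pruning discards a vector that is useful somewhere in the belief space.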
