An Observation-based Optimization Algorithm for POMDP and Its Simulation

HUANG Jing; YIN Bao-qun; LI Jun

HUANG Jing, YIN Bao-qun, LI Jun. An Observation-based Optimization Algorithm for POMDP and Its Simulation[J]. INFORMATION AND CONTROL, 2008, 37(3): 346-351,376.

Citation:

HUANG Jing, YIN Bao-qun, LI Jun. An Observation-based Optimization Algorithm for POMDP and Its Simulation[J]. INFORMATION AND CONTROL, 2008, 37(3): 346-351,376.

Citation:

HUANG Jing, YIN Bao-qun, LI Jun. An Observation-based Optimization Algorithm for POMDP and Its Simulation[J]. INFORMATION AND CONTROL, 2008, 37(3): 346-351,376.

An Observation-based Optimization Algorithm for POMDP and Its Simulation

Graphical Abstract

Graphical Abstract

Abstract

Abstract

The problem of performance optimization for partially observable Markov decision process(POMDP)is addressed based on the sensitivity analysis of Markov decision process(MDP).The sensitivity analysis formulas are given. Based on these results,two observation-based optimization algorithms,i.e.,policy-gradient and policy-iteration algorithms are developed for POMDP.To verify these algorithms,a simulation based on the problem of admission control is also presented.

FullText(HTML)

References (9)

Cited By

An Observation-based Optimization Algorithm for POMDP and Its Simulation

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content