A Stagewise Action Elimination Algorithm for the Discounted Semi-Markov Problem
Authors:
Sadjadi D.,
Bestwick Paul F.
Journal:
Journal of the Operational Research Society
(Taylor & Francis; available online 1979)
Volume/Issue:
Volume 30,
issue 7
Pages: 633-637
ISSN:0160-5682
Year: 1979
DOI:10.1057/jors.1979.156
Publisher: Taylor & Francis
Data source: Taylor
Abstract:
An efficient algorithm for solving discounted semi-Markov (Markov-renewal) problems is proposed. The value iteration method of dynamic programming is used in conjunction with a test for non-optimal actions. A non-optimality test for discounted semi-Markov processes, which extends Hastings and Van Nunen's (1976) test for undiscounted or discounted returns with an infinite or finite planning horizon, is used to identify actions that cannot be optimal at the current stage of a discounted semi-Markov process. The proposed test eliminates actions for one or more stages, after which they may re-enter the set of possibly optimal actions, but such re-entries cease as convergence proceeds.
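The general idea described in the abstract (value iteration with temporary, stage-by-stage action elimination) can be illustrated with a minimal sketch. This is not the test from the paper, which this record does not reproduce. It is a simple bound-based test in the same spirit, under stated assumptions: `r[i, a]` is the expected discounted reward for one transition, `q[i, a, j]` is a discounted transition kernel whose row sums `beta[i, a]` are below 1, and the function name `solve_with_elimination` and all parameters are hypothetical. For each state-action pair the sketch keeps a lower bound on the gap between the stage value and that action's value; while the bound stays positive the action is skipped, and once the bound decays the action is re-evaluated.

```python
import numpy as np

def solve_with_elimination(r, q, n_stages=200, tol=1e-12):
    """Value iteration with temporary action elimination (illustrative sketch).

    r[i, a]    : expected discounted one-transition reward (assumed given)
    q[i, a, j] : discounted transition kernel, row sums beta[i, a] < 1
    Returns the value vector after n_stages and the number of
    state-action evaluations actually performed.
    """
    n_states, n_actions = r.shape
    rows = np.arange(n_states)
    beta = q.sum(axis=2)                    # effective discount per (state, action)
    v = np.zeros(n_states)
    gap = np.zeros((n_states, n_actions))   # lower bound on v_next[i] - Q[i, a]
    evaluations = 0

    for _ in range(n_stages):
        # Actions whose gap bound is not strictly positive may still be optimal.
        active = gap <= tol
        q_vals = np.full((n_states, n_actions), -np.inf)
        for i in range(n_states):
            for a in np.flatnonzero(active[i]):
                q_vals[i, a] = r[i, a] + q[i, a] @ v
                evaluations += 1

        best = q_vals.argmax(axis=1)
        v_new = q_vals[rows, best]
        d = v_new - v
        lo, hi = d.min(), d.max()

        # Evaluated actions get their exact gap; skipped ones keep the old bound.
        gap = np.where(active, v_new[:, None] - q_vals, gap)

        # Propagate the bound one stage ahead:
        #   the next stage value rises by at least beta[i, best[i]] * lo,
        #   the next value of action a rises by at most beta[i, a] * hi.
        gap = gap + (beta[rows, best] * lo)[:, None] - beta * hi
        v = v_new

    return v, evaluations

def example_instance(seed=0, n_states=6, n_actions=4, alpha=0.1):
    """Hypothetical test problem: random transitions, exponential holding times."""
    rng = np.random.default_rng(seed)
    p = rng.random((n_states, n_actions, n_states))
    p /= p.sum(axis=2, keepdims=True)
    rate = rng.uniform(0.5, 2.0, (n_states, n_actions))
    # E[exp(-alpha * T)] for an exponential holding time T with the given rate
    discount = rate / (rate + alpha)
    q = p * discount[:, :, None]
    r = rng.random((n_states, n_actions))
    return r, q

if __name__ == "__main__":
    r, q = example_instance()
    v, evaluations = solve_with_elimination(r, q)
    print("values:", v)
    print("evaluations:", evaluations, "of", 200 * r.size)
```

Because the bound never overstates the true gap, a skipped action cannot be the maximizer at that stage, so the value vectors match those of plain value iteration. A skipped action is re-evaluated whenever its bound falls to zero or below, which corresponds to the re-entry behaviour the abstract mentions.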