A discounted uniform one-armed bandit problem
Author:
Toshio Hamada
Journal:
Sequential Analysis
(Taylor Available online 1992)
Volume/Issue:
Volume 11,
Issue 1
Pages: 1-15
ISSN:0747-4946
Year: 1992
DOI:10.1080/07474949208836241
Publisher: Marcel Dekker, Inc.
Keywords: bandit problem; discount factor; Bayesian updating; optimal strategy; myopic strategy
Data source: Taylor
Abstract:
The problem considered in this paper is to decide when to stop a sequence of the