COST‐BENEFIT CONSIDERATIONS IN CHOOSING AMONG CROSS‐VALIDATION METHODS
作者:
KEVIN R. MURPHY,
期刊:
Personnel Psychology
(WILEY Available online 1984)
卷期:
Volume 37,
issue 1
页码: 15-22
ISSN:0031-5826
年代: 1984
DOI:10.1111/j.1744-6570.1984.tb02244.x
出版商: Blackwell Publishing Ltd
数据来源: WILEY
摘要:
There are two general methods of cross‐validation: (a) empirical estimation, and (b) formula estimation. In choosing a specific cross‐validation procedure, one should consider both costs (eg. inefficient use of available data in estimating regression parameters) and benefits (eg. accuracy in estimating population cross‐validity). Empirical cross‐validation methods involve significant costs, since they are typically laborious and wasteful of data, but under conditions represented in Monte Carlo studies, they are generally not more accurate than formula estimates. Consideration of costs and benefits suggests that empirical estimation methods are typically not worth the cost, except in a limited number of cases in which Monte Carlo sampling assumptions are not met in thederivationsample. Designs which use multiple samples to estimate the cross‐validity of a single regression equation are clearly preferable to single‐sample designs; the latter are never expected to be more accurate than formula estimates and thus are never worth the cost. Multi‐equation designs are more accurate than single equation designs, but they appear to estimate the wrong parameter, and thus are difficult
点击下载:
PDF
(406KB)
返 回