Outcome Probabilities for a Record Matching Process with Complete Invariant Information

 

Author: Gad Nathan

 

Journal: Journal of the American Statistical Association (Taylor & Francis, available online 1967)
Volume/Issue: Volume 62, Issue 318

Pages: 454-469

 

ISSN:0162-1459

 

Year: 1967

 

DOI:10.1080/01621459.1967.10482920

 

Publisher: Taylor & Francis Group

 

Data source: Taylor

 

Abstract:

Record matching processes, which compare sets of identifying information to decide whether or not a pair of records relate to the same individual or population item, are basic to a wide range of applications in social research, file maintenance and information retrieval. Such processes may be conveniently described in terms of matching a single incoming record against a master file or list. To evaluate different record matching processes in terms of matching costs and error losses, it is necessary to evaluate the outcome probabilities. It is shown that this can be done by considering only the class-size probability distributions, for a simple model which assumes that the information used for matching is complete and invariant but possibly insufficient to distinguish between all population items. These distributions can be estimated directly from the list, or from a sub-sample drawn from it, by applying Goodman's [2] results concerning the estimation of the number of classes in a population. The outcome probabilities can then be evaluated by treating the incoming record as randomly drawn from the list if it should match some item on the list, and as added to the list, forming a new list with approximately the same classification probabilities, if it should match no item on the list. A numerical example illustrates an application of the model.
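
The role of the class-size distribution can be illustrated with a minimal Python sketch. It is not the paper's procedure or its numerical example: the list of keys, the function names, and the use of exact key agreement as the matching rule are all assumptions made here for illustration. It relies only on the fact that a record drawn uniformly from a list of N records falls in a class of size k with probability k * n_k / N, where n_k is the number of classes of size k.

```python
from collections import Counter


def class_size_counts(keys):
    """Return {k: n_k}, where n_k is the number of classes holding exactly k records."""
    class_sizes = Counter(keys)           # identifying key -> records sharing it
    return Counter(class_sizes.values())  # class size k -> number of such classes


def size_biased_probabilities(keys):
    """P(a record drawn uniformly from the list lies in a class of size k)."""
    total = len(keys)
    counts = class_size_counts(keys)
    return {k: k * n_k / total for k, n_k in sorted(counts.items())}


# Hypothetical master list: each entry is the identifying key of one record.
master_keys = ["A", "A", "B", "C", "C", "C", "D", "E"]

probs = size_biased_probabilities(master_keys)
for size, p in probs.items():
    print(f"class size {size}: probability {p:.3f}")
```

Under these assumptions, `probs[1]` is the chance that an incoming record which should match some item on the list agrees with exactly one list item, so that the match is unambiguous. Larger class sizes correspond to ambiguous matches.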

 
