The Use of Names for Linking Personal Records
Authors:
Howard B. Newcombe,
Martha E. Fair,
Pierre Lalonde
Journal:
Journal of the American Statistical Association
(Taylor & Francis; available online 1992)
Volume/Issue:
Volume 87,
Issue 420
Pages: 1193-1204
ISSN: 0162-1459
Year: 1992
DOI: 10.1080/01621459.1992.10476278
Publisher: Taylor & Francis Group
Keywords: Data base maintenance; File searching; Probabilistic linkage; Quantitative judgment; Record linkage
Data source: Taylor
Abstract:
The skill of a human who searches large files of personal records depends much on prior knowledge of how the names vary in successive documents pertaining to the same individuals (e.g., as with ANTHONY–TONY, JOSEPH–JOE, WILLIAM–BILL). Now, an essentially exact procedure enables computers to make similar use of an accumulated memory of their own past experiences when searching for, and linking, records that relate to particular persons. This knowledge is further applied to quantify the benefits from various refinements of the rules by which the discriminating powers of names are calculated when they do not precisely agree or are substantially dissimilar. Of the six refinements tested, by far the most important is the recently developed exact approach for calculating the ODDS associated with comparisons of names that are possible synonyms.
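The abstract does not spell out how the ODDS for a pair of possibly synonymous names are computed. As a rough illustration only, probabilistic linkage commonly expresses such odds as a frequency ratio: how often a given name pair occurs among record pairs known to be linked, divided by how often it occurs among unlinked pairs. The sketch below assumes that generic formulation. The function name `name_pair_odds`, its parameters, and the toy counts are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from fractions import Fraction


def name_pair_odds(linked_pairs, unlinked_pairs, pair):
    """Frequency-ratio odds for a name pair (generic sketch, not the paper's exact procedure).

    linked_pairs / unlinked_pairs: lists of (name_a, name_b) tuples taken from
    previously verified links and from known non-links, respectively.
    Returns None when the pair was never observed in one of the two sets,
    because the ratio is then undefined without some smoothing rule.
    """
    def normalise(p):
        # Treat (WILLIAM, BILL) and (BILL, WILLIAM) as the same comparison outcome.
        return tuple(sorted(p))

    key = normalise(pair)
    linked = Counter(normalise(p) for p in linked_pairs)
    unlinked = Counter(normalise(p) for p in unlinked_pairs)
    if linked[key] == 0 or unlinked[key] == 0:
        return None

    # Relative frequency of this outcome in each set, kept exact with Fraction.
    p_linked = Fraction(linked[key], len(linked_pairs))
    p_unlinked = Fraction(unlinked[key], len(unlinked_pairs))
    return p_linked / p_unlinked


# Invented toy data standing in for an "accumulated memory" of past linkages.
linked_pairs = (
    [("WILLIAM", "BILL")] * 3
    + [("WILLIAM", "WILLIAM")] * 6
    + [("JOSEPH", "JOE")] * 1
)
unlinked_pairs = [("WILLIAM", "BILL")] * 1 + [("WILLIAM", "JOSEPH")] * 99

odds = name_pair_odds(linked_pairs, unlinked_pairs, ("BILL", "WILLIAM"))
```

With these invented counts, WILLIAM–BILL appears in 3 of 10 linked pairs and 1 of 100 unlinked pairs, so the ratio favours a link by 30 to 1. A pair never seen in one of the sets returns `None` here; how such cases should be handled is left open, since the abstract gives no guidance on it.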