首页   按字顺浏览 期刊浏览 卷期浏览 Residual Diagnostics for Mixture Models
Residual Diagnostics for Mixture Models

 

作者: BruceG. Lindsay,   Kathryn Roeder,  

 

期刊: Journal of the American Statistical Association  (Taylor Available online 1992)
卷期: Volume 87, issue 419  

页码: 785-794

 

ISSN:0162-1459

 

年代: 1992

 

DOI:10.1080/01621459.1992.10475280

 

出版商: Taylor & Francis Group

 

关键词: EM algorithm;Exponential family mixtures;Nonparametric mixtures;Overdispersion

 

数据来源: Taylor

 

摘要:

A sample is commonly modeled by a mixture distribution if the observations follow a common distribution, but the parameter of interest differs between observations. For example, we observe the lengths but not the ages of a sample offish. It may be reasonable to assume that length is normally distributed about an unknown mean that depends on the age of the fish. Provided there is more than one age class in the sample, then the data are distributed as a mixture of normals. In this article we assume that the data are a random sample from a mixture of exponential family distributions and that for each observation the parameter of interest is sampled independently from an unknown mixing distributionQ. The adequacy of a fitted mixture model can be assessed by examining residuals based on the ratio of the observed to expected fit. Residuals based on the homogeneity model (in whichQis a one-point distribution) display a convexity property when the data follow a mixture model; this becomes the basis for diagnostic plots to detect the presence of mixing. Similar results also are obtained from smoothed residuals; thus the diagnostic also can be applied to sparse or continuous data. The nonparametric maximum likelihood estimate[Qcirc]of the distributionQis known to be discrete. Smoothed residuals obtained from the fitted mixed model provide information about the number of support points in[Qcirc]. This facilitates the use of the EM algorithm to find[Qcirc]. The residuals evaluated at[Qcirc]determine whether or not the maximum likelihood estimate is unique and hence interpretable. Simulated and actual data sets are analyzed to illustrate the power and the utility of these procedures.

 

点击下载:  PDF (1021KB)



返 回