Incorporating additional information to normal linear discriminant rules

成果类型:
Article
署名作者:
Fernandez, Miguel A.; Rueda, Cristina; Salvador, Bonifacio
署名单位:
Universidad de Valladolid
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1198/016214505000001041
发表日期:
2006
页码:
569-577
关键词:
摘要:
The most useful and broadly known rule in the classical two-group linear normal discriminant analysis is Anderson's rule. In this article we propose some alternative procedures that prove useful when prior constraints on the mean vectors are known. These rules are based on new estimators of the difference of means. We prove under mild conditions that the new rules perform better when the common covariance matrix is known. Simulated experiments show that the misclassification errors are lower for the restricted rules defined here in the general case of an unknown covariance matrix. The prior constraints on the mean vector restrict the parameter space to a cone. A family of estimators indexed by a parameter gamma, with 0 <= gamma <= 1, is defined using an iterative procedure in such a way that the estimator with a higher value for gamma takes values closer to the center of the cone with a greater probability. When gamma = 0, the restricted maximum likelihood estimator is given, although the most interesting rule from a theoretical and practical standpoint is obtained when the estimator chosen is given by gamma = 1. The usefulness of the proposed rules with real data is demonstrated by their application to two medical examples, the first dealing with heart attack patients and the second dealing with a diabetes dataset. In the former case, restrictions among surviving and nonsurviving patients are used; in the latter, the restrictions arise from differences between the healthy and diabetic populations.
来源URL: