Correlation and high-dimensional consistency in pattern recognition
成果类型:
Article
署名作者:
Ge, NX; Simpson, DG
署名单位:
University of Illinois System; University of Illinois Urbana-Champaign
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.2307/2669844
发表日期:
1998
页码:
995-1006
关键词:
discrimination
populations
variables
摘要:
Classical discriminant analysis breaks down when the feature vectors are of extremely high dimension; for example, when the basic observation is a random function observed over a fine grid. Alternative methods have been developed assuming a simplified form for the covariance structure. We analyze the high-dimensional asymptotics of some of these methods, emphasizing the effects of correlations such as occur when the baseline is random. For instance, the Euclidean distance classifier, which has been proposed for generic use in high-dimensional classification problems, is dimensionally inconsistent under a simple repeated measurement model. We provide exponential bounds for the error rates of several classifiers. We develop new dimensionally consistent methods to deal with the effects of correlation in high-dimensional problems.