Bandwidth choice for nonparametric classification

成果类型:
Article
署名作者:
Hall, P; Kang, KH
署名单位:
Australian National University; Hankuk University Foreign Studies
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/009053604000000959
发表日期:
2005
页码:
284-306
关键词:
cross-validation ERROR RATE probability RULE regression selection number rates BIAS
摘要:
It is shown that, for kernel-based classification with univariate distributions and two populations, optimal bandwidth choice has a dichotomous character. If the two densities cross at just one point. where their curvature.,, have the same signs, then minimum Bayes risk is achieved using bandwidths which are an order of magnitude larger than those which minimize pointwise estimation error. On the other hand, if the curvature signs are different, or if there are multiple crossing points. then bandwidths of conventional size are generally appropriate. The range of different modes of behavior is narrower in multivariate settings. There, the optimal size of bandwidth is generally the same as that which is appropriate for pointwise density estimation. These properties motivate empirical rules for bandwidth choice.
来源URL: