Achieving higher-order convergence rates for density estimation with binned data

成果类型:
Article
署名作者:
Minnotte, MC
署名单位:
Utah System of Higher Education; Utah State University
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.2307/2670117
发表日期:
1998
页码:
663-672
关键词:
histograms Kernels
摘要:
A method is proposed for achieving fast-O(n(-8/9)) and faster-asymptotic mean integrated squared error (AMISE) convergence rates in density estimation using data condensed to standard histogram bin counts and edges. The method involves the two elements of B splines and the correct mass proportions property, in which the probability mass for the histogram bins equals exactly the fraction of the data found in those bins. It is shown that the asymptotic bias of such an estimate using a spline of order p and bin width h will be O(h(p)), resulting in optimal AMISE of O(n(-2p/(2p+1))) form. Theoretical and numeric comparisons to kernel-based and other estimators show that the histospline estimator is usually comparable and often superior. An extension to the multivariate density estimation setting is described.