Dimension reduction for the conditional mean in regressions with categorical predictors

成果类型:
Article
署名作者:
Li, B; Cook, RD; Chiaromonte, F
署名单位:
Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; University of Minnesota System; University of Minnesota Twin Cities
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/aos/1065705121
发表日期:
2003
页码:
1636-1668
关键词:
principal hessian directions binary response Graphics
摘要:
Consider the regression of a response Y on a vector of quantitative predictors X and a categorical predictor W. In this article we describe a first method for reducing the dimension of X without loss of information on the conditional mean E(Y\X, W) and without requiring a prespecified parametric model. The method, which allows for, but does not require, parametric versions of the subpopulation mean functions E(Y\X, W = w), includes a procedure for inference about the dimension of X after reduction. This work integrates previous studies on dimension reduction for the conditional mean E(Y\X) in the absence of categorical predictors and dimension reduction for the full conditional distribution of Y\(X, W). The methodology we describe may be particularly useful for constructing low-dimensional summary plots to aid in model-building at the outset of an analysis. Our proposals provide an often parsimonious alternative to the standard technique of modeling with interaction terms to adapt a mean function for different subpopulations determined by the levels of W. Examples illustrating this and other aspects of the development are presented.