您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 统计学 > The Annals of Statistics > 2012 > 2期

CHARACTERIZING L2BOOSTING

成果类型：

Article

署名作者：

Ehrlinger, John; Ishwaran, Hemant

署名单位：

Cleveland Clinic Foundation; University of Miami

刊物名称：

ANNALS OF STATISTICS

ISSN/ISSBN：

0090-5364

DOI：

10.1214/12-AOS997

发表日期：

2012

页码：

1074-1101

关键词：

regression selection Lasso

摘要：

We consider L(2)Boosting, a special case of Friedman's generic boosting algorithm applied to linear regression under L-2-loss. We study L(2)Boosting for an arbitrary regularization parameter and derive an exact closed form expression for the number of steps taken along a fixed coordinate direction. This relationship is used to describe L(2)Boosting's solution path, to describe new tools for studying its path, and to characterize some of the algorithm's unique properties, including active set cycling, a property where the algorithm spends lengthy periods of time cycling between the same coordinates when the regularization parameter is arbitrarily small. Our fixed descent analysis also reveals a repressible condition that limits the effectiveness of L(2)Boosting in correlated problems by preventing desirable variables from entering the solution path. As a simple remedy, a data augmentation method similar to that used for the elastic net is used to introduce L-2-penalization and is shown, in combination with decorrelation, to reverse the repressible condition and circumvents L(2)Boosting's deficiencies in correlated problems. In itself, this presents a new explanation for why the elastic net is successful in correlated problems and why methods like LAR and lasso can perform poorly in such settings.