Selectively acquiring customer information: A new data acquisition problem and an active learning-based solution

成果类型:
Article
署名作者:
Zheng, Zhiqiang; Padmanabhan, Balaji
署名单位:
University of California System; University of California Riverside; University of Pennsylvania
刊物名称:
MANAGEMENT SCIENCE
ISSN/ISSBN:
0025-1909
DOI:
10.1287/mnsc.1050.0488
发表日期:
2006
页码:
697-712
关键词:
selective information acquisition Active learning Data mining
摘要:
This paper presents a new information acquisition problem motivated by business applications where customer data has to be acquired with a specific modeling objective in mind. In the last two decades, there has been substantial work in two different fields-optimal experimental design and machine learning-that has addressed the issue of acquiring data in a selective manner with a specific objective in mind. We show that the problem presented here is different from the classic model-based data acquisition problems considered thus far in the literature in both fields. Building on work in optimal experimental design and in machine learning, we develop a new active learning technique for the information acquisition problem presented in this paper. We demonstrate that the proposed method performs well based on results from applying this method across 20 Web usage and machine learning data sets.