您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 信息管理与信息系统 > Information Systems Research > 2005 > 3期

Maximizing accuracy of shared databases when concealing sensitive patterns

成果类型：

Article

署名作者：

Menon, S; Sarkar, S; Mukherjee, S

署名单位：

University of Texas System; University of Texas Dallas

刊物名称：

INFORMATION SYSTEMS RESEARCH

ISSN/ISSBN：

1047-7047

DOI：

10.1287/isre.1050.0056

发表日期：

2005

页码：

256-270

关键词：

privacy

摘要：

The sharing of databases either within or across organizations raises the possibility of unintentionally revealing sensitive relationships contained in them. Recent advances in data-mining technology have increased the chances of such disclosure. Consequently, firms that share their databases might choose to hide these sensitive relationships prior to sharing. Ideally, the approach used to hide relationships should be impervious to as many data-mining techniques as possible, while minimizing the resulting distortion to the database. This paper focuses on frequent item sets, the identification of which forms a critical initial step in a variety of data-mining tasks. it presents an optimal approach for hiding sensitive item sets, while keeping the number of modified transactions to a minimum. The approach is particularly attractive as it easily handles databases with millions of transactions. Results from extensive tests, conducted on publicly available real data and data generated using IBM's synthetic data generator indicate that the approach presented is very effective, optimally solving problems involving millions of transactions in a few seconds.

来源URL：

访问原文