Assessing data quality for information products: Impact of selection, projection, and Cartesian product
成果类型:
Article
署名作者:
Parssian, A; Sarkar, S; Jacob, VS
署名单位:
University of Illinois System; University of Illinois Springfield; University of Texas System; University of Texas Dallas
刊物名称:
MANAGEMENT SCIENCE
ISSN/ISSBN:
0025-1909
DOI:
10.1287/mnsc.1040.0237
发表日期:
2004
页码:
967-982
关键词:
information quality metrics
relational data model
relational algebra
probability calculus
摘要:
The cost associated with making decisions based on poor-quality data is quite high. Consequently, the management of data quality and the quality of associated data management processes has become critical for organizations. An important first step in managing data quality is the ability to measure the quality of information products (derived data) based on the quality of the source data and associated processes used to produce the information outputs. We present a methodology to determine two data quality characteristics-accuracy and completeness-that are of critical importance to decision makers. We examine how the quality metrics of source data affect the quality for information outputs produced using the relational algebra operations selection, projection, and Cartesian product. Our methodology is general, and can be used to determine how quality characteristics associated with diverse data sources affect the quality of the derived data.
来源URL: