Data quality: A statistical perspective |
| |
Authors: | Alan F. Karr Ashish P. Sanil David L. Banks |
| |
Affiliation: | aNational Institute of Statistical Sciences, 19 T.W. Alexander Drive, 27709-4006 Research Triangle Park, NC, United States;bDuke University, Institute of Statistics and Decision Sciences, Box 90251, 27708 Durham, NC, United States |
| |
Abstract: | ![]() We present the old-but-new problem of data quality from a statistical perspective, in part with the goal of attracting more statisticians, especially academics, to become engaged in research on a rich set of exciting challenges. The data quality landscape is described, and its research foundations in computer science, total quality management and statistics are reviewed. Two case studies based on an EDA approach to data quality are used to motivate a set of research challenges for statistics that span theory, methodology and software tools. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|