Managing data quality of integrated data with known provenance
del Pilar Angeles, Maria and MacKinnon, Lachlan M. (2011) Managing data quality of integrated data with known provenance. International Journal of Information Quality, 2 (3). pp. 244-263. ISSN 1751-0457 (Print), 1751-0465 (Online) (doi:10.1504/IJIQ.2011.040671)Full text not available from this repository.
Users querying a database system will have returned to them a set of data with no indication of the qualitative value of that data. In order to address the issue of data quality, and challenging the presumptions of perfection, atomicity and primary authorship, a toolset has been developed. This project proposes a data quality manager (DQM), which contains a reference model, a measurement model along with an assessment model. The present work aims to identify data quality criteria to measure and assess data quality of derived data, as well as data at multiple levels of granularity. The qualitative information provided by the DQM is enhanced by considering data provenance. The qualitative measures allow the ranking of data sources based on users' specification of the context in a heterogeneous multi database environment. The DQM prototype has been tested and several experiments have been carried out in order to prove that more accurate information is being provided to the users.
|Uncontrolled Keywords:||data quality, data provenance, heterogeneous database systems, information quality, quality management, granularity, data retrieval, information retrieval|
|Subjects:||Q Science > QA Mathematics > QA75 Electronic computers. Computer science|
|School / Department / Research Groups:||School of Computing & Mathematical Sciences|
|Last Modified:||10 May 2013 15:48|
Actions (login required)