While it is relatively easy to amass large amounts of data, sorting, tagging, and presenting data in a way that enables users to glean valuable information is not as straightforward. There can be real variation in the quality of datasets available for purchase. Without a set of common metrics to compare the relative quality of datasets, potential buyers face challenges, especially when choosing amongst similar datasets from multiple data providers.
This set of guidelines recommends a baseline set of data quality metrics which is industry domain agnostic, for adoption by data providers. In addition to describing the methodology for deriving this set of metrics, tools for relaying metrics to end-users are also considered. Having a common set of metrics allows users to more easily compare the quality of different datasets, and match their expectations against available datasets.
Fill in the form below for link to the full document.