Data curation and assessment
Carlo Minotti
SciCat F2F 2022
Outline
2
Advantages
3
Kpi: how do we want to track usage, downloads, users clicks
Adding fields to collections
Pros:
Cons:
4
Kpi: how do we want to track usage, downloads, users clicks
Improve the logging/monitoring (e.g. graylog)
Pros:
Cons:
5
Kpi: how do we want to track usage, downloads, users clicks
Plots/Jupyter notebooks on existing data (e.g. number of triggered retrieve jobs)
Pros:
Cons:
6
Data assessment: can we assess if our metadata/data is correct and how much variability does it contains?
Simple scripts that check formatting in DB (e.g. number of empty fields/values)
Pros:
Cons:
7
Data assessment: can we assess if our metadata/data is correct and how much variability does it contains?
Scripts that check based on well defined interfaces (e.g. OAI-PMH, search-API, Tubingen’s schemas)
Pros:
Cons:
8