ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
2
Best Practice 1: Provide metadata
Provide metadata for both human users and computer applications.
3
Best Practice 2: Provide descriptive metadata
Provide metadata that describes the overall features of datasets and distributions.
4
Best Practice 3: Provide locale parameters metadata
Provide metadata about locale parameters (date, time, and number formats, language).
5
Best Practice 4: Provide structural metadata
Provide metadata that describes the schema and internal structure of a distribution.
6
Best Practice 5: Provide data license information
Provide a link to or copy of the license agreement that controls use of the data.
7
Best Practice 6: Provide data provenance information
Provide complete information about the origins of the data and any changes you have made.
8
Best Practice 7: Provide data quality information
Provide information about data quality and fitness for particular purposes.
9
Best Practice 8: Provide a version indicator
Assign and indicate a version number or date for each dataset.
10
Best Practice 9: Provide version history
Provide a complete version history that explains the changes made in each version.
11
Best Practice 10: Use persistent URIs as identifiers of datasets
Identify each dataset by a carefully chosen, persistent URI.
12
Best Practice 11: Use persistent URIs as identifiers within datasets
Reuse other people's URIs as identifiers within datasets where possible.
13
Best Practice 12: Assign URIs to dataset versions and series
Assign URIs to individual versions of datasets as well as to the overall series.
14
Best Practice 13: Use machine-readable standardized data formats
Make data available in a machine-readable, standardized data format that is well suited to its intended or potential use.
15
Best Practice 14: Provide data in multiple formats
Make data available in multiple formats when more than one format suits its intended or potential use.
16
Best Practice 15: Use standardized terms
Standardized terms should be used to provide data and metadata.
17
Best Practice 16: Reuse vocabularies
Use terms from shared vocabularies, preferably standardized ones, to encode data and metadata.
18
Best Practice 17: Choose the right formalization level
When reusing a vocabulary, opt for a level of formal semantics that fits both the data and its applications.
19
Best Practice 18: Provide data unavailability reference
For data that is not open, provide an explanation about how the data can be accessed and who can access it.
20
Best Practice 19: Provide bulk download
Enable consumers to retrieve the full dataset with a single request.
21
Best Practice 20: Provide Subsets for Large Datasets
If your dataset is large, enable users and applications to readily work with useful subsets of your data.
22
Best Practice 21: Use content negotiation for serving data available in multiple formats
Use content negotiation in addition to file extensions for serving data available in multiple formats.
23
Best Practice 22: Provide real-time access
When data is produced in real time, make it available on the Web in real time or near real time
24
Best Practice 23: Provide data up to date
Make data available in an up-to-date manner, and make the update frequency explicit.
25
Best Practice 24: Make Data Available through an API
Offer an API to serve data if you have the resources to do so.
26
Best Practice 25: Use Web Standards as the foundation of APIs
When designing APIs, use an architectural style that is founded on the technologies of the Web itself.
27
Best Practice 26: Provide complete documentation for your API
Provide complete information on the Web about your API. Update documentation as you add features or make changes.
28
Best Practice 27: Avoid Breaking Changes to Your API
Avoid changes to your API that break client code, and communicate any changes in your API to your developers when evolution happens.
29
Best Practice 28: Assess dataset coverage
Assess the coverage of a dataset prior to its preservation.
30
Best Practice 29: Use a trusted serialization format for preserved data dumps
Use a well established serialization when sending a data dump for long term preservation.
31
Best Practice 30: Update the status of identifiers
Link preserved resources through URIs with their "live" counterparts.
32
Best Practice 31: Gather feedback from data consumers
Provide a readily discoverable means for consumers to offer feedback.
33
Best Practice 32: Make feedback available
Make consumer feedback available to both human users and computer applications.
34
Best Practice 33: Enrich data by generating new data
Enrich your data by generating new data from the raw data when doing so will enhance its value.
35
Best Practice 34: Provide Complementary Presentations
Enrich data by presenting it in complementary, immediately informative ways, such as visualizations, tables, Web applications, or summaries.
36
Best Practice 35: Provide Feedback to the Original Publisher
Let the original publisher know when you are reusing their data. If you find an error or have suggestions or compliments, let them know.
37
Best Practice 36: Follow Licensing Terms
Find and follow the licensing requirements from the original publisher of the dataset.
38
Best Practice 37: Cite the Original Publication
Acknowledge the source of your data in metadata. If you provide a user interface, include the citation visibly in the interface.
39
40
41
Should we use human or people ? software or machine?
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100