Data Ingestion Checklist
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
$
%
123
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
Applicable sectionsObservation
Comments
2
Register a resource with SciCrunch Registry
Have you registered for a SciCrunch account, https://scicrunch.org/register ?
3
Did you make sure we don't already have the resource by searching SciCrunch using parts of the name if the name is not straight forward?
4
Did you review the "Resources and Curation" documentation in reference to naming conventions, etc. https://confluence.crbs.ucsd.edu/display/NIF/Resources+and+Curation ?
5
Did you create your new resource using the "Create a resource" field at https://scicrunch.org/browse/resourcedashboard ?
6
7
Eligibility for Federation resourcesIs this resource in the SciCrunch Registry?
8
Does this resource contain database or datasets that can be exported as tables?
9
Is the data open to public and accessed by the community without extra requirements?
10
Do the resource owners agree to make their data public through Data Federation?
11
12
Adding resource to DISCO / SitemapIs the resource curated?
13
Did you add the resource to disco, http://disco.neuinfo.org/ , by ?adding it here: http://disco.neuinfo.org/webportal/SciCrunchRegistration.do??
14
If you were having problems did you refer to the YouTube video, http://www.youtube.com/watch?v=MezKrMrN2_Q&list=UUORjtRYZj5Pg3K-ueLEytYw&index=9&feature=plcp ?
15
16
Create an Interop Fileset schedule for crawling: set initially to weekly
17
18
Define the source in cmAdd the source to CM
19
Is the source name consistent with the resource name/database name?
20
Does the name exclude dash, comma, or other punctuations and special characters?
21
Is the source name less than 30 characters?
22
Is the description of the resource relatively consistent with that of the SciCrunch Registry?
23
Is the description of the resource relatively consistent with the data content?
24
25
Define the view in cmAdd the view to CM
26
Is the view name consistent with the database content?
27
Does the name contain dash, comma, or other punctuations and special characters?
28
Is the description briefly stating the information of the view?
29
Does the resource’s “Indexable” radio button set to “Yes"?
30
31
add keywordsDo the keywords include name of resource, abbreviation, data type, a resource ID and view ID as strings?
32
Are concept IDs added for the Ontology terms?
33
Are key ontology terms (organism, technique, anatomical region/structure, related condition, functional level, other?), if applicable, specified? If not specified in the column (e.g., if a view is human MRI and neither of these are specified in the columns) have they been added as keywords? If the view all relates to Diabetes Type 1 and there is no column for this, has it been added as a keyword?
34
35
Define the display of viewAre the headers for similar content across resources/views uniform?
36
Are special characters or colons excluded from column headers? Note: Other punctucation may prevent the view from being generated.
37
Are the column headers unique for each column within a single view?
38
39
Columns formatting Are all columns marked indexable?except?
40
Are all columns marked exportable?except?
41
Are the weights appropriately set according to the curator guidelines?
42
Are the facets appropriately set according to the curator guidelines?
43
Is "Is key" "Yes" to the e_uid column and "No" to all other columns?
44
Are the description, notes and comments mapped to ‘Full Text’?
45
Are Protein and Chemical substance mapped to ‘Molecule’?
46
Are the gene alleles mapped to ‘Genomic locus variant’?
47
Are ids such as e_uid, gene id, NIF ids, PubMed ids and accession numbers mapped to ‘Identifiers’?
48
Are reference and PubMed IDs mapped to ‘Publication’?
49
Are gene symbol and gene name mapped to ‘Gene’?
50
Are anatomical structures mapped to ‘Anatomy’?
51
Are the organism and species mapped to ‘Organism’?
52
53
54
Quality controlDid you add a description to your view?
55
Do the links in the description work?
56
Does each column in the view sort?
57
Are there any empty columns?
58
Did you search each column for something in the column and retrieve an expected result?
59
Does every type of facet retrieve the expected result(s)?
60
When you click on links in each column did they go where you expect?
61
Does searching in the search bar retrieve the records you would expect?
62
Did you do some searches (in the main page search) and get the results you expected? E.g., MRI if your data involves MRI, rat if your data covers rat. (Consider covering the content in each column and content the columns do not cover, where applicable)
63
Did you notify curation team that the resource is ready for release?
64
65
after pushed to productionmonitor the view every Monday to make sure view doesn't drop from production
66
check description shows up
67
make default SciCrunch snippet
68
broadcast on social media
69
monitor source crawling: Frequency that the underlying data appears to update? (Tom); adjust down if changes don't happen often (several months)
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
Loading...
Main menu