VectorBase PopBio Data Submission
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

View only
 
 
ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
VectorBase can accept data in any well formatted format. Well formatted data is one that has all relevant metadata, that is easily machine readable.
2
3
One good example of well formatted data is data in the Minimum Information for Reusable Arthropod Abundance Data (MIReAD) format. See image below.
4
5
While MIReAAD is primarily for reporting abundance records (e.g. from regular population surveillance from traps), this can easily be extended to other kinds of data such as insecticide resistance, blood meal analysis, and pathogen status testing.
6
7
When submitting data, please consider the general MIReAAD data quality standards below also.
8
9
Data quality standards
Language. Once data are ready to be deposited/submitted, all fields and data need to be written in English for VectorBase staff to process.
No abbreviations. Abbreviations (including in columns names) are ambiguous, with the exception of measurement units (e.g. centigrade and meters).
No external legend/key files. While repetitive, all data should be explicitly given within the data table. Separate files mapping ID numbers to GPS locations, full species names, etc., should be avoided.
Unambiguous dates. Because of country-level differences in date formats, data should be reported with 4 digit years, and months provided alphabetically and not numerically (e.g. 4-Jun-2017 or Nov 12, 2015).
Machine-readable file formats. Data should be provided in non-proprietary machine readable formats such as comma-separated text files. PDFs and multiple spreadsheets in the same document should be avoided.
No font styling or subsection headings. Formatting (color, bold, italics, subscripts, sheet tab names, etc.) should not be required for understanding the data. Subsection headings should not be required to understand data; every line of data should be interpretable in isolation from any other line of data.
Highest precision possible. Data should be provided at the highest temporal, spatial, numerical, and taxonomic resolution available. If location (e.g., geographical coordinate) data need to be presented at a lower resolution than available for privacy reasons, this should be made clear in the submission in Study Information (Resource Metadata; Table 1).
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
Rund, S.S.C. et al. (2019) MIReAD, a minimum information standard for reporting arthropod abundance data, Nature Scientific Data
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
Loading...