Data Management
Objectives
-xkcd
Why Manage Data: Researcher Perspective
Why Data Management
Why Data Management: Researcher Perspective
CC image by UWW ResNet on Flickr
Activity: In groups of two, examine this file. Do you think you could work with this data? Why or why not?
A wildlife biologist for a small field office was the in-house GIS expert and provided support for all the staff’s GIS needs. However, the data were stored on her own workstation. When the biologist relocated to another office, no one understood how the data were stored or managed.
Solution: A state office GIS specialist retrieved the workstation and sifted through files trying to salvage relevant data.
Cost: 1 work month ($4,000) plus the value of data that were not recovered
Consider that the situation could have been worse, because the data were not being backed up as they would have been if stored on a server.
Poor Science Data Management Example
Importance of Data Management
The climate scientists at the centre of a media storm over leaked emails were yesterday cleared of accusations that they fudged their results and silenced critics, but a review found they had failed to be open enough about their work.
Why Data Management: Foundation to Advance Science
Data Management Facilitates Sharing and Re-use…
Also see “Compliance with funder mandates” in the W&M guide
Why is well-managed, publicly accessible data important?
Here are a few reasons (from the UK Data Archive):
Well-Managed Data Can Result in Re-use, Integration, and New Science
Spatio-Temporal Exploratory Models predict the probability of occurrence of bird species across the United States on a 35 km x 35 km grid.
[Figure: diagram labels: eBird, Land Cover, Meteorology, MODIS (remote sensing data), Model results, Potential Uses. Map panels: Occurrence of Indigo Bunting (2008) for Jan, Apr, Jun, Sep, Dec.]
Slide courtesy of DataONE
“Planet hidden in Hubble archives” Science News (Feb. 27, 2009)
A new image processing technique reveals something not before seen in this Hubble Space Telescope image taken 11 years ago: A faint planet (arrows), the outermost of three discovered with ground-based telescopes last year around the young star HR 8799. D. Lafrenière et al., Astrophysical Journal Letters.
“The first thing it tells you is how valuable maintaining long-term archives can be. Here is a major discovery that’s been lurking in the data for about 10 years!” comments Matt Mountain, director of the Space Telescope Science Institute in Baltimore, which operates Hubble.
“The second thing it tells you is having a well calibrated archive is necessary but not sufficient to make breakthroughs — it also takes a very innovative group of people to develop very smart extraction routines that can get rid of all the artifacts to reveal the planet hidden under all that telescope and detector structure.”
New Discoveries
D. Lafrenière et al., ApJ Letters
SangyaPundir, CC BY-SA 4.0, via Wikimedia Commons
What is the Data Life Cycle?
Plan
Collect
Assure
Describe
Preserve
Discover
Integrate
Analyze
Plan: Create a data management plan (DMP)
Data management plan guide
Cornell University Research Data Management Service Group’s data management planning page and basic elements from DataONE
DMP Tool
Other planning practices
Collect: Preserve a separate copy of your raw data & use non-proprietary formats & be consistent
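One way to keep a separate raw copy is sketched below. It is a minimal illustration, not a prescribed DataONE procedure: the function names `sha256_of` and `archive_raw_copy` and the idea of a dedicated raw-data folder are assumptions made for this example. The copy is verified with a checksum and then marked read-only so that later cleaning happens on a working copy instead.

```python
import hashlib
import shutil
import stat
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def archive_raw_copy(source: Path, raw_dir: Path) -> Path:
    """Copy `source` into `raw_dir`, verify the copy, and mark it read-only."""
    raw_dir.mkdir(parents=True, exist_ok=True)
    target = raw_dir / source.name
    if target.exists():
        # Never silently replace an existing raw file.
        raise FileExistsError(f"refusing to overwrite raw copy: {target}")
    shutil.copy2(source, target)
    if sha256_of(source) != sha256_of(target):
        raise IOError(f"checksum mismatch after copying {source}")
    # Remove write permission so the raw copy is not edited by accident.
    target.chmod(stat.S_IRUSR | stat.S_IRGRP | stat.S_IROTH)
    return target
```

Storing the raw file as plain CSV or another open format, as the slide suggests, keeps the archived copy readable without the software that produced it.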
Other collecting practices
Assure: Develop a Quality Assurance/Control plan
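Part of a QA/QC plan can be automated. The sketch below shows the kind of check such a plan might include; the column names (`site`, `date`, `count`) and the plausibility limit are hypothetical and would come from your own plan.

```python
import csv
import io

# Hypothetical rules for illustration: column names and limits are assumptions.
REQUIRED_COLUMNS = ("site", "date", "count")
MAX_PLAUSIBLE_COUNT = 10_000


def check_rows(csv_text: str) -> list[str]:
    """Return a list of human-readable problems found in the CSV text."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    missing = [c for c in REQUIRED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        return [f"missing columns: {', '.join(missing)}"]
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for column in REQUIRED_COLUMNS:
            if not (row[column] or "").strip():
                problems.append(f"line {line_no}: empty {column}")
        value = (row["count"] or "").strip()
        if value:
            if not value.isdigit():
                problems.append(
                    f"line {line_no}: count is not a non-negative integer: {value!r}"
                )
            elif int(value) > MAX_PLAUSIBLE_COUNT:
                problems.append(
                    f"line {line_no}: count {value} exceeds plausible maximum"
                )
    return problems
```

An empty list means the file passed these particular checks. It does not mean the data are correct.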
Other assurance practices
Describe: Document and describe your data (metadata)
Use good practices in file management
-xkcd
Create good metadata
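A simple way to start is a machine-readable "sidecar" record stored next to each data file. The sketch below is one possible shape. The field names are illustrative assumptions and not a formal metadata standard; a real project would normally follow the standard used by its discipline or repository.

```python
import json
from datetime import date

# Illustrative field names only; a real project would follow a community
# metadata standard chosen in its data management plan.
REQUIRED_FIELDS = ("title", "creator", "date_collected", "description", "variables")


def build_metadata(title, creator, date_collected, description, variables):
    """Return a metadata record, raising ValueError if a field is empty."""
    record = {
        "title": title,
        "creator": creator,
        "date_collected": date_collected,
        "description": description,
        "variables": variables,
    }
    missing = [name for name in REQUIRED_FIELDS if not record[name]]
    if missing:
        raise ValueError(f"missing metadata fields: {', '.join(missing)}")
    # Reject strings that do not parse as an ISO 8601 date (e.g. 2008-06-01).
    date.fromisoformat(date_collected)
    return record


def to_sidecar_json(record):
    """Serialize the record as stable, human-readable JSON."""
    return json.dumps(record, indent=2, sort_keys=True)
```

Describing each variable with its unit and meaning is what lets someone else, or you in a few years, interpret the columns without guessing.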
Other description best practices
Preserve: Use the 3-2-1 rule to back up your data (3 copies on 2 media types, at least 1 remote)
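The 3-2-1 rule is easy to check mechanically. The sketch below encodes it as stated on the slide: at least 3 copies, on at least 2 media types, with at least 1 remote. The `BackupCopy` class and its labels are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BackupCopy:
    location: str  # e.g. "laptop", "lab server" (hypothetical labels)
    medium: str    # e.g. "ssd", "tape", "cloud"
    remote: bool   # True if stored away from the primary site


def satisfies_3_2_1(copies):
    """Return True if the copies meet the 3-2-1 rule."""
    locations = {c.location for c in copies}  # count distinct copies only
    media = {c.medium for c in copies}
    return len(locations) >= 3 and len(media) >= 2 and any(c.remote for c in copies)
```

For example, a laptop SSD, an external hard disk in the same office, and a cloud copy would pass. Three copies on the same kind of disk in one office would not.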
Data preservation and sharing
Other preservation practices
Discover, Integrate, Analyze: Document steps in data processing (create a workflow)
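One lightweight way to document processing steps is to log each one as it runs. The sketch below assumes the data are held as a list of dictionaries. `run_step`, `drop_missing`, and `scale` are hypothetical helpers written for this example and do not come from any workflow tool.

```python
from datetime import datetime, timezone


def run_step(log, name, func, records, **params):
    """Apply `func` to `records` and append a description of the step to `log`."""
    result = func(records, **params)
    log.append({
        "step": len(log) + 1,
        "name": name,
        "params": params,
        "rows_in": len(records),
        "rows_out": len(result),
        "finished_utc": datetime.now(timezone.utc).isoformat(),
    })
    return result


# Two hypothetical processing steps used only to exercise the log.
def drop_missing(records, field):
    """Keep only records where `field` has a value."""
    return [r for r in records if r.get(field) is not None]


def scale(records, field, factor):
    """Return new records with `field` multiplied by `factor`."""
    return [{**r, field: r[field] * factor} for r in records]
```

Saving the resulting log alongside the output gives a record of which steps ran, in what order, and with which parameters.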
Other practices for discovery, integration & analysis
More Training
Questions?
The full slide deck may be downloaded from:
http://www.dataone.org/education-modules
Suggested citation:
DataONE Education Module: Data Management. DataONE. Retrieved Nov 16, 2016. From http://www.dataone.org/sites/all/documents/L01_DataManagement.pptx
Copyright license information:
No rights reserved; you may enhance and reuse for your own purposes. We do ask that you provide appropriate citation and attribution to DataONE.