Data Strategies Part 4: �How can Federal agencies harness partnerships and innovation to maximize the impact of Earth science data resources?
January 23, 2025
Raleigh Martin, NSF
Leslie Hsu, USGS
What are we discussing today?
How do we continue to enable and support collaborations that span organizations, including agencies, even if we need to implement data strategies in different ways?
Recap
In July and November 2023, and January 2024, we heard short presentations from: ��USGS, USDA, EPA, NASA, NOAA / NOAA Fisheries, DOE, NIH (NIEHS), and NSF
July 2023: Sharing challenges, plans, and priorities
Introduction to Federal Data Strategies
U.S. Federal Data Strategy
OSTP Memos
Public Access Plans
Agency Data Strategies
Agency presentations
Viv Hutchison, USGS
Cyndy Parr, USDA
Ann Vega, EPA
Joel Scott, NASA
Monica Youngman, NOAA
Karen Sender, NOAA Fisheries
Topics
What are your biggest barriers to implementing data strategy?
Take-aways and next steps.
November 2023: Part 2
Part 2
Compilation of Challenges
Q&A from July questions
New Agency presentations
Jay Hnilo, DOE
Michelle Heacock, NIH (NIEHS)
Sourced New Topics
Data discovery
Hosting files to support Public Access
Paths to data.gov
Policy for data analyzed with generative AI
January 2024: Part 3
Part 3
Compiled links from participating agencies
Discussion of trust and ethics in relation to our data strategies
New Agency presentations
Raleigh Martin, NSF
Sourced New Topics
Ideas for partnership
Don’t forget OSTP coordination groups
Formality versus smaller conversations
Around the table
Today's
Speakers
Nancy Ritchey, NOAA National Centers for Environmental Information
Katie Baynes, National Aeronautics and Space Administration
Leslie Hsu, United States Geological Survey
Raleigh Martin, National Science Foundation
NOAA/NCEI
NESDIS
Provide secure and timely access to global environmental data and information from satellites and other sources to promote and protect the nation's security, environment, economy, and quality of life
Dept of Commerce
Create the conditions for economic growth and opportunity for all communities
NOAA
Science (climate, weather, oceans coasts, and sun), service (share knowledge and information), and stewardship (conserve and manage marine ecosystems)
NCEI
Provide environmental data, products, and services to help drive resilience, prosperity, and equity for current and future generations
National Oceanic and Atmospheric Administration (NOAA)
National Environmental Satellite, Data, and Information Service (NESDIS)
National Centers for Environmental Information (NCEI)
11
NCEI Mission
NCEI provides environmental data, products, and services covering the depths of the ocean to the surface of the sun to drive resilience, prosperity, and equity for current and future generations.
NCEI Vision
A tenacious and trusted leader in environmental information for a rapidly changing world with a focus on driving lasting good across our partnerships, our economy, around the U.S and the world through generations.
September 2024
National Centers for Environmental Information - NCEI
DOC/NOAA/NCEI: Key data investments
Leadership and Guidance:
Infrastructure:
Data Partners:
12
National Oceanic and Atmospheric Administration ⎸National Centers for Environmental Information
Building an Innovative Infrastructure
13
National Oceanic and Atmospheric Administration ⎸National Centers for Environmental Information
14
NCEI’s Value to the Nation
NCEI Stakeholders by Sector
23.5%
21.1%
17.6%
12.4%
10.9%
6.9%
23.5%
21.1%
17.6%
12.4%
10.9%
7.6%
Science, Technology, and Engineering
Ecosystems (Agriculture/Aquaculture)
Transportation and Infrastructure
Energy
Insurance, Finance, and Legal
Health and Emergency Management
Higher Education
September 2024
National Centers for Environmental Information - NCEI
Authoritative Information and Services
“In the context of authoritative products and services, the notion of “authoritative” means…
i
… conferred by users
… credibly
represent earth system
… carefully sourced and transparent
NCEI:
Aim here
“science”
“service”
“stewardship”
September 2024
National Centers for Environmental Information - NCEI
16
Providing Climate Information to Inform the Future
Tornado Climatology
Climate Extremes Index
Regional Snowfall Index
U.S. Drought Monitor
Blended Sea Winds
Monthly U.S. & Global Climate Reports
U.S. Billion-Dollar Weather�& Climate Disasters Report
Hourly Precipitation Data
September 2024
National Centers for Environmental Information - NCEI
17
Coast, Oceans, and Geophysics
Providing data and information from the Sun to Earth’s seafloor
Deep Sea Corals Data Portal
Model Reanalysis
Ocean Exploration Digital Atlas
World Ocean Database
Bathymetry and
Global Relief
Enhanced Magnetic Model
Gulf of Mexico Data Atlas
Passive Acoustics
September 2024
National Centers for Environmental Information - NCEI
Imagine a world of interconnected Data & Services!?
A linked and open “knowledge network”, with many doors of entry
EVERYTHING needed to use and understand the contents of the NOAA archive is embedded in the knowledge graph, and readable by both people and machines
18
National Oceanic and Atmospheric Administration ⎸National Centers for Environmental Information
NOAA/NCEI
5. Ideas for the future (e.g., “It would be great if we could…”)
Advancing Earth System Science End-to-end
03.19.2024
Technology
Flight
Research and Analysis
Data and Modeling
Earth Action
NASA
NASA
NASA
U.S. Geological Survey
Created by an act of Congress in 1879, the USGS provides science for a changing world, which reflects and responds to society’s continuously evolving needs. As the science arm of the Department of the Interior, the USGS brings an array of earth, water, biological, and mapping data and expertise to bear in support of decision-making on environmental, resource, and public safety issues.
Contributors
Leslie Hsu�Coordinator, USGS Community for Data Integration
Mike Frame�Chief Data Officer
Greg Gunther
Chief, Science Data Management Branch
Department of the Interior
U.S. Geological Survey
Core Science Systems
Science Data Management
Science Analytics and Synthesis
Selection of USGS Data Investments
Support throughout the science data lifecycle - most focused on USGS scientists and their partners
1. Digital repositories - ScienceBase, Other disciplinary repositories (e.g., National Water Information System)
2. Identifiers - Asset Identifier Service (previously the USGS DOI Tool)
3. Tools and Resources for data - e.g., Data management page, Metadata Wizard, sciencebasepy, ScienceBase Data Release Tool and Team
4. Trainings and Communities - Data and Software Carpentries Trainings, Data Release Trainings, Community for Data Integration (including seed funding for scalable data projects)
5. Advanced Scientific Computing - HPC Training, Hybrid Computing Solutions
USGS Selected Opportunities
Need | ⟶ ⟶ ⟶ Opportunity |
Increase data discovery and determination of fitness for use | Encourage cross-agency and cross-community disciplinary standards and analysis/use of existing metadata |
Increase accessibility of tools, working code, and authoritative data that are currently restricted | Explore, design, improve, and encourage platforms that allow multi-organization access |
Evolve toward more integratable and analysis-ready data across the USGS disciplines | Collaborate to provide more data in support of AI adoption and training |
Increase USGS data availability | Informed participation in Cloud Vendor Open Data programs |
USGS Data Collaboration examples
Many of our collaborations are at the discipline-based level (Water, earthquakes, fire, invasive species, critical minerals, astrogeology).
Collaboration with Department of the Interior bureaus is also high, since USGS is the science agency providing science for DOI decisions and we fall under the same Dept of Interior policies.
There is an opportunity to aggregate data collaboration examples at a USGS-wide level and use them as models for further progress.
USGS - ideas for the future
Map the major agency-supported systems and capabilities between agencies holding similar data types, building in linkages between separate systems, and demonstrating integration between data holdings. (has this been done?)
How can we avoid multiple copies of Agency key datasets being rehosted and shared within Agencies? Some of this is to due network challenges, size, versioning, awareness, etc.
Agency Open Data collaborative efforts with Cloud Vendors (i.e. NASA, NOAA): Explore how can government agencies more effectively deliver, analyze, and save with a more government wide approach to government open data interactions with Cloud Vendors. Right now, each Agency is interacting independently with Vendors, often getting mixed messages, agreements and benefits.
Communicate more on AI training data, leveraging NAIRR, catalogs.
USGS - additional info
Leadership and Guidance:
The National Science Foundation (NSF) was established in 1950 by Congress to:
NSF Directorate for Geosciences (GEO):
The vast majority of NSF’s budget is allocated through financial assistance awards (grants, cooperative agreements) to external entities (e.g., universities, non-profits)
4 layers of NSF investment:
Gaps and unrealized innovation opportunities
Need | ⟶ ⟶ ⟶ Opportunity |
Sustaining long-term funding for data resources within short-term grant cycles | Diversify cross-agency and cross-sector funding sources for data resources |
Managing scope (i.e., data repository funding limited to serving a specific user community) | Foster collaboration and integration across data resources to broaden impact |
Adapting to change (e.g., data volumes, new methodologies) | Develop communities of practice that enable coordination and diffusion of leading practices |
NSF-funded OpenTopography facility is harnessing cross-agency data products (e.g., 3DEP) and developing cross-sector partnerships (e.g., via IAA with USGS)
The NASA-funded Astromat builds on NSF investment in sample data curation through SESAR
NSF leads the National AI Research Resource (NAIRR) pilot, which
is coordinating access to data, computing, software, and training across 12 federal agencies and many private partners
Ideas for the future
Foster further partnerships and interoperability across agency-supported data repositories
Streamline pathways for transitioning research-oriented data innovation into enterprise data capabilities
Identify use cases around which to accelerate cross-agency and cross-sector partnership
Discussion and
audience input
What interagency partnerships do you benefit from or are aware of?
What Federal agency data innovations do you wish were more broadly applied?
4 Panel Questions to rank/choose from
Closing reflections
How do we continue to enable and support collaborations that span organizations, including agencies, even if we need to implement data strategies in different ways?
Thank You! Let’s Connect.
Sign up for the ESIP Update and get a weekly newsletter for the #EarthScienceData community: