Overview of NSF’s Office of Advanced Cyberinfrastructure
Katie Antypas
Director, Office of Advanced Cyberinfrastructure
Confab25
April 9, 2025
1
NSF Office of Advanced Cyberinfrastructure (OAC)
2
Transforming science and engineering research and education�through an integrated cyberinfrastructure ecosystem
Katie Antypas
Office Director
Amy Walton
Deputy Office Director
OAC’s unique role with partners across NSF Directorates to advance research and education
3
Cyberinfrastruture for all S&E Domains
.
Challenges:�Significant ecosystem disruption across many fronts
4
Nyx simulation of Lyman alpha forest
Data
Growth of data from sensors, detectors and instruments creates new user requirements
Post-Moore
Specialized hardware and accelerators
AI
Rapid advance and integration of AI into scientific applications and workflows
Business Models
New business models and entrants for computing and data infrastructure
Scaling out
Cross-agency initiatives, global competition, and need to broaden access to resources
Opportunities and Strategic Priorities
5
Enabling discovery through integrations of data, software and the infrastructure ecosystem
Growing and developing communities, workforce and partnerships
Investments in new technology adoption and scaling advanced infrastructure
Discovery through Integration
Building Workforce, Communities and Partnerships
Advancing Infrastructure
Data
Post-Moore
AI
Business Models
Scaling out
Challenges
Discovery through Integration
6
Integrates and coordinates access to traditional HPC
Adds integration to private sector resources, AI models, services, collaborations, datasets
Guidance, support, expertise to build community gateways leveraging ACCESS and other resources
Data caching and sharing service
Data discovery services
Integration services for data
Integration of national scale resources
Integration services and aggregation of campus and regional computing
Building Workforce, Communities and Partnerships
7
Cybersecurity framework
Facility data lifecycle
Building a Computational and Data-Intensive Research Workforce & Network in the Mid-Atlantic Region
CyberMAGICS Workshop: 2118061
Advancing Infrastructure
8
GPU and FPGA support for the Galaxy framework
National Deep Inference Fabric
A transparent research computing fabric to allow scientists to probe and alter internals of trillion-parameter AI models:
NAIRR Pilot ��
9
Building new partnerships and communities
Providing novel infrastructure and services
Integrating cyberinfrastructure
National AI Research Resource (NAIRR) - A vision to drive US AI innovation, discovery, and competitiveness
10
Envisioned NAIRR Architecture
National goals
Computing and software
User support and expertise
Datasets and Models
Educational materials and training tools
Central Portal
NAIRR Task Force Report
NAIRR Pilot
26 industry and non-profit partners are contributing in kind
state-of-the-art resources
Research Resources
Computing
Datasets
Educational/training opportunities
Models, software, platforms
Collaborations
DARPA
DOD
DOE
DoEd
FDA
NASA
NIH
NIST
NOAA
NSF
USDA
USGS
USPTO
VA
NAIRR Pilot
Pilot Year 1: We built the key foundations needed for a full NAIRR
12
Expand AI research & education pathways
Portal deployment, request process, on boarding, resource matching, support, training
Establish strong govt-industry partnership
A new multi-lateral model based on voluntary in-kind contributions to bring the most advanced resources to US researchers
14 agencies
26 industry and non-profit partners
Portal
Establish single access point for US users
Integrate & deliver contributed resources
Back-end operations to exploit and track all contributed resources: compute, data, models, expertise
80% of partner resources are engaged and being utilized
600+ requests to date
350 approved projects underway in 42 states
Technical Demo Projects, NAIRR Secure, NAIRR Classroom, community workshops & outreach, data set contributions
Initiated over 20 funded projects and pilots �--> Focus of Y2
Results
NAIRR Pilot
Training the next generation of AI researchers and leaders across the country
13
300+ Research projects supported across 43 states
Researchers must be US based. Researchers come with funding support from 11 different agencies including NSF, NIH, NOAA, DOE, DOD, DARPA, ONR, ARO, NASA, USDA and VA as well as foundation and non-profit funding.
NAIRR Pilot Allocations by Science Category
(A100 GPU hours equiv)
NAIRR Pilot
Research community requires resources of many scales
14
As of 3/31/25
Largest computing awards in collaboration with industry and DOE
Other projects: NAIRR classroom, model access, novel architecture hardware
(GPU Hour bins)
Large
Small
| #of Projects |
Agency Supported | 266 |
Industry Supported | 172 |
Note: Some projects are granted access to multiple resources
NAIRR Pilot
Some NAIRR Pilot project highlights
15
Effective Backdoor Mitigation Depends on the Pre-training Objective PI: Blimes, U of Washington
Training Vision Language Models for Agricultural Resilience
PI: Ganapathysubramanian, Iowa St.
Investigations of compressed language models for accuracy and robustness
PI: Srikumar, U. Utah
Rapid Assessment of Wildland Fire Position and Plume Dynamics using Coordinated Multi-Unmanned Aircraft System Sensing
PI: Scherer, Carnegie Mellon University
NAIRR Pilot
Results of early Request for Information: Researcher challenges and barriers
16
NAIRR Pilot
NAIRR Pilot Dataset Opportunity and �Data Infrastructure Investments
17
National Data Platform
PI. Ilkay Altintas
Advancing the Open Science Data Federation Platform
PI. Bockelman
NSF 25-018
Submissions must discuss:
Under review
NAIRR Pilot
Building a NAIRR community to share best practices, provide critical feedback, and onboard new students
18
We supported an inaugural NAIRR Pilot convening in February:
NAIRR Pilot
Leadership Class Computing Facility
19
20
Leadership Class Computing Facility Awarded
Distributed Science Centers
Leadership Capability
Partners
The LCCF, led by the Texas Advanced Computing Center (TACC)
Will support largest supercomputing capability at an academic institution
Current Status
LCCF provides the computational and data analytics ecosystem that provisions large-scale capabilities for S&E research to enable discoveries that would not be possible otherwise.
Total Facility Funding: ~$457M
Questions
21
Backup
22
NAIRR Pilot Insights and Lessons From Year 1
23
The public-private partnership model is VERY advantageous.
High demand for the array of NAIRR Pilot AI resources.
Intensive coordination is key to connect partners, resources, and users
Access to trusted data and their integration are central challenges
Important to measure performance, impact, and success along the way.
NAIRR Pilot