A Research Agenda for the Evaluation of �AI-Based Weather Forecasting Models
Presented by: Imme Ebert-Uphoff (CIRA)
Jebb Q. Stewart (NOAA)
Jacob T. Radford (CIRA, NOAA)�
With contributions from many others: see next slides.
CIRA = Cooperative Institute for Research in the Atmosphere @ Colorado State University
NOAA = National Oceanic and Atmospheric Administration @ Boulder, Colorado
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Today’s presenters
Jebb Q. Stewart
NOAA
Jacob T. Radford
CIRA, NOAA
Imme Ebert-Uphoff
CIRA
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Other co-authors / contributors
CIRA�
NOAA-GSL
NOAA-PSL
CIRA = Cooperative Institute for Research in the Atmosphere �@ Colorado State University
CIRES = Cooperative Institute for Research in Environmental Sciences
@ University of Colorado Boulder
GSL = Global Systems Laboratory
@ Boulder, Colorado
PSL = Physical Science Laboratory
@ Boulder, Colorado
AI-based forecasting - evaluation needs
Lots of great research already happening, but need coordinated effort that includes:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Potential Opportunities of pure AI-based models
Key strength #1: �Speed�(Up to 1,000x -10,000x faster than NWP models.)
Create large ensembles
(using Generative AI)
Include complex mechanisms, that are too computationally expensive for NWP.
Increase temporal resolution
Increase spatial resolution
Key strength #2: �Less computational power needed
to run models. �(Still need significant GPU power to train models.)
Potential Opportunities
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Ask fundamental science questions about predictability
Potential Opportunities of pure AI-based models
Key strength #1: �Speed�(Up to 1,000x -10,000x faster than NWP models.)
Key strength #2: �Less computational power needed
to run models. �(Still need significant GPU power to train models.)
Potential Opportunities
AI models present enormous potential to improve our ability to predict the weather!
But …
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
But … we need to answer many questions before we can put these models into widespread use
Sample questions:
Forecast Value:
Questions to help answer the above:
Data-related:
Operational use (e.g., at NOAA):
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
The perfect storm: If someone had tried to create chaos on purpose - they could not have come up with a better combination of disruptive factors.
Model development:
Underlying principles:
Hardware needed:
Feedback from forecasters during development:
Expertise needed:
Where to learn about models:
Time to develop a new model:
Acknowledge impact of AI model development on meteorological community
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
The perfect storm: If someone had tried to create chaos on purpose - they could not have come up with a better combination of disruptive factors.
Model development:
Underlying principles:
Hardware needed:
Feedback from forecasters during development:
Expertise needed:
Where to learn about models:
Time to develop a new model:
AI models seem to
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
The perfect storm: If someone had tried to create chaos on purpose - they could not have come up with a better combination of disruptive factors.
Model development:
Underlying principles:
Hardware needed:
Feedback from forecasters during development:
Expertise needed:
Where to learn about models:
Time to develop a new model:
AI models seem to
This seems to result (anecdotal evidence) in:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Forecaster perspective
Forecaster perspective - key questions
Goal: Laying the groundwork for AI-based models in forecast operations
Questions we need to answer:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Forecaster Perspective - key questions
2) How do these differences influence:�
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Status: Very little research has been done in this area.
Need big research effort on all of these topics!
Forecaster Perspective
�
Research needs:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Output variables currently very limited.
So far primarily optimizing image similarity measures from computer vision, rather than meteorological features.
Version management:
Forecasters not yet trained on use of these models.
Forecasters rarely involved in development and evaluation so far.
Added value for forecasters: �Not yet known
Development by many different groups, primarily AI companies
Development still in
early stages
Hard to maintain access to all models:
Feedback from forecasters:
Not yet available
Hard for meteorological community to keep up and contribute
AI models getting very complex
Need coordination!
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Training Data
Data perspective: Available datasets for training
NWP models are primarily based on physics.
AI models are based on data. Issues that come with the data:
Data issue #1: Limited availability of reliable / HR data sets:�
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Available data sets
Dominant dataset for training of AI global weather forecasting models so far: �
New datasets: �
Research needs:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Data perspective: New biases?
Data issue #2: If the training data are biased, the AI models inherit those biases.�
Depending on data, we risk introducing new types of biases that NWP models do not have.�Example: Data quality may differ regionally based on available sensors - due to terrain, but also due to economic/historical differences, etc. �
Coded bias - documentary
Research needs:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Data readiness/bias: Sample resources/approaches
NCAI (NOAA Center for AI) & ESIP (Earth Science Information Partners) developed
NSF AI Institute for Research on Trustworthy AI in Weather, Climate, and Coastal Oceanography (AI2ES; https://www.ai2es.org/):
Research needs: Good start above, but a lot more work needed, especially for
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Access to trained AI models
Access to trained AI models
Trends in 2022-2023:
Potential trends in 2024:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Trends in AI4NWP model development - increasing model complexity
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Access to real-time and archived forecasts from AI models
Access to real-time forecasts
�
Sample visualization of real-time forecasts from various AI models available:�
ECMWF’s model charts for AI-based weather models: �https://charts.ecmwf.int/catalogue/packages/ai_models/ �Models: FourCastNet v2, Pangu-weather, GraphCast, AIFS (ECMWF’s own model), FuXi.�Forecasts initialized with IFS data.
CIRA-NOAA’s real-time visualizations for purely AI-based weather models:�https://aiweather.cira.colostate.edu/�Models: FourCastNet v2, Pangu-weather, GraphCast, GFS, IFS.�Forecasts initialized with GFS initial conditions.
Research needs:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Example: Our Real-Time Website
Developed by �Jacob Radford (CIRA/NOAA-GSL)
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Example: Our Real-Time Website
Developed by �Jacob Radford (CIRA/NOAA-GSL)
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Insert figure of comparison mode here.
Your tool is worth spreading over two slides!
Want to simplify inter-model comparisons
Access to archives of AI model forecasts
�
Research community needs easy access to multi-year archive of forecasts.�
Sample activity:
CIRA-NOAA is building an archive of forecasts
saving forecasts in 6h time steps.
Research needs:
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Developed by Jacob Radford (CIRA/NOAA-GSL)
Derived fields: Severe-weather parameters
We have developed accurate, efficient, unit-tested code to add the following params to the CIRA archive:
Initial archive: one month (May 2023) for all models (4 daily runs of GraphCast, FourCastNet v2, Pangu)
Full archive: coming soon (needs HPC resources)
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Developed by Ryan Lagerquist (CIRA/NOAA-GSL)
Evaluation of AI models
Evaluate output of AI model as if it was an NWP model.
Applying objective validation measures from NWP models
There is a fairly regular, standard suite of verification applied to either global, coarse resolution, or regional, fine-scale model forecasts that can be used to evaluate forecast output.
There are two broad classes of meteorological model output fields: continuous and feature-specific. �
Deterministic forecast verification
For ensembles, probabilistic verification
Credit: Material on this slide provided by Jeff Duda.
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Status: Lots of research in the works.
Test weaknesses of AI models that NWP models do not have - so no tests exist for those
Develop tests for specific weaknesses of AI models�
Suggested sample topics (not an exhaustive list):
Activity: See prior slide - work by Ryan Lagerquist.�
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Status: Lots of research in the works.
Evaluation of Forecasts of Tropical Cyclones
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Track and Intensity MAE and Intensity Bias for 2023 AI4NWP TC Forecasts
Track Error
Intensity Error
Intensity Bias
Conclusions - key research needs
Need coordinated comprehensive evaluation effort that includes:
Physical validation of AI models
Value to forecasters / impact on decision making
AI pitfalls / Bias identification and mitigation
NWP experts
AI experts
Social scientists
Forecasters
R2O - operational perspective
Software managers
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)
Need to involve:
Urgent need: bring forecasters and social scientists more into the discussion. Social scientists are needed to elicit feedback from forecasters + many other roles.
Red frame:
biggest needs
Acknowledgements
Funding for this work was provided by:
Thank You to the conveners of these EGU sessions on “Forecasting the Weather”:
Yong Wang, Aitor Atencia, kan dai, Lesley De Cruz, Daniele Nerini.
Kyle Hilburn CIRA/CSU
EGU General Assembly 2024 - Session AS1.2 “Forecasting the Weather” (Apr 15, 2024)