Pangeo: A community platform for open, reproducible and scalable geoscience
Rich Signell (Open Science Computing, LLC) … and the Pangeo Community!
North Carolina Institute for Climate Studies, March 5, 2024
Pangeo is a Community
Pangeo is a Platform
DATA
Cloud-friendly ndarray data
dask.distributed, dask-jobqueue, dask-mpi, dask-kubernetes, dask-cloudprovider, dask-gateway
LocalCluster(), SLURMCluster(), KubeCluster(), FargateCluster()
Live Demo time!
Pangeo for numerical model output
Pangeo lives in a rich Python ecosystem
Pangeo is in production
The High Speed Network (100GbE+)
Cost of Cloud Storage
No Egress Fees
Egress Fees
See this notebook for how this was calculated…
Zarr format
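As a rough illustration of the chunked layout behind the Zarr format, the sketch below computes which chunk object holds a given array element. It assumes the Zarr v2 convention of naming chunks by their grid indices joined with "."; the array shape, chunk sizes, and helper names (chunk_key, chunk_grid) are hypothetical and chosen only for illustration. It uses the standard library only and is not the zarr package's API.

```python
from itertools import product


def chunk_grid(shape, chunks):
    """Number of chunks along each dimension (ceiling division)."""
    return tuple(-(-s // c) for s, c in zip(shape, chunks))


def chunk_key(index, chunks, separator="."):
    """Key of the chunk holding the element at `index` (Zarr v2 style naming)."""
    return separator.join(str(i // c) for i, c in zip(index, chunks))


# Hypothetical (time, lat, lon) array; these numbers are illustrative only.
shape = (8760, 720, 1440)
chunks = (24, 360, 720)

grid = chunk_grid(shape, chunks)

# Every chunk is stored as its own object, so clients can fetch chunks in parallel.
all_keys = [".".join(map(str, idx)) for idx in product(*(range(n) for n in grid))]

# Only one chunk needs to be fetched to read this single element.
key = chunk_key((100, 400, 10), chunks)
print(grid, len(all_keys), key)
```

Because each chunk is an independent object, a reader that needs a small subset of the array requests only the matching keys rather than the whole dataset.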
Kerchunk
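The idea behind Kerchunk can be sketched without the library itself: a small reference file maps Zarr-style keys to byte ranges inside an existing file, so the original data never has to be rewritten. The example below is a minimal standard-library sketch under that assumption. The variable name "temp", the stand-in file contents, and the read_ref helper are hypothetical, and the real kerchunk package generates and consumes such references through its own API.

```python
import json
import os
import tempfile

# Stand-in for an existing binary file: an 8-byte header, then two 10-byte chunks.
payload = b"HEADER--" + b"chunk-zero" + b"chunk-one!"
with tempfile.NamedTemporaryFile(delete=False, suffix=".bin") as f:
    f.write(payload)
    path = f.name

# Reference set in the style of a version 1 reference file:
# each key maps to inline content (a string) or to [url, offset, length].
refs = {
    "version": 1,
    "refs": {
        ".zgroup": json.dumps({"zarr_format": 2}),
        "temp/0": [path, 8, 10],   # hypothetical variable "temp", chunk 0
        "temp/1": [path, 18, 10],  # hypothetical variable "temp", chunk 1
    },
}


def read_ref(refs, key):
    """Resolve one key to bytes: inline strings directly, ranges via seek and read."""
    entry = refs["refs"][key]
    if isinstance(entry, str):
        return entry.encode()
    url, offset, length = entry
    with open(url, "rb") as fh:
        fh.seek(offset)
        return fh.read(length)


chunk0 = read_ref(refs, "temp/0")
chunk1 = read_ref(refs, "temp/1")
os.remove(path)
print(chunk0, chunk1)
```

On object storage the seek-and-read step would become an HTTP range request, which is what lets existing archival files behave like a cloud-optimized dataset.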
Cloud-Optimized Data
Benefits of the Pangeo Framework:
Deploying Pangeo
Learning Pangeo
Benefits of Cloud Native for Science: