Project Pythia
A Community Learning Resource for Geoscientists
Background
Python has become the number one programming language for science [source: IEEE Spectrum 2019, opensource.com]
The “Scientific Python Ecosystem” is enormous and ever-growing
Jupyter Notebooks have emerged as a simplified, web-based mechanism for programming, creating shareable workflows, and supporting back-end computation on “Big Iron”
There is increasing demand for moving analysis workflows to public and private clouds, and for making HPC workflows more public and portable
A snapshot of the Scientific Python Ecosystem
Source: VanderPlas 2017, slide 52
So what is the problem?
Project Pythia Goals
The Pythia Portal
Training resources
Sample content
Content developed and vetted through coursework at the University at Albany
The Pythia Platform
Binder-like utilities that support interactive cloud execution environments for each Jupyter Notebook
Targeted Cloud resource providers:
Pythia Portal + Pythia Platforms
Open Development
Project Pythia will be a community-owned resource and will follow an Open Development model. The user community is expected to contribute by:
All Project Pythia-developed content will be hosted on GitHub
Project Pythia sounds awesome! When can I or my students use it?
https://projectpythia.github.io
Summary
Project Pythia will be a community-owned educational resource that helps geoscientists at all career stages become proficient with the Scientific Python Ecosystem
A particular focus will be scalable, cloud-ready workflows
Community engagement will be essential for the success of the project
Acknowledgements
NSF EarthCube program (award #2026899)
Pangeo community
Numerous technical staff doing the heavy lifting at NCAR, Unidata, and the University at Albany