Project Pythia Meeting Notes
Zoom | password: pythia23
2024-1-23: Education Working Group:
Facilitator: | Note-taker:
Attendees: [Name / Institution / GitHub Username]
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
.
Agenda + Notes:
2023-10-09: Education Working Group:
Facilitator: Drew Camron | Note-taker:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- KT / Pangeo Showcase (ArrayLake) - will ask for ProjectPythia to join private beta
- MG - presenting on ESGF + CMIP6 cookbook at Thursday ESGF webinar
- New example with hvPlot + ENSO
- Adding another one using globus compute for serverless
- Invitation info
Greetings Colleagues:
The next ESGF Webinar will be presented by Max Grover (Argonne National Laboratory) on Thursday, October 12, 2023, at 17:00 CET / 15:00 UTC / 11:00 EDT / 10:00 CDT / 9:00 MDT / 8:00 PDT. The title and abstract are as follows:
Open Science in ESGF: Collaborations and Innovations with CMIP6 Cookbooks
Maxwell Grover, Argonne National Laboratory
User computing is a key focus of the United States Department of Energy-funded Earth System Grid Federation (ESGF) 2 project. The Climate Model Intercomparison Project v6 (CMIP6) dataset is just one example of a critical dataset hosted by ESGF cyberinfrastructure. We worked with existing communities that have developed a suite of example computational workflows, mainly the Pangeo community educational working group, Project Pythia (https://projectpythia.org/cmip6-cookbook). We tested the existing content, and contributed to new example workflows, highlighting new tools developed as a part of the ESGF2-US project. These recent updates include comparisons with observational datasets such as data from the DOE Atmospheric Radiation Measurement (ARM) user facility, and tutorials on using Globus Compute frameworks to execute climate analysis workflows at scale.
Please register at https://docs.google.com/forms/d/e/1FAIpQLScc9hr9WkmHM1zLA__ZCCYibNGMvSVfEztrBmX4wGGdJgAvPA/viewform?usp=sf_link to receive an email and calendar invitation with a link to join the webinar on the day of the event.
Agenda + Notes:
- Make sure to repost - great way to bring people in!
- New resource highlight post structure
- Questions and to-do items for new meeting schedule
- New meeting notes/agenda routine?
- Calendar updates + some documentation
- Discourse announcement
- Consider doing a Pangeo Showcase: 11/29, 12/6, 12/13 available at this time
2023-10-03: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
- Julia Kent / NCAR / @jukent
Agenda + Notes:
- Brian leading this White House challenge
- Starting to draft things up
- Not to be used for hackathons
- Encourage public to engage on Discourse, this is for quick turn around with the active team
Action items:
- Draft highlight of cookbooks
2023-10-02: Infrastructure Working Group
Facilitator: John Clyne | Note-taker: ???
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Orhan Eroglu / NCAR / @erogluorhan
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- James Munroe / 2i2c / @jmunroe
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- JC: John giving a presentation on Pythia to UCAR’s new Center for Ocean Leadership program on Wednesday.
- KT: Fall semester class will be playing in the Git/GitHub Foundations sandbox the next week or two
- BR: Looking into submitting something to the The White House Office of Science & Technology Policy Open Science Recognition Challenge. Main requirement is a 5000-character narrative for a general audience “telling the story of an open science project”. Winning submissions will be highlighted as Champion of Open Science, featured on federal websites and possibly invited to a future workshop. Due date 11/22/23
- BR: Full list of GEO OSE awards has been released.
- DC: cookbook review template update
Agenda + Notes:
- Community meeting - 1st Monday of the month, 1pm MT
- Kick off Nov. 6
- Attendance: everyone
- Chair - Brian
- OWG - unchanged (every other Tuesday, 9am)
- Iteration planning meeting - 3rd Monday of the month, 1pm MT
- Kick off Oct 16
- Attendance: TBD
- Chair - Orhan
- First meeting of the new schedule
- Who should attend?
- OE suggests any folks developing for Pythia can be present, consider excluding “stakeholders” regularly
- Irregular contributors can fluctuate attendance; “core” developers should attend regularly
- Regular attendees identified from major project planning for next 4-5 iterations
- Open Ad Hoc meeting – 4th Monday of the month, 1pm MT
- Kick off Oct 23
- Attendance: varies
- Starting in CY24, pending Brian and Kevin’s academic schedule:
- Community meeting - 1st Tuesday of the month, 1pm MT
- OWG - TBD
- Iteration planning meeting - TBD
- Open Ad Hoc meeting – 4th Tuesday of the month, 1pm MT
- Scheduling to be revisited with updated academic schedules
- We should choose a specific asynchronous space in support of fewer face-to-face meetings
- Considerations re Zulip, Slack, etc.
- JC will explore costs of Slack for the project. Multiple voices in support of persistent Slack workspace for small team with support for intermittent hackathon/community participation
- Status of high priority IWG issues:
- Pathway for community submissions of data to JS2 - none currently exist
- Only real path for JS2 at moment is directly through KT - if there’s repeat need for particular data right now, start here
- Requested administrative permission (had view only)
- Supposedly administrators are JC, BR, KT, and RM based on a comment in an issue from 2 years ago
- Update: just got access! Thank you everyone - it’s always a different menu
- Automation in progress now!
- Currently testing theme specs against variety of sphinx extensions, generally positive so far
- Fix in, should be passing now
- Automation of requesting reviews from identified cookbook maintainers
- cookbook-template#114 - likely not an issue, being addressed by BR
- cookbook-actions#62 - could be addressed quickly, but has some nuance
- cookbook-actions#70 - could be done easily if chosen, but is very low priority and currently exists only to run once as a snapshot of the workflow as copied from template
- physical-oceanography-cookbook#37 - tied to larger data problem
Action Items:
2023-09-25: Education Working Group
Facilitator: Brian Rose | Note-taker: ??
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- KT/OWG: September Quarterly Newsletter: suggest any subjects that do not already appear in the outline (and sign up for one of the listed topics)
- Tuesday Wednesday 9/26 9/27 5:00PM EDT/3:00 MDT: Binderhub will be down for maintenance (estimate 30 minutes)
- JK - running into permissions errors exploring Google Analytics Python API
- MG - thoughts on applying for OSTP Open Science recognition for Pythia? https://www.challenge.gov/?challenge=ostp-year-of-open-science-recognition-challenge
Agenda + Notes:
- People should have received a poll about meeting times; please reply ASAP so we can move forward
- Need to decide when to get started, and when the first Community meeting will be
- We will discuss asynchronously on this
- NCAR SIParCS 2024 internships [JC]
- project proposals due October 9 at noon
- Ideas on submissions?
- Share thoughts with JC about topics
- Room for team-based mentorship like we did 2 years ago?
- Interest within CGD about a CESM-related cookbook?
- Discussion: What makes a cookbook a cookbook, and not just a notebook? [JC]
- When do you encourage people to create a cookbook and not just a notebook?
- We’ve made it really simple for people run the code in Cookbooks. Don’t need lots of previous knowledge etc.
- A notebook demonstrates a relevant task, should we encourage people to turn it into a Cookbook?
- Would these recent ESDS blog posts make good cookbooks?
- What about this Project Raijin notebook on visualizing MPAS and CAM-SE without resampling?
- There are obvious benefits to having cookbooks for the above content
- There are also challenges (e.g. data access, additional effort for authors)
- Are there things we should do to encourage and facilitate creation of cookbooks for topics like the examples above?
- Should making a Cookbook imply taking responsibility for maintenance?
- Are people making these blog posts being turned off by the burden of making a cookbook?
- Can we keep making it easier to turn a notebook into a Cookbook?
- Keep looking at cookiecutter solutions?
- Easier to get a blog post going within NCAR due to reliance on local data
- Scope of cookbook should be to answer some kind of question. It’s bigger than a one-off notebook. Everything should be as easily reproducible as possible. If not binderhub, then simple instructions for local execution.
- Running cookbooks on privileged compute services? Can we route notebook execution to binderhubs running on institutional compute services that can access local data (e.g. at NCAR)
- There’s still too much boilerplate in going from our Template to a working cookbook.
2023-09-19: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / Kevin Tyle
- Max Grover / Argonne / @mgrover1
Agenda + Notes:
- Considering implications if Twitter/X goes to a mandatory pay model
- BlueSky: Matplotlib, PyART, condaforge et al. all have presences now
- LinkedIn: some interaction following the repost of the NSF funding announcement; Anaconda/Panel are using it fairly extensively now
- JK/DC exploring the Sphinx blog theme
Action items:
- Set up BlueSky account
- Advertise Hackathon 2024 save-the-date on existing platforms and BlueSky
- Newsletter outline by end of next week (9/29)
2023-09-18: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: Brian Rose | Note-taker: Kevin Tyle
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- James Munroe / 2i2c / @jmunroe
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- KT:
- Just under 50% of our Discover Access allocation has been used to date
- At current rate, we’d hit 100% in December
- Plan to ask for the full allocation we applied for (2x what we were granted, which was expected) by early November
- Plan for an amped-up allocation for the next phase of Pythia
- Get metrics on usage of our binderhub
- Good opportunity to add some additional cookbooks/notebooks that exploit this dataset
- To do: open issues on the Cookbook
Agenda + Notes:
- Checking on action items from last IWG:
- Progress on metrics - Julia
- Plan to follow the google ARCO-ERA5 example
- “Iteration planning meeting” – term from Agile method
- 2i2c: iteration planning meetings occur every 2 weeks
- What is the difference between product owners, stakeholders, and PIs, as it pertains to Pythia?
- Product owner: Orhan
- Stakeholders == PIs
- Stakeholders can also include members of the community
- It’s a bit of a gray area … we don’t have to lock into any strict definition at this time
- The group is in agreement to move forward with this plan!
- TBD: meeting day (Mondays can be problematic due to holidays)
- May not want to align with Pangeo weekly meeting … might be best to have Pythia meetings be analogous to Paneo working groups, that meet on their own schedule (e.g. Array working group)
- Mechanics of cookbooks reviews [JC]
- What is the process (the PR doesn’t include preview, code changes, etc)?
- GitHub issues, and ultimately managing them with GH projects as the best single means of management of those issues
- Cookbook preview infrastructure will function as intended on PR’s of contributed cookbooks. This means “preview” … not the reviewNB. That has to be enabled manually. But may not be necessary anymore … rich text git diffs are now available on GH
- Leverage what PyOpenSci has already done
- How to handle warnings in cookbooks?
- Still an open question. Warnings today may mean failures tomorrow. But across-the-board cookbook failures warrant a more short-fuse investigation
- 161: assigned to medium priority
- 158: likely can be closed, RF will confirm
- 153: assigned to low priority
- 348: high priority: JK will re-title; DC, BR, MG will add presentations
- 77: medium priority
- 424: medium priority
- 157: medium priority
- 114: BR/KT will assess
Action Items:
- Open issue on getting usage metrics for Binderhub
- Open issue for new cookbooks / additions to existing ARCO-ERA5 cookbook given the new ERA5 ARCO dataset availability
- Decide on a day for the meetings
- Reach out to Pangeo folks regarding new Pythia meeting plan schedule (esp. Week 1)
2023-09-11: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- KT: on vacation this week
- JC: Our NSF annual report has been approved
- BR:
- the UAlbany NSF report also approved
- Planning to submit a Pythia abstract to AGU Ocean Sciences
Agenda + Notes:
- Posting on the Pangeo Discourse for feedback
- Quick start guide? We should be more opinionated about how to use Python (tools, env manager)
- JK: This issue on metrics says that MetPy has scripts for this - can someone link them to me?
- Issues with displaying metrics on Pythia website
- Quick metrics discussion on #319
- 2 cookbooks under review:
- Waiting for one more review from John C.
- Keeping track of cookbooks
- How complete should a cookbook be before publishing it to the gallery?
- Should not include cookbooks that are broken or empty
- Can be a single notebook
- JC: as soon as it is useful (Minimum Viable Product - MVP)
- DC: we can use releases to allow for further development
2023-08-28: Education Working Group
Zoom | password: pythia23
Facilitator: Rob Ford | Note-taker: John Clyne
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- JC: submitting annual report today
- KT: AMS Abstract deadline this Thursday, 8/31
- DC: LinkedIn repost with grant announcement made - interact!
- BR: the Albany NSF Annual report also due… will be putting it together this week
Agenda + Notes:
- Set up a timeline
- Explore tools to facilitate/automate
- 2 cookbooks under review:
- Waiting for one more review
- Needs reviewers. Use as a test case for new review criteria?
- N.B. this notebooks requires GPUs, which we don’t have, so this won’t have binder support for now.
- BR: one more cookbook queued up: Cordex vis notebook. Plan is to use this as another review criteria test case.
- JK: has a viz cookbook that is ready to go - another guinea pig.
- BR: we should add a statement in cookbooks, such as VAPOR, that aren’t binderized, that the notebooks can be run locally.
- Summer hackathon - confirmed for 2024 for June 10-14 (John)
- Main Seminar Room: 8-5pm, M-F
- Monday, June 10: Library, Damon & Chapman 10-3pm
- Tuesday, June 11: Library & Chapman & Penthouse 10-3pm
- Wednesday, June 12: Library, Damon & Chapman 10-3pm
- Thursday, June 13: Library, Damon 10-3pm, Penthouse 10-12/1-3pm
- Friday, June 14: Library, Damon & Chapman 10-3pm
- BR: we should put out a save the date. How about on our new blog. John will put a blog together.
2023-08-22: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- AMS abstracts due Aug 24
- Python-tech abstract + educational abstract (provisional title: “Cooking up a Storm”)
- We have an invite code for BlueSky (thanks to Hannah Aizenman)
Agenda + Notes:
- Action items from last meeting:
- Rope in Thomas for Geology discussion [DC]
- September newsletter: about grant acceptance, about COESSING collaboration, UAlbany article about Pythia
- Takeaway priorities: discourse / linkedin > twitter / new portal news page & blog > email list
- Reach out to geologists
- Blog posts, summarize year of cookbooks, what next grant looks like.
- Upcoming hackathon link was kind of hacky, want a more permanent place for updates
- Make past publications more visible and updated
- Update footer of portal and explore upstreaming footer content and formatting to theme
- September for first newsletter to go on Portal (and email list?)
Action Items:
2023-08-21: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: John Clyne | Note-taker: Brian Rose
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Orhan Eroglu / NCAR / @erogluorhan
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- James Munroe / 2i2c / @jmunroe
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates, including accomplishments)
- JC: Planning to submit NSF report *sometime* this week. If you have further input, please let me know ASAP. Thanks for your contributions!
- JC: Summer hackathon confirmed for week of June 10-14. Please add to your calendar!!! In Boulder at the Mesa lab
- DC: time to continue Pythia Cookbook Review Criteria discussion at next EWG
- MG: Presenting on Radar Cookbook at AMS Radar next week
Agenda + Notes:
- AMS deadline is later this week (Thursday Aug 24 – but might be extended)
- OWG chatted about 2 abstracts
- Kevin: Pythia in educational setting (Programming languages of open science conference – teaching, training, and building communities around Python)
- Drew: Community and technology updates
- Look for abstracts later this week
- Chance that Pythia might be invited for a core science keynote… stay tuned on this.
- Should we reach out to Cookbook contributors to encourage submissions of content to AMS?
- Post to the hackathon slack [DREW]
- Authorship of conference abstracts: How do we decide what names are included?
- Bigger issue of tracking contributions across all of Pythia (and getting it on our home page)
- This should be automatable with a github “enterprise” account.
- Open a trial and test it out?
- But pricing is per member of org, so this is not likely to be viable solution for Pythia
- Look at Jupyter project’s “tiers” of participation
- Short term action: use last year’s author list, remove names of people who are definitely gone from the project, add new folks.
- Update status, reassess “high priority” items
- Lots of issues are “high priority”, but there can only be one “highest priority”. What is it?
- Going through and closing or updating lots of issues
- Still leaving behind quite a few high priority issues
- Which issues are the biggest roadblocks for growth of community and content?
- Data storage for Cookbooks?
- Storage (1 TB) is available on our current Jetstream2 allocation. We just need to develop pangeo-forge recipes to get data written to the right storage buckets
- The Sphinx theme problem? It’s not currently holding anyone back but our whole infrastructure might break with no warning.
- Also security risks from using old package versions
- Authentication on our Binderhub?
- Who has the ability to push solutions forward?
- Metrics issue might be lower effort
- Consensus on highest priority items for IWG:
- These two tasks may need further consideration to prioritize
- Agile for Pythia - some scattered ideas/points
- Orhan looking into similar projects and their Agile applications as well as reading
- Met with NCAR’s Cloud Pilot project’s Nick Cote
- Start with an Agile (Scrum, Kanban, Scrumban, etc.) and know things would change over time because of learning the structure
- Continuous feedback and improvement
- Determine Team capacity
- Re Pythia’s pile of high priority issues: Looking at issues of the last x (say 30) days would give some ideas
Action Items:
- Post to hackathon slack encouraging Cookbook contributors to submit content to AMS meeting [DREW]
- Progress on metrics [JULIA]
- Take a look at possible Sphinx theme solutions [DREW / KEVIN]
2023-08-14: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / Kevin Tyle
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
Agenda + Notes:
- AMS Abstracts due soon (8/24). Current plan for 1 educational “in the classroom” abstract (KT) and 1 “tech/community update” abstract (DC). JK will also be co-presenting a data viz short course (including Vapor)
- Avenues for grant announcement and asynchronous community updates - will start a “newsletter” report, aiming for 2-3 month frequency to be posted to Discourse/LinkedIn and directly to the portal via blog (JK)
- Metrics are accessible … more comprehensive and meaningful than Twitter
- Once we get more content posted, the metrics will be more useful. Right now just 10 followers … mostly Pythia core members
- Portal review and update, including hidden “about” section w/ publications
- Dave Hart posted this story on the CISL website about Pythia 2.0
- Summer hackathon, 2024. Two options for dates (John)
- Main Seminar Room available 8am to 10am, 3pm - 5pm
- *** Agreed to pick June 10-14th. Room availability is key ***
- Can attend: Ryan, John, Brian, Drew, Kevin, Julia, Max
- Maybe attend: Orhan, Julia, James Munroe
- Library, Damon, and Chapman available for breakouts
- June 17-21st: Not enough room availability and NCAR is closed 6/19!!
- Can attend: Ryan, John, Brian, Drew, Kevin, Orhan, James, Julia, Max
- Maybe attend: Julia
- Library available all week. Chapman available M,W,Th
- Additional rooms/times may open up as people release holds
- Hybrid will be the plan. Worked well in 2023.
- Everyone is welcome and encouraged to review and provide input (comments or suggested text)
- Should we add content to better support open science? Draft forum posting:
- Pythia’s educational materials cover many of the technologies and practices that are vital to Open Science principles such as transparency, accessibility, and reproducibility. But for a scientist not accustomed with many of these technologies, putting everything together in a way that allows them to create a shareable, reproducible workflow may not be obvious. Should Pythia consider adding a section that leverages the Foundations modules and provides more of a step-by-step guide to creating a reproducible workflow with a Jupyter Notebook that is backed by GitHub reproducibility and shareability? Such a guide might include a template repo with a Notebook and conda environment management, and so on.
- JK: Great idea! Environment maintenance is a very frequently-asked question. Will be presenting something similar in a workshop later this month.
- BR: Cookbook template already has environment reproducibility built-in. Not a fan of pinning packages, though.
- JC: Thinking more of a trimmed down version of the cookbook template … but maybe the existing cookbook template is enough.
- JK: might be a good idea to at least reference the versions of the most important packages included
- Cookbook template may be a bit too over-expansive for a use-case where one would just want to reproduce their workflow
- KT: include version #’s in a text document, rather than strictly pin them in environment.yml
- MG: Upload docker image with each unique version
- BR: there is a plugin that shows versions … watermark?
- JC: will create an issue and then post and link to it on the forum
- Pythia 1.0 (EarthCube) -> Pythia 2.0 (GEO OSE)
- 2.0 partners: Albany (lead, PI: Brian Rose), NCAR, Unidatata, 2i2c
- Overlap in PoP (Period of Performance):
- 1.0: Sept 2020 - August, 2024
- 2.0: January, 2024 - December, 2027
- Strive for continuity: PI’s Clyne (1.0) and Rose (2.0) in close consultation on major decisions
- Changes in meeting structure under discussion
- Community meeting: focused on engagement with community. Current meetings are problematic in terms of organization and not being welcoming to community attendees.
- Most of our “getting things done” meetings are focused on infrastructure. Create a new meeting structure that is more tied into the community needs
- Management meeting: operational aspects of achieving our goals
- KT: tie community meetings into the weekly Pangeo meetings?
- MG: Pangeo weekly call morphs into a combined Pangeo/Pythia community call
- BR: 2.0 explicitly calls for a Pangeo Education Steering Council, tasked with setting high-level priorities for what the community is looking for. Avoid having both a Pangeo/Pythia “council” AND the explicitly-proposed Steering Council.
- Turn into checkboxes a bot comments on a PR for adding a file to the Cookbook gallery
- Review for final discussion and implementation at 8/28 EWG call
- Should this be on mainpage vs in the appendix?
- Thoughts on footer menu?
- Thoughts on left navbar structure of ‘about’ etc
- Site needs overhaul, but we need to go forward on this PR!
- A site overhaul might be better suited to a more formal overhaul, ideally with insights from professional website developers
- Sphinx-pythia-theme still needs attention … JK researching it
- Approve existing PR and open new issue so we can move forward
Action Items:
- See items highlighted in bold/purple above
2023-08-08: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- AMS abstracts due Aug 24
- Consider python-tech abstract + educational abstract
Agenda + Notes:
- Press release about grant acceptance, about COESSING collaboration, UAlbany article about Pythia
- Twitter, LinkedIn, Pangeo discourse
- Email list?
- Takeaway priorities: discourse / linkedin > twitter / new portal news page & blog > email list
- Reach out to geologists
- Blog posts, summarize year of cookbooks, what next grant looks like.
- Upcoming hackathon link was kind of hacky, want a more permanent place for updates
- Make past publications more visible and updated
- Update footer of portal and explore upstreaming footer content and formatting to theme
- September for first newsletter to go on Portal (and email list?)
Action Items:
- Don’t lose track of September newsletter summary
- Rope in Thomas for Geology discussion [DC]
2023-08-07: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: Note taker:
Attendees: [Name / Institution / GitHub Username]
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- James Munroe / 2i2c / @jmunroe
- Brian Rose / UAlbany / @brian-rose
- Chinwendu Agbaetuo / @dindustack
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Facilitator sign-up for IWG has been updated through end of year. Please sign up for a slot!
- Facilitator sign-up for EWG needs sign-ups from 11 September to end of year
- Before EWG Monday 8/14, add and comment on Cookbook Review Criteria doc
- JC: NSF Annual report is due August 31. If you have been tagged for input, please provide it by Friday, August 18.
- JC: working on dates for next year’s hackathon. The summer calendar is already VERY full. Please respond to Lisa’s poll ASAP so we can get this nailed down. We may not be able to accommodate everyone’s preferences.
- JC: Paige Martin (and organizers) using Foundations for Coastal Ocean Environment Summer School In Nigeria and Ghana this month. (COESSING)
- JC: Can someone provide some more specific guidance on this post?
- KT: Binderhub
- BR: press release about our newly awarded NSF grant coming out later this week
- Further discussion on tomorrow’s OWG on additional/complementary posts / blog articles that can follow up on the UAlbany article (Brian will not be able to attend, but others on the OWG will discuss).
Agenda + Notes:
- Q&A with Chinwendu
- Issue with a cookbook environment: https://github.com/ProjectPythia/interactive-sentinel-2-cookbook/issues/2
2023-07-31: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JK - working on an advanced visualization cookbook
- BR: our Pythia 2.0 NSF grant has been awarded. A public news release is coming by end of this week
- No facilitator has signed up for next week, please sign up here
Agenda + Notes:
- AGU abstracts: Due this Wednesday, 8/2!
- BR will share an abstract for review asap, planning to submit and present
- AMS abstracts due Aug 24. Will be discussed at upcoming OWG and likely to be well-represented by Pythia.
- BR will likely submit at Ocean Sciences as well
- Consider non-talk presence at eg AMS. Deadlines for short courses, town halls are passed. May be able to blend with Unidata Python @ AMS Student Conference session
- Due August 31
- Stay tuned for writing assignments
- Metrics needed
- No Cost Extension approved (EarthCube grant PoP runs through August, 2024)
- Next year’s hackathon (John)
- Limited dates available for summer. Is fall or spring an option?
- Goal for a cookbook author: follow along with each of the enumerated checkbox items
- Comments?
- BR: Zenodo DOI link check has now been disabled in the template.
- “Good” programming practices:
- BR: appreciate John’s point about best practices, e.g. avoid loops when array syntax is possible. Reviewers should be aware of obvious ways for the code to be more expressive and less error-prone. As we develop a “feel” for this process, we may then be able to describe/define some of these best practices as part of the checklist.
- JC: agree with BR
- BR: each cookbook should have a team. This way whoever is part of the team will be tagged for reviews, etc.
- Timeline: what can an author expect in terms of how long it will take for a cookbook to proceed through review and to final publication on the gallery
- JL: what’s the best way for someone who has a cookbook idea to get an initial “green light” to start proposing/working on a cookbook?
- BR: Discourse/other informal channels would be best.
Action Items:
- DC: will take initial pass at the checklist. Nothing will be deleted yet. Next meeting, get a more formal document finalized.
- BR: whatever final list looks like, it needs to be in two places: contributor’s guide, and as a form that will appear as part of a PR.
2023-07-24: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Andrew Huang / Anaconda / @ahuang11
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- general ideas around good coding practices, like commenting code, doc strings, (re)using functions rather than e.g. copying and pasting code between projects, code review
- python for climate, mostly xarray but if there are other toolboxes that are particularly helpful, introducing them as well
- how to make a maintain a github repo
Agenda + Notes:
- AGU Abstract Submission Deadline: Wednesday 8/2
- The organizers “are looking to include time for a few 25 minute workshops. While discussing options, we thought about having some sort of interactive workshop focused around building a community around open science, and I thought of Project Pythia as a great example of this. I'm not sure exactly how it would work, but I'm curious if you or anyone else involved in the project would be interested in giving some sort of workshop on this topic. We're not looking so much for "here's how to use this tool", but "here's how we operate to create an effective community". “This would be an invited talk.T
- Please consider “Session Title: IN002. Accelerating Science: The Convergence of Inclusive Computational Resources, Cloud-Optimized Data, Open-Source Tools, and Open Collaborative Communities. “ (disclosure: James M. is a session organizer)
- Possible submission for CMIP6 cookbook
- The Dawn of the Era of Cloud based Climate Informatics
- Over the past two decades, the size of data available from climate models and observations has grown several-folds. As a result, the traditional approach of downloading climate data and manipulating it locally is becoming increasingly challenging, even for researchers with access to large national computing infrastructures. We've been hearing for a few years how remote computing is the solution to our problems with data volume. Despite examples and demos showing this can actually work, we're not seeing widespread adoption yet from the scientific community. Is it just inertia or is something missing in that model that is preventing it from widespread adoption? This session is intended to bring together submissions that seek to facilitate this new approach to climate informatics. Submissions on what technologies should be adopted, what feedbacks have been received and obstacles encountered to-date and how to bring value to interdisciplinary climate research are encouraged.
- ESIP Meeting: `kerchunk` + `titiler-xarray`; on the fly web tile generation

- Kerchunk “makes a NetCDF file look like a Zarr file, a cloud-native, efficient file format for reading”
- Visualizing a streamed dataset with kerchunk
- Considerations for coding practices
- “Download as .py” an option?
- Scripts instead of notebook, e.g. https://www.ncl.ucar.edu/Applications/coamps.shtml
- “Too many cookbooks” leading to duplicate implementations, potential noise, reduced quality, difficulty to maintenance
- Repository of “applications” or script https://www.ncl.ucar.edu/Applications/
- Template reorg: copy “how to run notebooks” section to primary sidebar
- Considerations for viewability of both galleries
Action Items
- Drew Camron open issue for zenodo link + release + link check failure and workflow
- Drew Camron will explore LinkedIn (and mention at EWG)
2023-07-24: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: Kevin Tyle; Note taker: Drew Camron
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- James Munroe / 2i2c / @jmunroe
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
Quick Updates:
Agenda + Notes:
- Action items from previous IWG meeting:
- Dates for next year’s Pythia hackathon? Need to get dates settled ASAP from NCAR logistics point of view.
- Action: JC will get the ball rolling with a list of potential dates
- PRs for sandbox. What do we do with them? See latest.
- OPen issues for this (JC)
- Add a bot to auto-comment on new PRs in the sandbox (JK)
- Add a bot to auto-close stale PRs in the sandbox
- AGU abstracts related to Pythia? Deadline coming up early August
- Will be discussed at tomorrow’s OWG meeting
- Please consider “Session Title: IN002. Accelerating Science: The Convergence of Inclusive Computational Resources, Cloud-Optimized Data, Open-Source Tools, and Open Collaborative Communities. “ (disclosure: James M. is a session organizer)
- Persistent sessions: appears easy to implement (ref: https://www.youtube.com/watch?v=dwidVhPtQT4&ab_channel=JupyterCon)
- Multiple simultaneous user sessions: harder
- KT will re-enable GH auth for BinderHub, needs to explore multiple sessions support
- KT is exploring BinderHub on top of JupyterHub w/ JetStream GPU support for cookbooks that can benefit from. Unidata has some expertise here if needed
- Public cloud storage option for cookbook data
2023-07-17: Education Working Group
Zoom | password: pythia23
Facilitator: Drew Camron | Note-taker: Kevin Tyle
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Elena Romashkova / NCAR / @rmshkv
- Erin Rhoades / Unidata / @erin6541
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Drew: Facilitator signup form has been updated to add our next several meeting dates. Please sign up at the link above.
- Julia - absent today due to CISL director search. I will be attending AGU but likely to present on GeoCAT content. OWG was unable to meet last week due to SciPy, but does anyone plan on presenting on Pythia content? Abstracts due Aug 2.
- BR: I plan to attend AGU. OWG will meet next week and decide on a plan
- KT: AMS 2024 deadline is 8/24
- BR: had a meeting with Shelley Stall and Brian Sedora from the AGU Notebooks Now! Initiative to talk about alignment between Notebooks Now! and Pythia cookbooks.
- They are excited about what Pythia is doing and eager to help promote our content as resources for AGU members
- In turn, AGU can help turn Cookbooks into scholarly objects, e.g. pitching a “special issue” of a journal devoted to cookbooks.
- We will hopefully keep talking about all this
- Elena - physical oceanography cookbook update, got data access fixed for 2 / 5 notebooks, working on the others
- Max - one of the radar cookbook contributors (Millind Sharma) will be presenting on the radar cookbook + Project Pythia at our ARM/ASR Programmatic meeting around DC in a few weeks 🎉
- RM: It was clear at SciPy that our educational material is very relevant
Agenda (with notes from the meeting):
- Drew and Kevin presented Friday afternoon; link to presentation will be placed in our Google Drive space
- Drew noted that links to our public presentations are buried in our website … needs to be improved (add to IWG Project Board)
- And updated with lots of more recent talks!
- Lots of interest in the GitHub Actions that power our Cookbook/Foundations content
- BR: a big chunk of functionality in our custom GitHub Actions could be “upstreamed” into Jupyterbook (e.g. routing notebook execution to Binder). That would make it much easier for other groups to use the functionality AND vastly simply the Actions code that we have to maintain. It’s a win-win that deserves an allocation of some developer time.
- MG: Metric-related gathering: Scientific Python’s devstats looks like a good model to follow. Will help address our issue 319
- State of cookbooks not in the gallery
- Dask: recent PR fixed data access issue. Seems ready to add
- Gridding: removed because of an issue. Can it be added back?
- BR: yes. I worked with the author last week to fix the builds for this cookbook, it should be ready to go
- Guidance on reviewing cookbooks for inclusion in the gallery: can we write down a simple policy? Need something other than just “does it run?” This is really necessary now that we’re reaching a critical mass of newly-contributed cookbooks.
- MG: Set up some sort of queue When newly-contributed cookbooks get added, add them to the queue. Automatically set up a “ready to review” issue.
- DC: Create a project board dedicated to cookbooks added to the queue.
- BR: Whatever we come up with should serve the purpose of keeping things open and moving … not creating barriers and making this hard.
- Need two reviewers to approve the cookbook
- When a reviewer “signs off” on a cookbook, that includes two things:
- Mechanics of the cookbook repo are “solid” and up to date
- Scientific content passes muster
- Make sure that references to cited content appear
- Assure that the citations.cff file is complete
- MG: aim for consistency in cookbook layout, e.g. title (“Dask Cookbook” rather than “NCAR Dask Tutorial”
- RF: While scientific content could be reviewable by anyone, the “style guide” part of the content would most likely require someone familiar with the styles used in our template. Sort of “infrastructure-related” content.
- BR: these are similar issues for, e.g. journal reviewers/editors. But let’s grow to that point, and that starts with a general guide for what’s needed to get a cookbook approved.
- MG: might not be necessary to require two separate reviewers, as long as the technical and content boxes are checked. This will help in terms of getting cookbooks reviewed and posted in a timely fashion.
- DC: will draft cookbook review document and ask folks for comment. Besides just sharing a document, will also post on the relevant Discourse channel
- MG: whatever we come up with should be added to a PR template for use by cookbook authors
- BR: what should we do with the cookbooks that are already waiting for review? Wait two weeks or move forward with a sense of the review criteria we have discussed today?
- DC: proceed with reviewing but hold off on final approval for two weeks
- MG: For those cookbooks that appear ready to go (e.g. webmap notebook) and whose authors are eager for it to be published, go forward with those.
- RF: same can be said for the Dask and Gridding cookbooks (Dask cookbook has already gone through review, and Gridding cookbook was previously in the gallery)
Action Items
- Cookbook review document 1
- Discourse post (+ review doc 2?)
- Async contributions before sync review
- AGU / AMS conference abstract deadlines (8/2 and 8/24)
- Scientific Python
- Cookbook gallery workflows being skipped
- Links to past talks hard to find (and more recent talks need to be added)
2023-07-10: Infrastructure Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- James Munroe / 2i2c / @jmunroe
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Cookbook discourse thread. Needs response. (RF: done)
- JC: NCE submitted to NSF
- JC: John meeting with Tricia OKeefe, NCAR partnership development, on Wednesday (1115 MT) to explore possible industry support
- BR: hopefully meeting soon with the AGU Notebooks Now! Folks about coordinating on “Cookbooks as scholarly objects”
Agenda + Notes:
Notes from last month’s EarthCube workshop in L.A. (JC)
- No plans for an EarthCube2.
- GEO OSE program supports Geo-CI (Think EarthCube lite).
- NSF taking an “incremental approach” with OSE to see what is needed (No OSE office planned at this time)
- EarthCube meeting is now a grassroots effort. May come to NCAR next year. Can we leverage with cookbook hackathon?
- John met with Arika Virapongse, Community engagement consultant for Earth Science data infrastructure. May be able to help with Pythia’s community engagement efforts.
- Dates for next year’s Pythia hackathon? Need to get dates settled ASAP from NCAR logistics point of view.
- Action: JC will get the ball rolling with a list of potential dates
- PRs for sandbox. What do we do with them? See latest.
- OPen issues for this (JC)
- Add a bot to auto-comment on new PRs in the sandbox (JK)
- Add a bot to auto-close stale PRs in the sandbox
- AGU abstracts related to Pythia? Deadline coming up early August
- Bring this up ASAP at the next AGU meeting
AGU abstract deadline approaching
2023-06-27: Outreach Working Group
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda + Notes:
- Remove announcement on main projectpythia.org page
- Tweet out newly-published (or expanded) cookbooks
- Radar (8/28→9/1)
- Annual meeting (1/28→2/1)
2023-06-26: Infrastructure Working Group
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT: ERA5_interactive cookbook ready to be added to gallery
- Inline images not appearing on website: seems to be an issue in how the notebooks/images directory gets mapped to the _images directory in the published jupyterbook
- Procedure for adding to gallery post-form submission? A GH action should be triggered when someone submits a new cookbook to the gallery using the form. However it’s a simple one-line addition to the file https://github.com/ProjectPythia/cookbook-gallery/blob/main/site/cookbook_gallery.txt
- JC: John is giving a Pythia talk at the EarthCube community meeting on Wednesday in L.A.. Draft slides here. Please provide any updates by 4pm MT. Raleigh Martin will be in attendance. Discussion topics to bring up with Raleigh?
- JK: Went through steps for adding DOIs to several Cookbooks (published and unpublished), 5 left
- JK: talking about event at ESDS meeting right after this. slides
- BR: still working on the cookbook build actions to handle github secrets – should finish this this week
Agenda + Notes:
- Who is shepherding hackathon WIP?
- How to make sure work isn’t lost? Somebody from Pythia team shepherd through process, pair up with contributor, no blockers
- Make an assessment of WIPs
- Post hackathon, reach out once to the community asking for their current repository homes, drop links (Google doc, or sheets?)
- “Pythia team is interested in helping you get your work across the finish line, we’d like to have eyes on it so we can identify blockers, if you want to be on list drop link to repository with work in progress” do it ASAP, don’t pin them down on if they need help, include offer to adopt something that will not be worked on (sometimes people only have time during hackathon) - BR will do that
- “New” infrastructure issues
- Need GPU nodes on Binder services
- Storage for cookbook data, smaller buckets
- Where should these issues be opened - cookbook-gallery or cookbook-actions, discussion boards, projectpythia.io repo from portal for overall issues. Doesn’t matter if you add it to the project board? Still provide some guidance on this. There will be questions about the Pythia binder service, and where should those go? Not ready to build that out
- Remove Pythia members list from portal page, dynamic and automated, currently unhelpful to community building. Scrap it? Only project PIs? - BR to remove what we have, have an anonymous “your name here” box, how to acknowledge someone who is no longer active on the project “distinguished contributors” look at Jupyter project
- Re-enable GitHub authentication (how to have more than one session)
- Decide on either reducing the VM CPU-class for the worker nodes and/or ask for the additional 750K SUs on JS2 that we originally applied for
- Investigate GPU-enabled Binderhub
- Persistent storage for authenticated users
- Multiple simultaneous sessions for authenticated users
2023-06-23: Hackathon Report Out and Quarterly Meeting
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Elena Romashkova / NCAR / @rmshkv
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda + Notes:
- Good week, people showed up for free, gave time, were engaged, met new people, that was our goal
- Hybrid component worked remarkably well. AV was flawless, whole room could be heard by virtual attendees. Hybrid plenary twice a day, anyone remote or in the room could share screen. No audio issues.
- Slack was a great decision! Helped bridge the gap between in person and virtual attendees. Future of the slack? Does it stay live? We’re on a free slack service, content sunsets after 90 days. Not an archive but can still be used for collaboration going forward. Don’t want to find confusion among community or make it burdensome for organizers. We don’t want to continue monitoring tech support channel.
- Discourse great for more thoughtful conversation, slack great for real time back and forth. Good for continuous synchronous collaboration. Do nothing. It may stay active for a few more months while people finish up their Cookbooks. Use same slack channel (without old content) for next year. Make a note that content will be lost to the users.
- Infrastructure held up well.
- Not everyone who signed up came, but everyone who made it was really dedicated. Lots of progress made on content and infrastructure. Was worth it to reach out to people ahead of time to form cohort or team.
- 36 people active on the slack channel. Can target a larger group next year, might need a crash course that isn’t relevant for all attendees.
- Helpful to see what level of comfort people came in at. Great knowledge sharing and spin up!
- It was hybrid in people joining remotely some days and in person others.
- Thoughts from RF: Slack and breakout rooms. Max of 2 people in the room though, small group worked, waiting for people to join call wasn’t too frustrating
- Need stickers
- Better guidance as developing cookbook for realistic expectations on how much resources their Cookbook will demand. What is a typical environment? People asking for guidance.
- Negin’s suggestions about 5 mg buckets for data.
- Build frequency for health checking is daily, might be able to slow that down to save resources. Testing is biggest use of our resources. Much greater than people using the books.
- Data road blocks, lots of issues that got in the way of what Elena wanted to work on. Datasets weren’t available on the cloud or were too large to upload. CISL might host datasets later on. Encourage people to upload small representative subset of the data. Worth getting whole dataset out into open and then taking sample of it for demo purpose.
- More effort into subject matter champions that invest in building their cohort prior to hackathon and reaching out to people in that subfield to join. Less time selling ideas and shopping around for people on day 1.
- The split of half hack time and half in a jupyter tutorial was a little awkward. Possibly address this with a day 0, so people who need spin up tutorials show up earlier. Or half day on a Monday so people can travel in on Monday. And time for the organizers to get together Monday morning.
- Get some input on how remote attendance worked.
- Sometimes failed to join the Zoom room right away, they were cumbersome but seemed like the best solution. Clustering of channels into smaller group was a good idea that worked well! Some clusters probably worked used Zoom more than others.
- Hit promised deliverables except getting good automated metrics collected and posted on website, will benefit us as we track the growth of the community, understand hot spots of community interest.
- Proposal status, Brian is the PI
- Steering counsel, split up grant writing from community aspect, under Pythia or Pangeo label for setting priorities. Meets quarterly or semi annually.
- Replace Pythia members page on homepage, PIs is out of date.
- Let cookbooks run wherever they best run. NSF cloud and Pythia infrastructure. No need to all run on same place.
- Pangeo medium blog on Cookoff? Get in habit of “workshop report” for event, something informal, JK presenting at ESDS meeting on Monday
- Collaborate on article or journal about Cookbooks, JOSS journal of open source education, BAMS article (format essay, targets meteorology community). What are they and what problems are they trying to solve?
- Target special issue publication of Cookbooks, outline vision in a concise place that is future home of publishing content. Look more at AGU Notebooks Now!
- Plans to submit to AGU open science sessions? Discuss soon at next EWG
- Guidance on how to handle binderhub. Burn rate is high right now, was bumped up for the event. Could be addressed by IICC
2023-06-13: Outreach Working Group
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Andrew Huang / Anaconda / @ahuang11
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Nearly 50 people registered, split between in person and remote
- Made a loose list of topic ideas, particularly from people who agreed to facilitate a breakout group
Agenda + Notes:
- Subsurface geology, if other geologists are interested.
- Packaging/repo/SE best practices resources
- Can provide support on any cookbook
- Interactive analysis, meteorological observations
- 3D data visualization and/or open-source development
- Interactive visualization; ARCO dataset analysis/visualization
- Xarray
- Foundations + jupyterlite
- Plans to form groups and facilitate interactions?
- Advance organization vs real-time project pitches on day 1?
- Don’t want 2 groups with 20 people etc.
- Set up a Discord/Slack?
- We don’t have access to a paid Slack service
- Free Slack is an option but not persistent
- Discord is free to use
- Some people don’t like it because of associations with gaming
- Probably fine in the hackathon space
- “Another app” factor – most people already using slack
- Decision: set up a Slack – it’s familiar and useful. Emphasize to participants that it is NOT persistent.
- Max Grover just gave a cookbook orientation, can reuse this material. About 1 hour or 1:15 to dig through the whole thing
- First hour plenary:
- Overview of Pythia, current state of Cookbooks, high level goals (20 minutes) (BR)
- Schedule and expectations (20 minutes) (DC)
- Bit of a break for conversations
- 10 am: project pitches and group formation
- Afternoon tutorials
- Could be concurrent sessions?
- First 90 minutes: three choices
- Just go dive in to hacking
- Intro Python overview room
- For project leaders: discussion of logistics for the week
- Tutorial on building a Cookbook
Action items
- (DC) inquire with ESDS folks about Zulip, do they really want us to use it?
- (DC + MG) set up a Slack workspace
- (BR) prepare overview talk and slides (tap other people as necessary)
- (DC) prepare “schedule and expectations” talk and slides (tap other people as necessary)
2023-06-12: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: Brian Rose | Note-taker: Drew Camron
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JK:
- will be in Main Seminar Room with Google Meets for open and close of days so people can join in companion mode
- but will be in Zoom for break out rooms because of how it handles breakout rooms better with the same link for the entire event but conference rooms cannot join en masse (on individual laptops)
- Please register so we can all have name tags (you can add your name to the excel sheet)
- We need to tell people to bring headphones and need to make sure breakout rooms are not behind security doors
- Last I checked, 20 virtual and 26 in person attendees
- Back in office Thu. 6/15
- Book build failing for new binderhub server likely due to GitHub authorization requirement
- Previous Binderhub server fails to launch (see JK’s Zulip post from 6/8)
- Update sphinx-pythia-theme : have been looking into this, but will not be a quick fix. IMHO not necessary to fix this pre-hackathon.
- RF: cookbook contributor’s guide updated
Agenda (with notes from the meeting):
- These were addressed, or will be as part of the hackathon
- Potential considerations for contributor’s guide confusion
- Will label good issues with hackathon label, brose@albany.edu will add new label
- Folks to address some above issues
- How will attendees communicate with organizers when need help?
- How do we ensure that unfinished work isn’t lost?
2023-06-05: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR @clyne
- Rob Ford / UAlbany / @r-ford
- Elena Romashkova / NCAR / @rmshkv
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT:
- Updates to contributing guide PR incoming
Agenda (with notes from the meeting):
- Julia meeting on Wednesday with the multi-media team at NCAR
- Accepted everyone who responded by first deadline, time to accept remaining registrants
- Added locations and times to agenda
- Good ratio of projects proposed to attendants
- Hackathon preparation status (organizing committee)
- Other hackathon items:
- What can we do to ensure work in progress is finished, reach out again, understanding that it is open source
- Cookbook submission status
- Data access issues - make it work with smaller data
- RF will follow up
- Other submissions?
- Do we have a cookbook shepard?
2023-05-30: Outreach Working Group
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Julia sent acceptances to all registered attendees
Agenda + Notes:
- Updated the schedule with room locations
- KT listed invited people; DC to reach out
- Potential cookbook topics
- Subsurface geology, if other geologists are interested.
- Packaging/repo/SE best practices resources
- Can provide support on any cookbook
- Interactive analysis, meteorological observations
- 3D data visualization and/or open-source development
- Interactive visualization; ARCO dataset analysis/visualization
- Xarray
Subsurface geology, if other geologists are interested. |
Packaging/repo/SE best practices resources |
Interactive analysis, meteorological observations |
3D data visualization and/or open-source development |
Interactive visualization; ARCO dataset analysis/visualization |
Xarray |
Data Formats / Big Data |
Climate variability cookbook; general assistance to virtual participants |
Visualizing complex weather data, creating derived data from model outputs. |
Data Visualization Techniques |
2023-05-22: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Ryan May / Unidata / @dopplershift
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JK: Sciparcs starting, intern working on xarray stuff, will be joining our cookoff
- JK: will be making a video walk-through of Pythia content for Climatematch Academy
- DC: one of our interns very interesting in contributing to basic Pythia content (Foundations)
- JC: John met with EDS Book team (Alejandro and Anne) to discuss how we might collaborate. Next step: share roadmap *after* we hear about NSF GEO/OSE award.
- JC: Reschedule canceled May meeting for June. During hackathon?
- Will propose a time slot for this in latter half of hackathon week
- JC: Need to discuss NCE for pythia soon
- BR: I’m off-grid as of this Wednesday until Friday 6/9. Reach out ASAP if you’d like my input on any high-priority work
- MG: I noticed that our nightly builds do not actually push the built content to github pages - do we want to do that?'
- BR: it was built that way intentionally, but we could add the option to publish each build
- MG will open an issue on the cookbook-actions repo for this (see #62)
- MG: Created a collection of draft cookbooks for our interns at Argonne - will transfer toward end of summer when notebooks more developed (over in this organization https://github.com/EVS-ATMOS)
- RF: Issue with cookbook author permissions, on the agenda below
Agenda (with notes from the meeting):
- OWG Report
- Hackathon update
- Announced registration deadline extension until June 2
- But Hotel deadline in May 24
- More complete draft agenda https://projectpythia.org/pythia-cookoff-2023/schedule.html
- Pythia team members, please register for the Cookoff, our UCAR staff need to know who’s who
- Working on inviting specific individuals
- Collecting ideas for logistics,
- Feeling pretty good about numbers right now. Should expect the list to grow a bit more until the deadline, especially with outreach
- Time to start reaching out to group facilitators
- Cookbook submission workflow explained
- Negin Sobhani transferred ownership of a cookbook (on Dask, yay!). Now what?
- A consultant who produces a lot of tutorial content
- What happens next? Current process:
- Open a PR on gallery generator repo, there is a form to fill out with gallery tags
- This step hasn’t occurred yet for that cookbook
- Should cookbook authors have the Admin role on their cookbook repo?
- https://github.com/ProjectPythia/landsat-ml-cookbook/issues/15
- Only downside we have identified is the possibility of accidental deletion
- Policy should be to make content creators admins on the specific cookbook repo
- We should “elevate” everyone already designated as “outside collaborator” to members of the pythia org
- Let’s not use the “outside collaborator” status for Pythia
- Updates to Cookbook contributors guide
- Leads could get together on Friday afternoon at end of hackathon to hack together a quick summary of successes
- Possibility of presenting this to the ESDS group the following Monday
- Pythia Binderhub and Jupyterhub up and running, with GitHub authentication!
- Review “content” backlog - discussion
- Triage unprioritized new issues
- Assign/volunteer staff to high priority tasks
- Review status of “in progress” tasks.
2023-05-16: Outreach Working Group
Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT: Binderhub now has secure https enabled; next step is Github auth
Agenda + Notes:
- List of registrants (13 virtual, 18 in person)
- Deadline extension? - June 2nd - needs announced
- Hotel booking deadline is May 24 (just note this in the deadline extension email)
- Need agenda by end of month
- Need to respond now to everyone who did register
- Deepak Cherian, Katie Dagon (NCAR)
- Anderson (Carbonplan)
- Scott Henderson (U Washington)
- Chelle G
- Johnny
- Tom (Microsoft)
- Julius and Tom Nicholas
- Rich Signell (USGS)
- Jessica (U New Hampshire)
- Max Jones
- AMS Hackathon list from Denver
- Vapor team (GPU instance on Jetstream 2)
- Marshall (previous SIparCS intern) presented on Xarray things
- Tonya Snow, Colorado School of Mines
- This year’s SIParCs interns
- EOL folks (ex. Mike Dixon, LROSE development group)
- Kim Wood (MSST), Casey Davenport (UNC), Jon Thielen (CSU)
- GeoCAT team
- UCAR EdEC (Curtis Walker)
- Brian Blaylock (HRRR / “Herbie” library)
- Start time 9 am each day
- Morning plenary 9 - 10 am
- Scheduled lunch 12:30 - 1:30 pm
- Afternoon debrief 4 - 5 pm
- Asynchronous updates from groups in different time zones through a doc
- Slido or jam board or google slides
- Shortened session on Wednesday to make room for outing
- Lunch is earlier, plenary could be 1:30 - 2:30
- Outing starts 3 pm
- Do we need a facilitation ideas doc?
- Good ideas about using slides for the group check ins – both for in-person and remote groups
- Outdoor activities
- Offer either a hike behind NCAR or excursion to Pearl Street
Action Items
- Announce deadline extension [DC, discourse + twitter]
- Announce extension on conference web page [BR]
- Announce at Pangeo call [KT]
- Email current registrants with draft agenda and hotel reminder [KT will draft, send it to Taysia]
- Include website link
- Suggest following Pythia twitter
- Reminder about the hotel deadline
- Send email to the above list of potential invitees [DC] [KT will obtain contact info]
- Investigate weeknight trivia options in Boulder [JK + DC]
- Update agenda on website [BR]
2023-05-15: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: Drew Camron | Note-taker: ktyle@albany.edu
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT: Binderhub update
- Working on https and GitHub authorization
- MG: Still working on troubleshooting the secrets issue
- BR: thinking about a limitation in our testing strategy re: failing notebooks that still “pass” our CI tests
- DC: label-syncing looks to be working!
Agenda (with notes from the meeting):
- Untriaged Infrastructure:
- 334: still waiting for response from original poster
- 319: On hold - still needs to be assigned priority/personnel
- High priority Infrastructure
- 306: Max will open a relevant issue on the Radar Cookbook; other cookbook authors (e.g. Thomas) are also welcome to create a citation with their ORCID
- 344:
- Sphinx Pythia Book Theme: DC: looks a bit more complicated than at initial glance
- 48: CMIP6 cookbook: MG suggests we remove this from the cookbook gallery until the secrets issue can be worked out. BR notes that binderbot is currently ignoring the table of contents, so the cookbook would need to be removed from the repo temporarily. MG: the functionality in Binderbot seems to support secrets, but not clear how to exploit it yet.
- 20: (Kerchunk cookbook) Data artifacts: in contributor’s guide, make it clear to cookbook authors that they should strive to use publicly-available data; if not, consider making a small snippet available. Longer term: explore if JS2 or other public large storage could be used for Cookbook datasets.
- Infrastructure needed for cookbook-athon
- Binderhub
- MG: Get DOI generator workflow running smoothly
- MG: Ensure our existing cookbooks are working as expected and documentation is correct
- DC: Coordinate breakout volunteers; add participants to the organization so they can readily transfer ownership
- MG: what are participants proposing in terms of what they’d like to work on?
- DC: ensure that virtual participants have an easy way to participate
- BR: Health checks for cookbooks need to be finding “unhealthy” cookbooks
2023-05-08: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- DC: reminder to sign up for IWG facilitation rotation
- DC: will shortly have a markdown tutorial for Foundations ready
- BR: gave a talk and demo to the ESDS book community connected with the Climate Informatics Reproducibility Challenge, notes here:
- JK: Need everyone’s ORCID, what to do for people who’s we’re missing?
- Still missing Kevin Paul, will wait a few more days before merging the citation document
- Jetstream2 instance setup is ongoing. Almost ready to fire up the BinderHub.
- Still working on secure https.
- Should get a certificate issued on projectpythia.org
- KT and JK might talk offline about this.
- Can still use a jetstream.cloud.org domain if we can’t make it work
- New intensive cookbook from @bonnland (Brian Bonnlander?) will serve as a good test case for dask-intensive workflows.
Agenda (with notes from the meeting):
- Reg deadline for hackathon is this Friday May 12.
- Currently have around 20 people signed up, including some from beyond immediate Pythia people, and including some overseas people (remote).
- At least 3 people here at uAlbany planning to participate.
- Hackathon: would be great to do a walkthrough of Cookbook construction on day 1, a more relaxed version of the demo that Brian gave today
- We will most likely extend the deadline for registration
- Next week we will hash out the hackathon agenda in more detail.
- Pyart-gridding notebook: assigned to MG
- xbatcher-ML-1-cookbook: some broken code
- Should we have a “cookbook maintenance” repo tag to identify broken cookbooks that need attention?
- Using mamba – high priority - KT assigning himself
- defer any Infrastructure issues to next week; tag any non-Content issues that are actually Content-related
- Discussed a number
- Same tag-related comment as in High Priority
- Pre-hackathon needs that are content-related:
- THere’s a number of broken notebooks currently in our Cookbooks
- BR will open an issue to catalog these
- Our build system shouldn’t be pushing broken notebooks to the published gallery
- Improvements to the Cookbook guide?
- Team members should walk through the Cookbook creation process and get as familiar as possible.
- Identify potential simplifications to the guide.
- What’s the path of least resistance to creating a working Cookbook?
2023-05-02: Outreach Working Group
Meeting link: Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Taysia Peterson
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda + Notes:
- Registration form
- 14 people have registered so far (mostly NCAR)
- Taysia will do some math on how many people our breakout rooms can support
- Drafting up an acceptance email - notify the 15th
- Those who have registered receive an auto-acknowledgment
- Taysia’s questions: need answers by end of May
- How many teams will there be, so breakout rooms can be planned
- Need to inform the NCAR multimedia team the detailed agenda for each day.
2023-05-01: Infrastructure Working Group
Zoom | password: pythia23
Facilitator: ktyle@albany.edu | Note-taker:
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR and JC plowed through a lot of issue triage. Take a look at the Project Board, especially at things labeled “medium”: https://github.com/orgs/ProjectPythia/projects/3/views/13 to see if anything really needs to be elevated
- JK will be investigating infrastructure surrounding DOI/citation
- Copy editing has been completed on foundations with the work of JK and L
- DC will add some material on Markdown, motivated by upcoming Unidata Users Workshop
- Additional exploration with EDS/ROHub workflows before we chat with them again
Agenda (with notes from the meeting):
- Default: previous facilitator will be next meeting’s note-taker
- VM has been set up (IP 149.165.152.50); next step is to build and deploy Binderhub
- DNS Registration needs to be set up for binder.projectpythia.org and hub.projectpythia.org [JK + KT]
- Labels in user support: consider avoiding “low priority” until work timeline lines up with that description for community issue submission
- Will add “needs more information” label once label syncing is ironed out (BR)
- Cookbook-actions testing: in-progress as exploration and testing, not currently requesting outside support
- DOI/citations: project/foundations citation in zenodo with DOI, requires addressing #344 re addressing foundations releases
- brose@albany.eduRyan May noted as zenodo+software experience and will support JK as that progresses
- Next steps include workflow automation for cookbooks
- Label syncing: this has become more relevant; still lower priority than above
- Shortest path: provide PAT
- Adjacent to cookbook-actions testing, DC/BR will explore new optional action for users
- Theme: dependency issues and release-ability
- Will require someone taking on the knowledge background on the theme
- Pythia-datasets still just for foundations
- Other considerations (ACCESS credits, git LFS?) for intermediate data or necessary accessible data
- Other issues deprioritized, combined, or re-worded as appropriate
Action Items:
- Sign up for 5/15, 6/12, 6/26 IWG slots
2023-04-24: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Brian Rose/ UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Max Grover / Argonne / @mgrover1
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: John submitted abstract to NSF EarthCube Geoscience and Cyberinfrastructure workshop
- KT:
- Submitting a cookbook-related abstract to the AMS radar meeting
Agenda (with notes from the meeting):
- Hackathon planning update
- Application live, posted on Pangeo Discourse
- Should keep track of number of participants
- All active Project Pythia members should register
- Planning team should meet soon (likely at next Tuesday’s OWG)
- Meeting with EDS Book team
- Clarifying goals/differences between cookbooks and EDS book
- Likely to set up another meeting for further discussion
- JC meeting in person soon
- Publishable original work vs. example workflows?
- MG: EDS uses hvplot, which we don’t have material on. We could collaborate to add hvplot content to Pythia
- JC: reference each other’s work, but note differences
- GitHub releases for Foundations. Also Radar Cookbook
- Content on creating a good Python package
- JK: Maybe some work will be done on this at the Cook-off
- MG: related to pyOpenSci efforts
- Interactive plotting would be a good addition to Foundations, since multiple cookbooks may reference it.
- Markdown content PR needs a few small changes and reviews
2023-04-18: Outreach Working Group
Meeting link: Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCLAR / @jukent
- John Clyne / NCAR /@clyne
- Anne Fouilloux / Simula Research Laboratory / @annefou
- Alejandro Coca-Castro / The Alan Turing Institute / @acocac
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda + Notes:
- EDS is a bit further along in terms of their ability to efficiently mint DOIs for their content
- Let’s focus on the overlap and differences between our two entities:
- EDS focus more on producing publishable objects. Uses RoHub (see docs here).
- Example: Sea ice forecasting using IceNet. In that notebook, it has a DOI, links to launch Binder, and a RoHub link.
- In RoHub, versions of the research object can be forked / snapshotted; they serve an analogous purpose to forks/commits/versions in GitHub
- EDS uses citation as an education tool for responsible science
- Pythia more on targeted use-cases, repeatable workflows (Cookbooks)
- Pythia focuses on the basics … get people familiar with the Pangeo Python ecosystem
- EDS: focuses on best practices how to write publishable, reproducible notebooks. Emphasize on common metadata standards in these notebooks. Represents an evolution in the traditional peer-reviewed paper paradigm.
- Pythia cookbooks: more tightly-focused than Foundations … want to build infrastructure to make it easy and attractive for those who want to share their workflows using datasets / methods of interest to them … and having a common repository to house them (rather than having various useful notebooks randomly scattered across the internet).
- Pythia cookbook philosophy: build it and they will come. Make it an attractive way for the community to share useful workflows. We’re at the proof of concept phase … need to get the community entrained.
- The best example of what Pythia envisions to be an effective cookbook is the Radar cookbook. It follows a two-part model:1) background information about the question at hand (what is radar data? Where can I find it?), and 2) specific use-case examples (e.g. load a NEXRAD archive from cloud storage and create some interesting visualizations from it)
- There’s nothing stopping a cookbook author from having their cookbook be a published, refereed product … although at this point that isn’t what we envision to be the main motivation for someone to build and share a cookbook. But we do want them to have a citable reference!
- Notebooks as citable research objects, usable metadata
- Pythia has good infrastructure in terms of automated workflows, integrated with GH actions; integration with cloud binder/hubs
- Review process and community
- Outreach and hackathon
- Potential collaborative opportunities:
- Take the radar cookbook and integrate into an EDS-like gallery object
- Leverage MystNB to cross-reference notebooks
- Anne & Alejandro: Explore Pythia Cookbooks process
2023-04-17: Infrastructure Working Group
Meeting link: Zoom | password: pythia23
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT
- Will be setting up our new JS2 allocation this week
- SciPy talk approved! Will coordinate presentation plan with Drew.
- AGU Notebooks Now!: The goal of AGU’s Notebooks Now! Effort is to “Elevate Computational Notebooks as Primary Elements of the Scientific Record”. Exploring many of the same issues we’re facing with cookbooks. Information on joining Notebooks Now working groups available here.
- Anne Fouilloux and Alejandro Coca-Castro will join the OWG tomorrow (Tuesday) to discuss the Environmental Data Sciences Book project
- CISL event planning resources here and here
Agenda + Notes:
- From previous meeting: Devote time at each IWG to review open issues/PRs, e.g.:
- triage un-triaged new issues
- Assign staff to high priority items
- Check progress for active items
- Current/untriaged new issues/PRs:
- Issue 329 (Portal site not publishing/previewing in GH pages)
- PR 331 resolves the publishing issue
- PRs 330, 332 are attempting to resolve the previewing issue
2023-04-10: Education Working Group
Zoom | password: pythia23
Facilitation rotation sign-up and schedule
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Max Grover / Argonne National Lab / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: NCAR is deploying an on-prem, prototype cloud. This is a pilot project, whose goals include exploring feasibility of a production deployment. John would like to include support for Pythia’s cloud needs as a use case.
- JK user/client to help come up with specs for this
- Pilot project but if it is deployed we may want to use it instead of jetstream
- KT - any integration with research data archive, besides thread service? Great way to have datasets in object store Zarr format. JC- Yes we want to provide some python APIs to NCAR data holdings. Looking for use cases to drive these services. Look KT into convo.
- MG - Radar cookbook has RDA huge gridded dataset that would serve as a good dataset to show capability
- KT: Who manages our projectpythia.org domain? We will need to register a binder.projectpythia.org DNS entry soon.
- Need to set up Cname record to point to jetstream service - JK manages this, ready in next couple weeks
- BR: absent from today’s meeting. The meeting form looks good to me!
- MG: Went through and created a list of Pangeo gallery authors, serving as a potential list of people to explicitly invite
- We should expand this list…
- Deepak Cherian, Katie Dagon (NCAR)
- Anderson (Carbonplan)
- Scott Henderson (U Washington)
- Chelle G
- Johnny
- Tom (Microsoft)
- Julius and Tom Nicholas
- Rich Signell (USDS)
- Jessica (U New Hampshire)
- Max Jones
- AMS Hackathon list from Denver
- Vapor team (GPU instance on Jetstream 2)
- Marshall (previous SIparCS intern) presented on Xarray things
- Tonya Snow, Colorado School of Mines
- This year’s SIParCs interns
- EOL folks (ex. Mike Dixon, LROSE development group)
- Kim Wood (MSST), Casey Davenport (UNC), Jon Thielen (CSU)
- GeoCAT team
- UCAR EdEC (Curtis Walker)
- Brian Blaylock (HRRR / “Herbie” library)
Agenda (with notes from the meeting):
- Identifying folks to explicitly invite to participate in the hackathon
- See the OWG notes (mainly pangeo-gallery related folks)
- Finalize hackathon signup form
- JC - we should reach out to James Munroe about coffee breaks through 2i2c
- Working on linking to cook off page - PR
- Uncertain on how many people will be there. Invite James to next meeting
- 20 hotel room block, 100 capacity of seminar room, ideal capacity? Over 50 is too much. What is a good group size? When should registration cut off be?
- Add info to form, pick a date that we have to tell them by. Friday May 12th tell them. May delay by a week at deadline time, Change from June 12th on form. Hotel block deal available until May 24. Know in advance if we have to down-select. Maybe notify first X as soon as possible. If it looks like we’re getting too many applicants, slow down the confirmation process (more competitive at the end).
- Check current number of applicants at every meeting. Hard deadline, but rolling review process.
- Mix of in-person vs virtual. Use UCAR zoom for remote participation. Limitations on participation over Zoom? No
- If too many applicants, what is the process of deciding who can come and who doesn’t? First come first serve or based on statements of interest. (include a statement that completing the form doesn’t guarantee acceptance and you’ll be notified by such and such date if they don’t meet criteria, make it clear that interest statement can be used to make this discussion).
- PR for adding sign up form
- Environmental Data Science (EDS) book project (John & Brian, 5 mins)
- Pythia has a lot of overlap with the European EDS Book project. See slides, and accompanying Pangeo Showcase talk (once posted)
- Anne Fouilloux, one of the principals, has expressed interest in possibly working with us. Wants to make our resources more FAIR
- Next steps?
- BR: I had previously invited Anne to a Pythia meeting, there were problems joining the Zoom (https://discourse.pangeo.io/t/making-cookbooks-citable-best-practices/3170/10)
- Encourage everyone on the team to look at slides linked above, great presentation. Talk from the Pangeo showcase will be posted on Zenodo. Worth hearing entire presentation and thinking about overlap and thoughts on how to collaborate
- Review process is fairly robust, more detailed conversation will be great.
- Meet during OWG time? Tuesdays at 9 MT?
- KT- foundations or cookbook effort? Thread on twitter for plotting data with metpy on maps. Want to identify gaps in plotting instructions. We only have one cartopy notebook. Good cookbook candidate? Flesh out intro to cartopy chapters. Haven’t added much new foundations content in a bit, good opportunity to revise and add content. Add snippet to gallery of how-tos. Need to identify the underlying missed need.
- Create an Issue on Foundations to propose extending the Cartopy content
- Invite people to hack on open foundations issues as well as Cookbooks? Current way the form is written doesn’t call out leading a cookbook on X, form focused on “breakout” section
2023-04-04: Outreach Working Group
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Max Grover / Argonne National Lab / @mgrover1
- James Munroe / 2i2c / @jmunroe
- John Clyne / NCAR / @clyne
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- MG: applications are out for Climate Match Academy (Abigail Bodner is leading the effort). Signup sheet is live now
- JK is helping out with that
- MG is retweeting their links
- MG: retweeted Radar Cookbook
Agenda (with notes from the meeting):
- Ideas:
- 1. Can the registration form include (or at least link to) a description of the objectives of the hackathon?
- 2. Can you be open and public with those you have already registered?
- 3. Are there targeted individuals that are going to be invited to the hackathon?
- K. Goebbert / Valpo
- EDS Book team +1
- Identify authors on various Pangeo gallery notebooks
- Py OpenSci folks
- CryoCloud (Tasha Snow, Jessica Scheick - icepyx).
- 4. Need to aim for a blend of senior / “keynotey” people, along with newbies. These folks could lead a daily keynote/kickoff session. Don’t want it to consist of mostly/all newbies.
- Update registration form, including:
- Opt-in for having their name visible:
- Demographic details: Drew will liaise with NCAR folks (John/Julia)
- Open-ended question asking why they are attending (what area do you work on? What are you interested in working on? Do you have a link to a GH repo … paper/presentation?)
- If registration does not automatically lead to acceptance, then the form might need to say “registrations need to be approved by the organizing committee” or something to that effect. Their answer to the open-ended question will inform their approval. This should be a low bar, but needs to exist and be defined.
- Deadline for form submission: Friday May 12. Critical for motivating those who will defer until the deadline date and also for informing us on how many may attend
- Publicize on social media / email channels
- Identify / invite the targeted individuals
- NCAR Gallery - Brian Bonnlander (NCAR), Anderson Banihirwe (Carbon Plan)
- CESM LENS - Scott Henderson (Univ. Washington)
- NASA CCMP - Chelle Gentemann (NASA)
- Cloud benchmarks - Ryan Abernathy (Earthmover)
- Glaciology - Jonny Kingslake (LDEO)
- Landsat 8 - Tom Augspurger (Microsoft)
- CMIP6 - Julius Busecke (LDEO), Tom Nicholas (LDEO)
- Pangeo tutorial - Rich Signell (USGS)
- Icesat cookbook - Jessica Scheick (Univ. of New Hampshire)
2023-04-03: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JK - Will change to UCAR Zoom link with password “pythia23” next week? YES!
- JC: NSF hosting Building upon the EarthCube community summer event, June 27 - 28, Los Angeles. There is funding available for early-career scientist
- JC: John, Brian, and Julia had a very informative meeting with Kieth Maull and Mathew Mayernik on citing cookbooks. See Julia’s excellent notes
- JC: NCAR’s Doug Schuster is working with AGU’s Notebooks Now working group, which are exploring the requirements needed to support publication of executable notebooks as journal articles. John will follow up with Doug.
- KT: Jetstream2 allocation was approved, but is still not accessible. I have emailed JS2 support for a status update.
- RF: Will this new Binderhub resolve the issues we have on the cookbooks that require more substantial resources? THAT IS THE GOAL
- BR: Plans to open an issue regarding better documentation of how we automatically preview PRs (other folks outside Pythia, e.g. Julius Busecke, would like to use it)
Agenda + Notes:
- Summary of Citing Pythia Cookbooks discussion with NCAR librarians Mathew Mayernik and Keith Maull (full notes):
- Open to change - DOIs in the journal context are assigned to things that don’t change, but outside of that there are DOIs to things that do change (datasets that are continuously growing, or minor or bug fixes in software packages, etc)
- What to cite - The GitHub repository seems the most stable (if the site went down for any reason)
- Authorship vs contributors - We want to encourage primary scientific contribution and overall maintenance contribution. We’ll have a changing contributor base over time. Zenodoc has lots of “contributors” options but they don’t appear in citation. Open inclusive authorship policy.
- Versioning - make sure time stamps and accessibility of older versions. How big of a change warrants a new version (not minor maintenance fixes, but significant changes to dataset, the science, or how to use a dependent package). This opens the door to the user citing the wrong version.
- Releases - doesn’t sound like we need a regular release schedule. But what can we automate/simplify of the release process? Put machinery in place to handle creation of DOI, release, archiving of code, etc in a formal process to reduce the number of hoops.
- Zenodoc vs UCAR to create DOIs - Zenodo will meet our needs, will archive a Zip file of the GitHub repository at each snapshot/version we save. Potential issue is that we store unexecuted notebooks, they are populated at run time in the name of reproducibility. So if it isn’t runnable anymore that output is lost?
- BR - I think we could set up a “release” pipeline that would commit executed notebooks (at the end of our book-build workflow) to a dedicated branch of the repo and include those in the archived snapshot identified with a DOI.
- Add a “how to cite” section to each resource. If it isn’t prominent, readers will make up their own citation method and the DOI won’t be used. Also add the DOI to README files and to gallery cards.
- DC: What would administration of issuing DOIs look like? Would we need to have a group of folks who would have DOI-minting privileges on Zenodo? Both Ryan May and Brian Rose have existing DOI-minting privs from MetPy/Climlab.
- Matt M and Keith M have a workflow for DOI minting; JC has folks on his team as well who have the expertise. John will post a pointer to step-by-step guide.
- New “releases” → New DOI. Previous versions’ DOIs point to an archive on Zenodo.
- Status of outstanding issues/PRs:
- Sort by tags. Let’s at least get the high priority items assessed!
- OE:
- The project board does not “automatically” pick up new issues. The issue originator must do so. Similarly, we can’t easily search for issues that are not on the project board … however, as long as you are not sorting “By-repo”, you can click on Add Item, add a hash tag, and then you can see repository issues and add them to the project board.
- Our practice at this point is to just include Issues. We can link to the PR, but do not include a PR as its own project item (i.e. row)
- Orhan created and saved a new view with the High Priority tag.
- BR: can one also view unlabeled issues, since that is part of the weekly triaging? Alternatively, can one view newly-created issues?
- KT: is there a way to configure GitHub so any new Issue automatically gets an appropriate (e.g. “Unassigned” or “New”) tag assigned?
- Multiple label search: e.g. -label:"high priority" -label:"medium priority" -label:"low priority" shows what items are NOT assigned
- Going forward: We need to devote ~5 minutes each week to new issues that have come up.
- Does it make sense to create a new weekly/biweekly iteration that would hold these new issues?
- BR: Are merged issues automatically marked with a status of “Done” by GH? OE: Not sure
- JC: Perhaps a good way to wrap this up is to figure out what we want to do for a standing process starting at the next IWG meeting? For example, triage un-triaged new issues, making sure staff are assigned to high priority items, checking on progress for active items, etc.
- What is the best guidance on how to review PRs submitted by bots?
2023-03-27: Education Working Group
https://albany.zoom.us/j/97375285304?pwd=bGQzeldCcGs4VEdpVnc4aDlxMEFZdz09
Facilitation rotation sign-up and schedule
Attendees:
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: NSF/GEO does not have funds to support summer hackathon at this time. Possibly have funds sometime in June. We can check back then.
- JC: Meeting scheduled with Mathew Mayernik (NCAR librarian / project scientist for Thursday to discuss citing cookbooks.
- BR + KT: taking the first steps to setting up a “new and improved” Pythia Binder on Jetstream2 using our dedicated ACCESS allocation
- MG: Environmental data science folks holding a “reproducibility challenge”
- See the thread here
Challenge Registration Closes: 22 April
Teams & Projects Announced: 30 April
Challenge Begins: 1 May
Peer Review Begins: 15 May
Challenge Ends: 31 May
Results Announced: 15 June
Published in @eds_book: Summer
Agenda (with notes from the meeting):
- ProjectPythiaTutorials content, gallery, landing page, and organization
- Current landing page is empty
- Should we construct a gallery?
- In general, no. To support “first party” recurring Pythia tutorials, possibly yes
- Support paid content in resource gallery
- Should consider our stance here, currently have not had to support this explicitly
2023-03-21: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- Max Grover / Argonne National Lab / @mgrover1
- Julia Kent
Agenda (with notes from the meeting):
- Two times a day checkins, planning to cluster people into groups for group projects
- Add page to Pangeo site for reference - use Cookbook template, own github repo that can be edited with project pitches
- Title
- Dates
- Description
- Prelim agenda
- Hotel information
- Plans for remote participation
- Suitable equipment enabled to support hybrid meetings
- 2 synchronous check-ins per day
- What experience to come with? What spin up tutorials are planned?
- Bring science content and put in self=publishing self-maintaining content, doesn’t know infrastructure, improve workflow to use better parallel computing and Pangeo-stack tools
- Assume know Python and Jupyter notebooks, might not be 100% on GitHub
- Go through Cookbook development workflow, adding notebook to repo with workflow, troubleshoot development process, leverage foundations for self-study leading up to event.
- Brief intro to Pythia -> what are Cookbooks
- Smaller break out room for GitHub or Pangeo questions
- Try to blend groups, someone to usher the group through Github issues
2023-03-20: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne National Lab / @mgrover1
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: NSF OSE GEO proposal submitted. Thanks to all who contributed!
- JC: Possibly submitting an unsolicited proposal to NSF to support summer hackathon.
- MG: Went through and made more changes to the hackathon registration form
Agenda + Notes:
Action
- Set up meeting with Matthew Mayernik. Who are the attendees?
2023-03-13: Education Working Group
https://albany.zoom.us/j/97375285304?pwd=bGQzeldCcGs4VEdpVnc4aDlxMEFZdz09
Facilitation rotation sign-up and schedule
Attendees:
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne National Lab / @mgrover1
- Drew Camron / Unidata / @dcamron
- Tyler Erickson / free-range / @tylere
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- DC: Three new cookbooks “published” to the gallery!
- BR: still out today but Pythia GEO OSE proposal was submitted to NSF today!
- KT: working with senior undergrad on Holoviz-related apps; he will present on Pangeo showcase on 4/26
- RF: Cookbook contributor guide updated
- MG: Drafting form for summer hackathon registration
- Topic suggestions
- Other part of form is dependent on whether NSF provides travel funds
- TE: still learning about Pangeo and its related projects
Agenda (with notes from the meeting):
- Action item: KT, DC, RF will look into it
- MG: We still have some Pangeo notebooks to fold into Cookbooks … can we encourage the original authors to participate in the summer Hackathon?
- KT: longer term … just watching advances in the foundational ecosystem such as Pandas’ upcoming 2.0 release, which leverages Apache Arrow
- MG: good opportunity to add some material for our data format foundational content that is Pandas-specific … such as working with tabular/columnar data CSV, Parquet, Apache Arrow
- DC: working with MetPy development folks on updating the MetPy tutorial and integrate into a Cookbook
2023-03-07: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany/ @ktyle
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
Agenda (with notes from the meeting):
- SciPy draft submitted
- June 20-23 Hackathon planning
- Identify local institutions who may be able to attend (e.g. MSC, CU Boulder, CSU, NOAA, etc.)
- Tweet out a save the date
- OWG project management tools
- A shared Google calendar is up and running
- Twitter duties: Drew up for one more week
- Comb through the external resource gallery each month: add new resources and maintain existing ones
2023-03-06: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Ryan May / Unidata / @dopplershift
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Tyler Erickson / / @tylere
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Volunteer for notetaking?
- NSF GEO-OSE proposal status: text finished, now in hands of the various partners’ research proposal support staff
- SciPy 2023: Talk proposal submitted 3/1
- Julia K: Lily K. is working on the edits for the Foundation notebooks / content
- Rob: working with Drew on new cookbook
- Drew: has resolved token authentication issue with cookbooks, will document the workflow and discuss at an upcoming meeting
Agenda (with notes from the meeting):
- Consensus is to have contributors be on an equal footing in terms of their GitHub memberships (i.e., don’t have them be “relegated” to Outside Contributor status)
- Similar model followed by Matplotlib
- EWG Facilitator sign-up
- Tyler Erickson (recently left Google; involved with Google Earth Engine): more familiar with nbdev (which is based on Quarto) … curious about how it differs from Jupyterbook
- Jupyterbook is an easy “next step” for folks who have taken the leap into presenting their research/lectures/etc via a Jupyter notebook. It would be a an additional step for learning a new workflow that uses nbdev/quarto
- Also curious if we are looking into in-browser (e.g. Jupyterlite) as a platform/sandbox
- One issue is how much of a load it is … especially dataset-wise
- At this point, does not integrate with a Dask cluster
Items from previous IWG meeting:
- Align CI workflow across all of Project Pythia
- cookbook-gallery still missing some of these pieces, including link-check
- cookbook-actions is itself un-tested and has little CI checking on updates and enhancements
- DEFER to future meeting: To mamba or not to mamba, that is the question
- Use mambaforge as the basis for environments, or:
- Use conda, but with built-in libmamba
- See Discourse discussion here.
- (This might be considered an educational content question, so we could take it at the next EWG)
- DEFER to future meeting: Ruff: highly performant linter
- Several projects (e.g. Xarray) adopting Ruff
2023-02-27: Education Working Group
https://albany.zoom.us/j/97375285304?pwd=bGQzeldCcGs4VEdpVnc4aDlxMEFZdz09
Facilitator: Ryan May | Note-taker: Kevin Tyle
Facilitation schedule
Attendees:
- Ryan May / Unidata / @dopplershift
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Brian Rose / UAlbany / @brian-rose
- Rich Signell / USGS / @rsignell-usgs
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- Julia Kent – can’t make today due to SIParCS interviews. Summer event update from Taysia:
- Booked the Damon and Chapman rooms Tuesday-Friday the entire day (except Wednesday 12pm-1:15pm the Chapman Room)
- Booked ML680 after 10am on Tuesday, all day except 2-3pm Wednesday, and all day Thursday/Friday.
- Do we want additional space in the library?
- Heavily dependent on whether NSF funds a portion of the event.
- Requesting a 20-room hotel block June 19-24th
- Max: reverting radar cookbook update until Binderbot authentication can be worked out
- Brian: racing to NSF proposal deadline for follow-on to Project Pythia! Including folks from UAlbany, NCAR, Unidata, 2i2c, USGS
- Ryan: working on NSF proposal above
- Rob: cookbook maintenance
- Julien: Pythia and cookbooks being discussed with his group (Paleocube) . What makes up an effective notebook?
- Brian: Our notebooks articulate how long it would take a “typical” user to work though the notebook. We haven’t as yet received feedback that a specific notebook’s time estimate needs revision.
- Rich: Interesting learning links from ESIP meeting today. There could be a good opportunity for Pythia to organize a meetup of “Pangeo educators” to discuss how folks are doing things.
- Dedicated EWG meeting, which could then lead to:
- Townhall @ AGU/AMS?
- SciPy BoF
- This could also help establish a steering board should “Pythia-Next” be funded
Agenda (with notes from the meeting):
- Kevin / Drew working on it
- Brian presenting for ESDS group at NCAR this week - will submit the above-linked form expressing interest
- Hackathon update and planning
- Submitted 2 page document to NSF program officer contact (early last week)
- Further discussion with Julien: in terms of developing notebooks, what should we be focusing on?
- How to use the “technology”, e.g. xarray
- B. Rose: Foundations and Cookbooks focus on this, but in a scientific use-case perspective
- How to practice and present good science (e.g. EOF analysis)
- Ryan: Up to the notebook author! Could certainly include (2) as part of (1)
- Julien: But it’s up to the audience as well.
- Brian: up to this point, much of the community’s involvement has been motivated by how to use the “techonology”. An author would want to have made the conscious choice to post their work in the form of an executable notebook, not just in a PDF/journal.
- Max: Radar cookbook has started with tools, but now adding content talking about wx radar science itself. Incorporating content delivered in the recent AMS Short Course on radar.
- Julien: relates scenario of how a chemist might not be aware of how to properly compute statistics (need to multiply by cosine of latitude)
- Brian: Goal is to ease the path toward bringing your scientific content to a notebook if it can be readily linked to data + software
- Max: is the cookbook template better-suited to a classroom lecture, or better to take pieces to place in a course?
- Max: should we draft a registration form for the hackathon yet?
- Brian: wait a bit longer for funding situation clarification
- Ryan: but can we at least start with drafting the form, noting that funding is pending?
- Max will proceed on making a form
- Continue to promote on Twitter, etc.
- Rob: is there going to be a hybrid component? Yes, although if it is funded, more effort on in-person.
2023-02-21: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany/ @ktyle
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Draft proposal
- Drew and Kevin will iterate independently the rest of the week but will meet up via Zoom at noon EST / 10 AM MST Friday
- June 20-23 Hackathon planning
- OWG project management tools
- Project board, a la the new Github board that Orhan demoed?
- A shared Google calendar may be more appropriate, which would include
- Deadlines (e.g. conference submissions)
- Rotating duties (e.g. Twitter)
- Twitter duties (will be scheduled on the above-mentioned Google calendar)
2023-02-13: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Elena Romashkova / NCAR/CGD / @rmshkv
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- KT
- Kicking off new semester of new Pythia model teaching
- Using Py-ART for the radar stuff
- JC: Reminder that Brian Rose will be visiting NCAR Feb 17 - March 3.
- BR: Coming to Boulder on Friday :)
- Submitted proposal to ACCESS CI program
- Computer time on Jetstream2 - secured more substantial allocation there 🚀
- Lots of credits, approved in minutes
- RF: No updates
- ER: cookbook in progress ideas coming later in the agenda
- MG: No updates
- RM: No updates
- DC: Pinged Brian about testing actions issue
Agenda + Meeting Notes:
- Did come to conclusion to submit a SciPy abstract (deadline Feb 22)
- Updates on Hackathon discussed during quarterly meeting
- Julia linked the hackathon title/description sent to admin folks (see below)
- Requested rooms for breakout rooms, hotel room block
- Updates from meeting with UCAR’s EdRC office re hackathon - John, Brian
- Notes here
- Encouraged us to reach out to other groups who have done this before (ex. Unidata)
- Unidata experience with workshops?
- What is plan B if we can’t get support for external participants?
- The need for this is real…
- Chances of securing funding is very low, look at engaging virtual people as well
- Could recruit more NCAR/UCAR/CU people too
- Make sure to have remote as an option, worked OK last year, can improve
- Ready to publish
- DC checking with him after this meeting to get this moving
- KT will follow up
- Currently in a “polishing” stage
- Priority here should be getting this to the finish line
- Intake cookbook (James Morley)
- Currently published, work left to do?
- No updates
- Worth checking with Julia + team on this one
- VAPOR 3D visualization (Julia Kent)
- Julia had a meeting with the VAPOR team last week about this
- CGD/Matt Long (Elena Romashkova)
- Have a notebook plotting some FEISTY fish model data as maps + timeseries, with some data organization and potentially interactivity involved…would this be a useful cookbook?
- Read data using xarray
- Applying some calculations
- Visualize at the end
- GO FOR IT!
- Jupyter notebook based diagnostics
- Working with MOM6 ocean model
- Good to coordinate with COSIMA recipes
- Asynchronous training materials for
- Getting spun up with workshop content
- Intro to Python/Jupyter
- Met-focused content
- Creating 1-2 cookbooks out of this material, have things cross-referenced, consolidated
- Mostly not started, main thing contributed near hackathon season 😀
- Where should the content not in Foundations go?
- Ex. Basic Python, including control flows, etc.
- Could consider putting this into its own cookbook, or building on Foundations
- RM - if we include, we should include in (new) section in Foundations
- Next up after actions testing
- Cloud centric workflows (Rich Signell)
- Covered by Kerchunk topics
- Consolidated content from Pangeo + EarthML (Jim Bednar, Anaconda; Max Jones, Columbia)
- Plan was to hold off until infrastructure issues (binderbot) resolved
- Infra more related to Pangeo gallery
- Data infra issues related to Pangeo gallery
- We have ironed out most of these things - good to check up with them
- Our stuck Pangeo migration cookbooks
- xgcm-cookbook
- Not at the top of any one person’s plate. Some compute and data limitations, but needs revisiting
- Would be good to contact the original pangeo gallery developers
- Contribute, and attend the summer event :)
- One variant allows for “cookbook-like” generation of animations for NOAA Science on a Sphere visualizations
- How should people contact us?
- Pangeo Discourse (recommended)
- Want this to be a focal point
- Repeat this message
- Focus should be on engaging with people there
- Open issue in cookbook-gallery
- Where do we put the guide?
- Append to main contributor’s guide
- Somewhere in cookbook-gallery
- The more links to it the better, make sure it is on the cookbook gallery page
- Maybe removing the overall contributors guide would make it more clear - and making it bold/highlighted
- Update cookbook gallery according to discussion today - RF
- Draft an issue template for cookbook contributions - OWG
- Drafting the FEISTY content into a cookbook - ER
- Follow up with Pangeo + EarthML folks - JC
- Reach out to Pangeo gallery people to migrate content - OWG
2023-2-6: Quarterly Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / Argonne / @mgrover1
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Agenda (with notes from the meeting):
- Status of planned submissions (notes from our October 24 meeting)
- Regridding cookbook (Thomas Martin)
- WIP. repo exists, but has not been added to gallery yet.
- Ready to publish
- Consolidated content from Pangeo + EarthML (Jim Bedner, Anaconda; Max Jones, Columbia)
- Plan was to hold off until infrastructure issues (binderbot) resolved
- Our stuck Pangeo migration cookbooks
- Not at the top of any one person’s plate. Some compute and data limitations, but needs revisiting
- Cloud centric workflows (Rich Signell)
- Fall 2022 ATM433/533 (Kevin Tyle)
- Intake cookbook (James Morley)
- VAPOR 3D visualization (Julia Kent)
- CGD/Matt Long (Elena Romashkova)
- Potential topics include CESM (and others) data wrangling, comparisons to eg obs data, analysis and viz workflows
- How do we move these forward?
- BR: have the hackathon!
- JC: do we need to have more cookbooks available to attract attendees?
- BR: It might be helpful to have some key cookbooks that exercise infrastructure
- KT: Having more content in time for NSF GEO OSE proposal might be helpful
- BR: can we identify low hanging fruit, particularly for “external” CBs. External CBs probably more valuable (to reviewers) than from core team.
- KT: what about kerchunk cookbook
- Incorporating project boards into our best practices
- Summer event
- Who is on the organizing committee?
- OWG + BR
- Optimal hackathon size: ~50
- Update on interactions with NSF
- Are there areas where we can improve?
- What is one thing that you would change?
- Are there opportunities that we should pursue that would help achieve our goals?
- BR: securing additional funding
- RM: Scipy CFP is out, and due Feb 22.
- MG: there will be another ARM meeting in August. Plan to develop cookbooks.
- BR: How do we incentivize scientists to contribute cookbooks?
- MG: should we mint DOIs for each CB? Easily done through Zenodo.
- BR: we should figure this out before the summer event.
- MG: we should engage the NCAR librarians: Matt Marenick (sp?)
- BR will post Discourse question.
2023-02-07: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany/ @ktyle
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- James Munroe / 2i2c / @jmunroe
- Drew Camron / Unidata / @dcamron
- Max Grover
Agenda (with notes from the meeting):
- OWG leaning against submitting a tutorial
- In support of submitting for a 30-min talk on cookbook infrastructure, GHA+binderbot, executable book workflows, etc.
- List of tracks - https://www.scipy2023.scipy.org/present
- Possibly under “tending your open source garden + community”
- Consider JupyterCon?
- KT + DC will start to draft abstract
- June 20-23 Hackathon planning
- Pythia Cook-off: hacking together cookbooks for Pangeo
- Project Pythia [event]: cooking up recipes for the Pangeo stack
- Description:
- Fee for provided lunch:
- Currently exploring funding opportunities for travelers
- Goal headcount: ~50
- Potential 2i2c sponsorship opportunity for snack/coffee break(s)
- Brainstorming
- Start with “here’s what can be done” and leave people room to generate from start?
- “Bring your own project”, asking people to come with workflows and code in hand, and let the hackathon turn these into cookbooks, eg this paleo project (it’s the same dates, oops - joint session opportunity?)
- JM links other “recipe” effort from COSIMA
- Tutorial/content groups to connect with:
- Making room for groups to gather and combine similar efforts into cohesive cookbooks
- Reach educators looking to use these in their classes?
2023-2-6: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Orhan Eroglu / NCAR / @erogluorhan
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Volunteer for notetaking? Drew
- Quarterly meeting: tomorrow 12 Noon EST / 10 AM MST. Send agenda items to John (or just add to this document). Draft agenda items thus far:
- Review of metrics
- Prioritizing tasks (with support of new project boards)
- Summer event
- Open discussion
Agenda (with notes from the meeting):
- Summer Hackathon: June 20-23
- Brainstorming during tomorrow’s OWG meeting (and likely quarterly meeting thereafter)
- Link to Draft
- Can be submitted any time, will word smith and delegate submitters; can iterate with funders as-needed
- Link to last week’s planning meeting notes
- One pager draft in the works and will need quicker feedback later this week
- Currently populated with cross-org issues, organized into separate views
- Can be populated with any issue or PR, OE prefers to focus on issues and have PRs explicitly tied to relevant issues in-PR
- Expect some limitations and headaches with automation
- Board allows creating items that are not explicitly tied to repo issues
- Iteration (ie “sprint”) duration and recurrence can be defined - tie to meeting schedule? Longer?
- Value of “Comment” field for specific project-based history, etc.
- Individuals can modify views without affecting entire team, then edits can be saved or exported to new views
- Align CI workflow across all of Project Pythia
- cookbook-gallery still missing some of these pieces, including link-check
- cookbook-actions is itself un-tested and has little CI checking on updates and enhancements
- DEFER to future meeting: To mamba or not to mamba, that is the question
- Use mambaforge as the basis for environments, or:
- Use conda, but with built-in libmamba
- See Discourse discussion here.
- (This might be considered an educational content question, so we could take it at the next EWG)
- DEFER to future meeting: Ruff: highly performant linter
- Several projects (e.g. Xarray) adopting Ruff
2023-1-30: Education Working Group
Meeting link:
Facilitator: Brian Rose | Note-taker: Rob Ford
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- James Munroe / 2i2c / @jmunroe
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: Quarterly Meeting rescheduled for Tuesday, Feb 7, 10am MT. Our usual zoom coordinates. “Required” for PIs/senior-personnel
- RF: Landsat ML Cookbook by Demetris Roumis added to gallery
- BR: harmonized build/preview/test infrastructure across repos
- JM: shoutout for pyOpenSci: https://www.pyopensci.org/
- Link to it (resource gallery)?
- JC: note in our content when something is reviewed by pyOpenSci (maybe)
Agenda + Meeting Notes:
- Scipy call for proposals due Feb 22
- Additional discussion points for hackathon and foundations updates below
- Summer Event (June 20 - 23) - john
- Current status
- NCAR Main Seminar Room reserved June 20 - 23
- Can help with a variety of logistics (A/V coordination, registration, room reservations, etc.,)
- Need to find out what else Taysia can do for us
- Meeting with UCAR EdEC on Feb 13, 11am MT
- What can they do for us (e.g. advertising, provide content)
- Help with funding (to support external participant travel)?
- What kind of event are we holding?
- Reserve additional rooms if needed
- Complete the CISL Event Request Form and meet with Taysia
- Identify the conference committee. Tasks:
- Program
- Advertising
- Hybrid or in-person only
- Budget (registration fees? Support from NSF?)
- Lodging for external participants
- Etc.
- BR: Come with an idea for a cookbook, work on it, and get help with infrastructure. Present work at the end/last day
- JM: contributors could come out of the event with a DOI
- JC: could attract a larger audience by covering foundational/cookbook content (first half-day?). Need to think more about what to cover.
- KT: bring in other NSF-funded groups to contribute content (AI2ES, other EarthCube awardees)
- JC: Should reach out to EarthCube office. John will do.
- Event title ideas:
- Focus on “cooking”?
- Cookbook Kitchen
- Project Pythia Cookbook Workshop
- (OWG will continue)

- Questions about Foundations (see OWG notes below for context)
- Is our “spinning up” workflow opinionated enough, and is there enough pre-setup to get started in Foundations without digging through multiple textbook chapters (OWG opinion: no, move some conda-based setup to the very beginning)
- When does the benefit of advocating for mamba/mambaforge outweigh any burden on learners with switching gears to mamba?
- No clear downside, but we need to ask others: open discussion on Pangeo Discourse
- How do we proactively keep Foundations alive regularly?
2023-01-24: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany/ @ktyle
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- 148 (how to get involved, now a draft PR 297)
- Twitter/Mastodon schedule
- Kevin: twitter ambassador until 2/7
- Every new cookbook release
- Hackathon plans
- Open issues on Discourse (make this a regular agenda item for OWG)
- Julia may submit an abstract for the upcoming NCAR SEA conference
- SciPy Call for proposals due Feb 22
- During last week’s MetPy community call, Eric Bruning (Texas Tech) mentioned Pythia as a resource … noted that finding the “Installing & Managing Python with Conda” section is buried a bit too deep
- Should we move this section up in the “Getting Started with Python” chapter?
- June Pythia Hackathon planning (6/20-23)
- Set up an interest form?
- Further discussion on next Monday’s EWG meeting
- Hybrid as well as in-person? If hybrid component, need to have a dedicated hybrid leader(s)
- Think about rewarding external participants (stickers, etc.)
- Drew: will look into ordering more stickers
- Questions for EWG re Foundations
- Prerequisite conda “setup” clearer up front
- Using mambaforge and an opinionated workflow
- General review of foundations intro material
2023-1-23: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @ r-ford
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Naheem Adebisi / Boise State Univ
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Brian Rose will visit NCAR Feb 17 - March 3.
- AMS report out
Agenda (with notes from the meeting):
- Thursday’s quarterly meeting (1/26) - John
- Needs rescheduling; John will reach out to non NCAR folks
- High priority infrastructure-related:
- Establishing pythia-wide project board to facilitate tracking high priority issues, etc.
- OE will lead; set up foundation over the next month
- Synchronize Foundations / Portal actions with Cookbook actions (see 339 )
- RM/DC are reviewing this PR
- Identify and fix gaps in the Cookbook testing workflows
- A thorough review of our cookbook testing suite is in order, but first order of business is to synchronize Foundations actions with Cookbooks
- Brian: request ACCESS account so you can be co-PIs (looks like John already has one)
- Status of recently submitted cookbooks
- E.g. @norlandrhagen, kerchunk; @jnmorley, librarynb
- KT will follow up with @norlandrhagen
- JM’s work with JK and Pythia team ended at 2022, but JK feels the cookbook is not too far from the goalline
- Larger question: what do we consider “good enough” for a Cookbook?
- BR: A cookbook that runs, passes tests, is “correct enough” in the sense that a couple pairs of eyes have looked at it should be published.
- BR: and this is why we need some robust testing cookbook-wise
- JC: what’s the minimal useful product we can put into people’s hands? Go for that rather than striving for the “perfect cookbook”
- BR: what’s the sufficient amount of content to deem a cookbook publishable? Is one notebook, wrapped in the cookbook template, enough? Yes if there is a clear link to other content and/or clear direction toward a more complete cookbook. Else, look to combine with other published cookbooks
- RM: needs to show a certain amount of “connected ideas on a subject”
- BR: on the other hand, don’t want to put up unnecessary barriers to folks who want to contribute
- Action item: open a discussion on Pangeo Discourse
- What happens when someone transfers a repository to Pythia?
- For content, best to follow up with original cookbook author, but if the issue is infrastructure related (e.g. JSON issue preventing jupyterbook from fully executing)
- Landsat-ml-cookbook by @droumis
- RF: coordinated with cookbook author; currently there is an infrastructure issue that’s keeping it from being built (likely the JSON/Jupyterbook/JupyterServer issue)
- Best use of github org and teams to manage access?
- OpenAQ (proposed)
- How to come up with metrics for how people are interacting with cookbooks … whether it’s just a search, or developing a new cookbook, etc; look at Google analytics for trends in searches for Cookbooks
- Welcome Naheem Adebisi (Boise State Univ.) … discovered Pythia while learning Xarray
2022-12-19: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Max Jones / CarbonPlan / @maxrjones
- Ryan May / Unidata / @dopplershift
- Max Grover / Argonne / @mgrover1
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: Deepak Cherian expressed interest in consolidating CGD examples as Cookbooks in ~Feb timeframe. Has funding from “Jupyter Meets Earth” award to support this. John will follow up in February.
- JC: external cookbook contributions from @droumis (via discourse) and Max Jones. Latter needs response.
- JC: Looking into dates for possible summer workshop/hackathon on the Mesa that would include external participants.
- RF: Still working on fixing issues with the CMIP6 cookbook … could be related to Xarry October update
- JK: looking at cataloging various ESDS blog posts … they should be good potential cookbook topics
- MJ: Cookbook topics: EarthML and Kerchunk. Could link into existing Foundations chapters as well as cookbooks.
- MG: At ESGF meeting this past Monday, folks were interested in assisting with CMIP6 cookbook, also include xcdat
Agenda + Meeting Notes:
- No meeting last week. May not meet next week either since it’s a holiday week.
- JC: Please tweet out upcoming Pythia activities at AMS
- MG: A few more Mastodon followers. “Tweets” are termed “Toots” (although “posts” is preferred)
- Strong presence at AGU: talks from Julia and John. Poster presentation from Kevin T. Pictures here.
- John and Brian met with NASA’s Paige Martin (TOPS PO) to discuss ways to work with NASA’s TOPS initiative.
- John met with NSF GEO/EAR PO Eva Zanzerkia. Eva is promoting Pythia to “solid earth” community.
- Lots of awareness of Pythia at AGU!

- Cookbook contribution issues and documentation:
- How can we add an external collaborator so as to enable the transfer of the contributed cookbook repo to our organization? More generally, how can we make the cookbook contributing process more seamless? Simple Google form?
- https://docs.github.com/en/repositories/creating-and-managing-repositories/transferring-a-repository - “To transfer a repository that you own to an organization, you must have permission to create a repository in the target organization.”
- MJ: Xarray has a model for repo transfers
- RF will work with the contributor to get his EarthML cookbook added. Will use this as an opportunity to better document the cookbook contributor’s guide.
- DC: suggest that organization owners enable 2FA
- NetCDF and other formats (GRIB)
- Cookbook authors should consider whether they should have it runnable in GitHub actions or if it needs a Binder instance
- AMS breakout group organizing. MJ will be sending out instructions for those who want to lead a breakout
- Brainstorming on summer 2023 Pythia workshop. Working with NCAR staff in terms of dates. More to follow.
- RM: Unidata triennial workshop 6/5 to 6/8.
- Infrastructure short-term needs (since we may not meet for a while)
- Nuances regarding nightly testing continue to exist
- Are these issues significant enough to warrant refactoring cookbook code? Not at this time …
- With end of year approaching, time to look at Google Analytics (and other relevant metrics). JC asks JK to take a look. Will discuss at an upcoming OWG or a separate meeting.
2022-12-05: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- KT: working on poster for AGU, using summer EarthCube poster as base. Will invite comments maybe tomorrow afternoon
- JC: John will be on an AI in the Earth Sciences panel at AGU, Data and education: Empowering AI/ML in Geosciences, Monday, 12/12, 2022, 12–18 CT, S406a
- BR: we’ll do a crash course in migrating repos to ProjectPythia org after today’s meeting
- RF: CMIP6 Cookbook is still failing nightly build, needs looking into
- DC: Will give the OWG update
- MG: Xarray + Dask content got merged into Foundations! Can work with RF on CMIP6 Cookbook issues. ESGF meeting Monday at AGU, hoping to show this Cookbook off.
- JK: EarthCube demos booth looks very sparse so far. Only Pythia!
Agenda + Meeting Notes:
- Plan to create online + print-friendly version of “How to Get Involved with Pythia” flyer this week, before AGU.
- Look out for PRs this week
- Should have plenty of stickers for AGU in the hands of various people. Keep an eye on numbers to see if more need to be ordered prior to AMS meeting
- ProjectPythiaCooksbooks contains (cookbooks, cookbook-specific actions, gallery generation), each cookbook will need action references updated
- Transfer to education+infrastructure teams
- Todo
- Rename cookbook gallery generation repo and fix subdomain (if necessary)
2022-11-29: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
Kevin Tyle / UAlbany / Kevin Tyle
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Prepare for AGU!
- Post list of talks/posters on the Pangeo discourse
- MetPy
- Open Science in the Rockies
Agenda (with notes from the meeting):
- 148 (how to get involved)
- Divide this up so we can get it done!
- Opening a draft PR
2022-11-28: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Highlights from today’s “Future Opportunities for Geoscience-CI Community Engagement” webinar
- There was an AGU Notebooks Now workshop on November 3,4. Info and recordings here. Goal of Notebooks Now!: “Elevating Computational Notebooks as Primary Elements of the Scientific Record”
- NSF solicitation 23-521 (Strengthening the Cyberinfrastructure Professionals Ecosystem). Solicitation goals include “fostering innovative and scalable education, training, and development of instructional materials, to address emerging needs and unresolved bottlenecks in CIP workforce development.” Deadline Feb. 23 2023. Worth looking into.
- Zoom meetings “sign in to join, this meeting available to authorized participants only”. Is this what we want?
- Old way (NCAR-hosted, have to type in a password)
- New way (UAlany hosted, t
- Create some appropriate verbiage on the relevant page and/or calendar link?
Agenda (with notes from the meeting):
- Provisional minihackathon: devote last part of EWG agenda 12/5 and some time afterward
- Cookbooks
- AGU booth demo signup
- High priority infrastructure-related:
- 345 (make allow_errors=False and add flags for any cells that are expected to fail, so we don’t miss problems such as what occurred in 343)
- Create PR to resolve similar issue in this CMIP6 Cookbook
- We should add some verbiage for cookbook contributors so they know how to add that flag
- 344 (Mint GitHub releases for Foundations)
- Rule of thumb: new release when there is some clearly-visible new content or revised look … but not for trivial typos/fixes.
- Also gives us a formal way of citing content with a DOI
- GitHub makes automating a version release fairly easy … some human intervention is still necessary, but can be pretty minimal
- What is the imagined scenario where someone would cite us?
- If we don’t provide the citation link, then we won’t be cited :)
- Consider a quarterly release by default?
- Would we also want to preserve previous versions so they could be visible on, e.g. GitHub pages?
- Zenodo’s archiving provides a source archive, not a visible/rendered archive … maybe not perfect, but it’s do-able, and low-hanging fruit
- Bigger issue in general! How do we properly archive electronic content such as Foundations? Larger than Pythia. JC will reach out to AGU for their thoughts.
- 267(How to communicate with us)
- 231 (Autoapprove/automerge bot PRs)
- Based on R. May’s comment, close?
- Can we have two people tagged for review, but only need one to merge? YES … on a per-team basis.
- If not, tag one, need one to merge.
- PR author will need to exercise some judgment. If the author feels it needs a 2nd reviewer, the author will need to do explicitly tag them.
- What do we do about auto-assigned reviewers who are not currently active on the project? Time to cull those who are no longer active. Just need to take those folks off the relevant teams (notify them at the same time that they are being removed)
- 148 (how to get involved)
- Discuss at tomorrow’s OWG
- Contact email for the GitHub org needs to be changed from Kevin P to John C.
2022-11-21: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: EarthCube will have a booth at the upcoming AGU meeting in Chicago. If you would like to volunteer to do a demo of your tool at the booth, please sign up here
- Demo a notebook or two, some of the infrastructure
- KT, JK interested in signing up for slots
- JC: There will be several EarthCube sponsored events at AGU. A complete list is here.
- BR: migrating Cookbooks to the main ProjectPythia GitHub org is more complicated than we thought due to some GitHub magic around URLs for Pages.
- Difficult to maintain cookbooks.projectpythia.org domain for the repos under the normal Project Pythia organization
- Decision: cookbooks.projectpythia.org -> gallery and just host cookbooks under projectpythia.org/mysubject-cookbook, thus allowing us to continue migration to single GitHub org (which is the bigger win vs. cookbooks.projectpythia.org namespace for cookbooks).
- JK: Updated some cookbooks for infrastructure (e.g. binderbot)
- DC: Unidata workshop at CSU
Agenda + Meeting Notes:
- Fosstodon https://fosstodon.org/home @projectpythia
- Scientific Python community has not come to a consensus on where it’s moving yet
- EWG efforts prioritization going into conference season
- Julia working on 0-10
- James working on 20-30
- BR: Should be really close
- JC: Not particularly high priority, maybe SIPARCS student?
- Cookbook doesn’t currently run properly
- Migrated from Pangeo, good demo of “larger” cookbook
- Assigned to Robert Ford and Brian Rose
- BR: Outdated for current infrastructure and wrong on some points. Should be high priority since this leads potential contributors.
- Assigned to Kevin Tyle
- BR: Would reduce maintenance, but not high priority since doesn’t affect contribution process
- Drew Camron will pick this up a little after the holiday
2022-11-16: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
Quick Updates:
Agenda:
Social Media Platform:
Twitter message thread: https://twitter.com/messages/1572270698579591168
Sci. Python Mastodon draft governing document: https://hackmd.io/qw3V_E08TviYYGA1x83w7A
They propose rolling it out coincident with PyDataGlobal (12/1-3)
Got on waitlist to join Fosstodon (same password as Twitter account)
AGU:
time to start working on posters and talks
AMS:
Pangeo hackathon on Friday of that week. Waiting for updates from organizers. Pythia-led breakout group.
Saturday the week before, AI2ES meet up at Center Green. KT will be attending.
Pythia summer 2023 hackathon: time to start investigating dates; mention during tomorrow’s Pangeo call - KT
Pangeo Open Science Working group? Last meeting was 7/1/2022; MG asked Ryan A. during last week’s Pangeo call; apparently group is still active
Pangeo showcase:Information/signup page - https://pangeo.io/pangeo-showcase.html
2022-11-14: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- James Morley / UVU / @jnmorley
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- NASA TOPS: Decided against pursuing the Dec. 2022 call
- KT: presenting on RAPIDS-AI-accelerated analysis/visualization of NLDN data during Dask Demo Day Thurs. 11/17 11AM EST / 9AM MST
- JM: Prototype for new tool for launching Jupyter Books locally
- OE: Held a two-day ESDS fall event last week in which there was Python tutorials (50 attendees; Julia led the Pythia-related content
- RF: working on CESM cookbook
- MG: still working on integrating radar cookbook with Binderbot
- JK: focusing on updating the infrastructure in cookbook
Agenda (with notes from the meeting):
- Testing GitHub-based auth + HTTPS this week
- Celebrate closed and discuss open Infrastructure Issues and PRs
- Do these still need to be done manually? (Follow up with Drew)
- As mentioned in issue 231 is there a best practice we should follow to get these quickly reviewed/merged?
- What should we look for as potential reasons to not approve/merge?
- The issue that has bedeviled us has been in the pre-commit jobs which wants to include pre-release versions of tools (e.g. alpha version of some of the related packages )
- OE: with geocat, they do not use pre-commit autopush services at all. Instead, have the developers use it in their local versions when they do a commit. It gets developers in the habit of best practices/standards, e.g. PEP-8.
- BR: this is where we started, but then moved to enabling the automatic precommit. We made this decision in order to ease the path for external contributors. MG concurs.
- Until the pre-alpha issue goes away, we can either:
- Continue the manual merging
- Try dependabot
- TODO: For these autobot updates, can we relax the reviewer requirement to 1? (update #231 to that effect)
- Specify criteria for inclusion on this page, beyond those who are funded by the project, e.g.:
- Attends meetings semi-regularly
- Has made contributions to Pythia materials
- MG: suggests making this a monthly agenda item as opposed to automating it
- Good thing for OWG to do: come up with a list of folks and ping them
- Closember.org does automation by populates their list of supporters by those who star the project
- Put a call out on the Pythia Pangeo discourse page (nicely ties into next item below)
- Any updates post-11/7 EWG?
- Is Foundations #322 ready for review?
- Not yet - need to update our foundations infrastructure first
- LibraryNB: Jupyter notebook library handler for running Pythia notebooks entirely locally … and also shows what notebooks you launched using this tool
- Akin to Binder, but does not use containers
- More like an automated way of going through the steps we enumerate to “run this notebook locally”
- “Very alpha” - not on GitHub yet
- Tutorials might still benefit from its own organization; fold others into single organization
- There is agreement on going forward with this, but it will need to be carefully orchestrated
- Can we do this 1 repo at a time, or does it need to be done all at once?
- Perhaps start with the cookbooks repo this afternoon?
- Start with https://github.com/ProjectPythiaCookbooks/xgcm-cookbook
- Keep cookbooks.projectpythia.org as the gallery landing page, but move cookbooks to projectpythia.org organization (there is a lack of flexibility in how you can use CNAMEs): downside, URL to the cookbooks won’t have cookbooks in the URL; upside: single organization; better discoverability
- BR will write up the sense of today’s discussion and the options on the Pangeo/Pythia discourse
2022-11-07: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Elena Romashkova / NCAR / @rmshkv
- Rob Ford / UAlbany / @r-ford
- Rich Signell / USGS / @rsignell-usgs
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- MG - Dask content ready for another review, incorporated suggestions
- PR #321
- JK taking a look at this this week
- JK - help going through list of typos
- Walking webtree created - able to see where there are typos, etc.
- 48 pages - LINK
- KT - students are putting together ERA-5 visualizations for our Science on a Sphere … good cookbook idea?
- Students used 1 day weather event
- Upload map to maptoglobe.com
- DC - local intro student Python workshop summary
- 2 half day synchronous CSU workshop (grad and returning grad students)
- Resolve programming barriers to scientific programming
- Feed these concepts back into Pythia material!
- Awaiting post-workshop feedback
- Gave away stickers!!
- CSU has a dedicated programming TA for classwork
- CU Earth Lab has some similar content with an accompanying certificate program as well: https://www.earthdatascience.org
- Course certification
- Ask multiple choice questions to encourage working through the notebook
- Cover a lot of similar content (limited to remote sensing) - Xarray eg
- BR - finished last big push on Cookbook infrastructure (binderbot stuff)
Agenda + Meeting Notes:
- Talked about investigating alternatives to Twitter
- MG asking Scientific Discourse forum and where to go - Mastodon
- Pangeo Hackathon, last day of AMS - registration closed at end of last month. Link?, Pythia breakout group
- Summer Pythia hackathon - picking dates
- Sign up for AMS short courses
- Close to 500 twitter followers!
- Update on Cookbook infrastructure (BR - can defer to next week’s IWG if there are more pressing issues)
- Looking at HRRR-AWS cookbook. `_config` file specifies to execute in Binder (override Jupyter execution options), and specified binderhub_url (specifies where launch buttons point to)
- As Cookbook author, have to manage config file
- Checked by GH nightly build
- Jupyter Book gets confused if don’t set flag to false in step “disable …” vs cache
- Rendered book updated by GH actions
- Draft PR to test going back and forth between build platforms (simple one line on config file change)
- Some notebooks won’t run on GH actions - motivator for work
- Backwards compatible
- Be able to run locally, not rely on GH actions? Hoping for JupyterBook to hold this functionality
- Document vs black box mentality
- Python 3.11 broke a lot of things
- Binderbot is not currently maintained and as a result must be pinned to <=3.10<3.11
- Trigger-book-build uses gh action to build book - specifies environment file - allow using different file name?
- Put together list of priorities for education before next meeting - BR
2022-11-1: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
Quick Updates:
Agenda:
Social Media Platform: investigate other alternatives to Twitter? Perhaps Mastadon, which is a decentralized sharing platform? Is there an existing Mastodon group that we could join?
MG will ask on the Scientific Python discourse forum
There is a Twitter group chat with a lot of the Pangeo folks, “Scientificish Python Twitter”; MG will also poll that group to see if there is some interest in another platform
Closember: make a push to get Pangeo gallery content moved to Pythia Cookbooks
However, if we move the Cookbook repo in two weeks that may engender confusion
AGU:
Outreach through our posters/talks; on Monday, there is an open science meeting and an ESGF meeting
AMS:
Pangeo Hackathon Friday of AMS week; registration deadline was yesterday; MG will ask Max Jones tomorrow during Pangeo call
Pythia summer 2023 hackathon: time to start investigating dates; mention during tomorrow’s Pangeo call
Is anyone still attending the Pangeo Open Science Working group? Last meeting was 7/1/2022
Pangeo showcase: would be good to have someone present; there are openings 11/9 and 11/16. KT is willing but not until Jan. 2023; JK will check her schedule; maybe 11/16
2022-10-31: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / clyne
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Orhan Eroglu / NCAR / @erogluorhan
- Ryan May / Unidata / @dopplershift
- Rich Signell / USGS / @rsignell-usgs
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: John giving a lightning talk next week at NSF “retrospective” meeting. Plans to highlight sustainability. Input on slides (last slide, in particular) welcome. Slides will be submitted Friday.
- BR: attended TOPS call last Tuesday (along with JC/KT)
- MG: radar cookbook’s binder link now links to the provisional BinderHub link (README needs updating too); looking to also deploy a binderhub in the ARM/ORNL space
- JK: will update the upcoming Pythia meeting schedule
- RS: has started on a cookbook (Hurricane Ike visualization on an unstructured grid) … developing on ESIP infrastructure (JC notes that this ties in with another EC-funded project (Raijin) on unstructured grids, OE is also on Raijin
Agenda (with notes from the meeting):
- BR asked in last week’s webinar whether an infrastructure-focused proposal would be welcome for this call … this call is more content-focused, so probably not. However a cookbook on how to write cookbooks might be a possibility.
- Timeframe remains limited … Unidata does not have bandwidth … BR will make a decision later this week … will at least likely work on the notice of intent
- HTTPS supported
- Kevin will test the prototype Binderhub beginning Thursday 11/3 … expect to not rely on the Hub instance for several days
- Tutorials might still benefit from its own organization; fold others into single organization
- There is agreement on going forward with this, but it will need to be carefully orchestrated
- Can we do this 1 repo at a time, or does it need to be done all at once?
- Perhaps start with the cookbooks repo
- Project dashboard
- Discuss Open Infrastructure Issues and PRs
- Binderbot: Cookbook designer should be able to choose between whether to build on GH actions or on Binder. This can already be user-specified in the Jupyterbook config file … but has a potential downside in terms of causing a failure with the JB build “if you don’t know what you’re doing”
- MG: is this worth upstreaming to Jupyterbook? BR: yes, but need to get a draft fleshed out first. MG will post on our space in the Pangeo discourse forum
- Issue on GitHub here: https://github.com/ProjectPythiaCookbooks/cookbook-template/issues/89
- The binderbot tool probably needs maintenance before it could become a dependency of Jupyterbook. We’ll do the “proof of concept” work.
2022-10-24: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Rich Signell / USGS / @rsignell-usgs
- Ryan May / Unidata / @dopplershift
- Elena Romashkova / NCAR / @rmshkv
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- Stand-up time to share around the room - 1 min checkins
- JC: Pythia PIs are still having discussions on NASA TOPST solicitation. A decision on whether to pursue is forthcoming
- JC: We have a quarterly meeting on the calendar for Thursday. John will likely reschedule and make a Senior Personnel meeting.
- BR has shared a few demos on Binderbot infrastructure, is making progress
Agenda + Meeting Notes:
- Abstracts accepted and conference season
- Use of the Discourse space?
- Open discussion re “scope of foundations”
- Call for contributions, participation in community events (eg hacktoberfest, closember, etc.)
- Open discussion on “review” process for cookbooks
- Pythia presence? Individuals will be there, but there is still room to have a proper Pythia focus at the hackathon
- Zoom headaches and solutions - will test out switching to a UAlbany configured zoom
- Expanding our cookbook gallery - discussion
- Tremendous interest from the broad geosciences community, but at the moment we only have 4 published cookbooks
- Revived discussion on review, attribution, and emphasis on “community” aspect of cookbooks
- Status of planned submissions
- Regridding cookbook (Thomas Martin)
- WIP. repo exists, but has not been added to gallery yet.
- Consolidated content from Pangeo + EarthML (Jim Bedner, Anaconda; Max Jones, Columbia)
- Plan was to hold off until infrastructure issues (binderbot) resolved
- Our stuck Pangeo migration cookbooks
- Not at the top of any one person’s plate. Some compute and data limitations, but needs revisiting
- Cloud centric workflows (Rich Signell)
- Fall 2022 ATM433/533 (Kevin Tyle)
- Intake cookbook (James Morley)
- VAPOR 3D visualization (Julia Kent)
- CGD/Matt Long (Elena Romashkova)
- Potential topics include CESM (and others) data wrangling, comparisons to eg obs data, analysis and viz workflows
- Open discussion - how are we feeling about the gallery content?
- Some content is waiting on infrastructure! (BR and others)
2022-10-18: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Max: had a meeting last Friday with LROSE/CSU … they are looking into converting their existing educational content into notebooks and are also looking to spin up a BinderHub on Jetstream2 … Max/Kevin will follow up with them
- TOPS solicitation: MG agrees it is worth pursuing; at previous TOPS meetings, their folks are aware of Pythia and view it positively
- AGU: JK and MG presenting talks Monday 12/12; KT poster Wed 12/14
- Discourse post about scope of Foundations and more generally opening up EWG participation asynchronously
- Pangeo hackathon during AMS: Friday 1/12
- MG: bringing stickers to DC ARM conference this week, and also ARM trading cards …
- Call on community to contribute youtube series and video walkthroughs of Pythia content?
2022-10-17: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Orhan Eroglu / NCAR / @erogluorhan
- Rich Signell / USGS / @rsignell-usgs
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Julia - Sorry all I have to miss today’s meeting because I’m sick.
- Drew - note that Max added a pinned topic on the Pangeo discourse
- Rob - working on new notebook with binderbot
- Orhan - EDSS event 11/10 11/11 … mostly NCAR folks … Project Pythia will be discussed – event will be updated as more info comes in
- Rich - curious about TOPS education push
- John - had a good vacation!
- Brian - exciting updates on cookbook infrastructure, wrt Binderbot
Agenda (with notes from the meeting):
- AMS: Max, Drew, Julia, Kevin have abstracts accepted
- AGU: Kevin is presenting a poster; believe there is one other talk from the pythia folks
- Config.yml file that is in every cookbook has a binderhub_url; point it to the same binder instance where the cookbook will launch; thus, we can set the execute_notebooks setting to ‘off’
- JC: what is the long-term plan for the binderhub service? Will we be needing to continually update the links to the binderhub?
- BR: one advantage of this approach is that any notebook can use any binderhub server they wish. For now, using KT’s prototype binder is used for testing
- KT: Currently the provisional allocation has run through 10% of its CPU allocation
- DC: what about authentication on binderhub? (BR: https://binderhub.readthedocs.io/en/latest/authentication.html)
- BR: Should executing the notebooks via a Binder be a requirement we build into the cookbook infrastructure (simpler (JC likes this idea)), or can it be an opt-in via some sort of switch (needs to be documented; DC would prefer this idea if it doesn’t slow things up much)? BR can fairly easily write up the switch option; will take longer to write it up for a contributor’s guide
- BR: Should we save notebooks fully-rendered? DC: unless you want to take advantage of GitHub’s notebook-rendering capability, probably not needed in practice. BR: Pangeo’s gallery, when originally set up, did save the output of their executed notebooks in a separate branch.
- JC will examine the call in more detail in the next few days, perhaps a meetup later in the week if time involves
- Anaconda / Holoviz team as a partner?
- BR: section 3.1 of the call has a clear overlap with our Cookbooks.
- BR/RS: 2i2c, Quansight would both be potential partners
- RM: NASA’s call presupposes adapting the “NASA way” … NASA data providers, e.g.
- JC will reach out to Chelle
- DC: JK and MG have been regularly attending the TOPS calls … will discuss on tomorrow’s OWG call (JC will plan on attending)
- Should we fold all of our repos into a single organization?
- Tutorials might still benefit from its own organization
- Are there any logistical obstacles to folding the other orgs into the main Pythia org? BR thinks likely not.
- Discuss Open Infrastructure Issues and PRs
- GitHub project/Dashboards: further thoughts after Orhan’s presentation 10/3?
2022-10-04: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR @jukent
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- MSCAR connections + takeaways
- Alicia Hoffman from UW Madison interested in getting involved, invited her to the upcoming EWO/outreach meetings
- Phillip Chmielowiec - working on Project Raijin related interactive plotting capabilities, encouraged to check out
- SIParCS student?
- Discourse post about scope of Foundations and more generally opening up EWG participation asynchronously
- Discourse post Call for Cookbooks
2022-10-03: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- These PRs both demonstrate farming out the execution of notebooks to an external Binder service instead of executing on GitHub Actions
- This means a more bare-bones environment and less resources needed on GiHub Actions
- BinderBot is the tool that makes this possible:
- Opens a session on the Binder service with the full computational environment
- Sends the notebooks to that binder to be executed
- Downloads the executed notebooks back to GitHub Actions
- We can then proceed with jupyter-book build … using the already-executed notebooks.
- The environment image will be saved on the Binder service, so if we point the “Launch Binder” buttons at the same service, we’ll get good launch times and ensure performance (it’s the exact same environment that was used to build the content in the book)
- These examples are running on http://binder.mypythia.org/, KT’s experimental binder on jetstream2. But we should be able to point them to any available Binder service.
- This approach seems to work well. The PRs need some cleanup but the call out to the binder works great
- Questions:
- Does this seem like the right path forward for Cookbooks?
- Are we likely to have a sustainable / scalable binder service on jetstream2 (or elsewhere?)
- Discuss Open Infrastructure Issues and PRs
- GitHub project/Dashboards
Action items
- KT + JC + JK:: triage open infrastructure issues
2022-09-26: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- James Munroe / 2i2c / @jmunroe
- Ryan May / Unidata / @dopplershift
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: Invited to talk about Pythia at and EarthCube retrospective meeting in November, and an NOAA organized AGU panel on ML
- BR: making progress on binderbot for executing notebooks. Will report at next IWG
- MG (not able to make meeting since we have a lab-wide meeting at the same time)
- Submitted a draft PR adding xarray + dask material (#321)
- Teaching a tutorial on Saturday at UIUC (link to materials)
Agenda + Meeting Notes:
- Pythia Discourse Proposal [DC]
- Moving forward today to request a #Project Pythia subcategory within #Education
- Will need to add overview to Discourse and link on projectpythia.org
- Cookbook submission process [DC]
- Revive discussion on submission and review process and organization management
- Problems
- Requires a point person with org access
- Requires membership first to transfer repo, then can be “downgraded” to external collaborator if needed
- No review process, no shared ownership/responsibility
- BR: we could have “editors” to facilitate reviews and submissions, similar to journals.
- We would need some infrastructure for this
- JC: for long term maintenance, can assign an editor to each cookbook
- BR: 2 maintainers: the original contributor and someone active in Pythia
- Use Discourse to continue this discussions with others
- JC: add to contributor’s guide
- Decided to add xarray + dask material instead of full dask tutorial: see MG draft PR
- BR: demo of building Foundations on KT’s binder with binderbot (PR #322). Can point to other binders (Pangeo’s?) as needed.
2022-09-19: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
- James Munroe / 2i2c / @jmunroe
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- [KT] Going through existing open infrastructure issues. There are many.
- [DC] EWG nudge to check out Pangeo Discourse discussion for new Pythia community communication home
- [MG] Secrets are now enabled with cookbook. Radar cookbook can now run health checks with credentials.
- MG can draft some content for Cookbook contributor’s guide that explains how to set this up.
- [RF] Continuing workflow on regridding CMIP data
- [DC] Thomas Martin started at Unidata as AI/ML expert, putting together Cookbook content evaluating and comparing different regridding libraries. Good survey cookbook material, should be a PR appearing this week.
- [JMun] will discuss Binder + 2i2c (agenda item)
Agenda (with notes from the meeting):
- Follow-up on Ryan Abernathy’s Pangeo discourse post
- Community has converged on the need for a self-contained build / execute environment for cookbook-style content, e.g. building on BinderBot
- Who’s going to do the work?
- JMun: we had a technical discussion with 2i2c folks and Ryan A about what needs doing.
- One suggestion was to extend Dask Gateway to accept secrets (so Binder can leverage more powerful tooling)
- Also a plan to deploy a JupyterHub Pangeo Binder instance (again)
- Can clarify whether 2i2c developer time is available for this
- Haven’t scoped out how to get the infrastructure on Pythia side off of GitHub actions.
- Pythia allocation on Jetstream2 is still a possible path forward
- KT: we should sit down with 2i2c people and hash out a plan
- JMun: tentative plan to deploy AWS Pangeo binder instance
- Who’s going to propose alternative workflow for Cookbooks to call out to whatever resource we choose?
- KT: sit down with BR this week to take a quick first look at BinderBot. We will report back at next IWG.
- MG: similar discussions going at ARM, may be some collaboration opportunities
- JMum: Yuvi Panda might be an ideal person to be technical lead, if resources are available.
- Celebrate Merged PRs!
- BinderHub update
- Prototype instance remains up (but may be periodically offline without notice as I continue to test things)
- Pangeo oceanographic cookbooks remain incomplete pending PangeoForge recipe approval
- Discuss Open Infrastructure Issues and PRs
- We have dozens of Infrastructure issues spread across multiple repositories. Some are prioritized, and some are not. Some assigned, some not. Some are probably no longer relevant.
- Currently open Infrastructure-tagged issues and PR’s
- How do we manage these better and make sure important work is getting done with the limited resources we have?
- Can we do some housekeeping?
- Triage issues and assign priorities?
- Make assignments for high priority items?
- Find a way to manage the dispersed issues from a single point? E.g.. a multi-repo project board?
- Can we incorporate some Agile project management? E.g. a dedicated Project Owner, sprints, continuous releases, etc?
- KT + JC will go through and do some triage / prioritization / assignment between now and next IWG meeting
- James Morley: report on JupyterLite
- Could this be useful for Pythia infrastructure? What content could use this?
- Local version of Pythia Foundations running in JupyterLite on a laptop
- Spins up fast
- Can host as a static web page anywhere
- Get a jupyterhub instance. Importing python packages is a little slower
- Things seem to run ok
- Hurdle seems to be that you can only use specific packages that have been ported to web assembly (unless it’s a pure python package)
- E.g. netCDF4 package currently not available in pyodide distribution
- MG: getting closer! More packages being added
- Limitations of web assembly … import of multithreading packages won’t run properly.
- E.g. pythia-datasets package does not work (because has to make data request)
- RM: opendap calls are *possible* but have to go through the javascript interface. Fundamental mismatch between javascript i/o and python-based i/o
- Ongoing discussions about making this easier
- Probably not ready for most Pythia infrastructure, but a possibility for a Cookbook if it’s compatible
- RM: zero infrastructure cost! Even with current limitations, there’s still reason to be optimistic about using JupyterLite for some of our content.
Action items
- KT + BR (+ RF? + MG): sit down and look at BinderBot, report back to IWG
- KT + JC: triage open infrastructure issues
2022-09-12: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Ryan May / Unidata / @dopplershift
- Kevin Tyle / UAlbany / @ktyle
- James Munroe / 2i2c / @jmunroe
- James Morley / UVU / @jnmorley
- Drew Camron / Unidata / @dcamron
- Thomas Martin / Unidata / @ThomasMGeo
- Max Grover / Argonne / @mgrover1
Quick Updates:
Please sign in on this page above (and add any quick updates here)
- JC: Jim Bednar (Anaconda) Max Jones (Columbia), Ryan Abernathey, John Clyne, and Kevin Tyle met about finding a sustainable, permanent host for Pangeo-related examples, consolidating content from the Pangeo Gallery, EarthML, and Project Pythia. Anaconda decided that Pythia was the right long-term location, given its infrastructure, focus, and sustained basic funding level. Anaconda to select and port relevant content to Pythia.
- JC: Related to the above, Ryan Abernathey has made this post proposing to contract 2i2c to develop extensions to the Jupyter ecosystem to facilitate building notebooks/books remotely.
- rford2@albany.edu is looking for regridding support in CMIP6 content
- ktyle@albany.edu is testing some of Pythia’s content in-class with some feedback on missing areas (GitHub FAQs would be helpful); we can look forward to content
- James Morley has been working with benefit of Jupyterlite to Pythia content and intake++ cookbook content and Pangeo tutorial transition
- James Munroe:
- Developed a JupyterHub/Cloud data tutorial for the M2LInES community:
- Will try and follow options for additional compute resources for building Pythia examples from the 2i2c side.
Agenda + Meeting Notes:
- Direction for clearing up Pythia’s contact pathways
- OWG proposes working with Pangeo to get/repurpose a category on its Discourse for Pythia efforts and community, will chat with Pangeo after EWG approval [tentative yes]
- Elevate access to the “Community” section of projectpythia.org with a button or other visual access near the top; potentially modify that landing more substantially as well
- Primarily use repo discussions internally
- Getting Pythia’s calendar onto Pangeo’s calendar
- Drew Camron/Unidata running a AMS 2023 student conference if
- Thomas (Unidata, @ThomasMGeo) & intermediate and “one-pager” content [Drew + Thomas]
- New intermediate foundations book?
- Encouraging micro-sized cookbooks? and/or supporting “cookbooks” that have varied content not centered around a singular workflow
- Old open PR #109 on Foundations to be revisited
Action Items:
- [Drew Camronktyle@albany.edu?] appear at Pangeo weekly meeting this week to pitch Pythia Discourse tag
- [Drew Camron] email EWG confirming Pangeo Discourse move
- [Thomas Martin] open regridding cookbook repo, even if just one notebook
- [Max G] open PR with subset of existing dask content as intro in Foundations
2022-09-06: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
Agenda (with notes from the meeting):
- https://github.com/ProjectPythia/projectpythia.github.io/issues/267
- JC: this is concise/direct: a good, clean easy way to have visitors find and contact us
- BR: the link takes visitors to a GitHub discussion page (but it’s one discussion page amongst several other of our GH pages with discussions) … and we have something similar, under our “Join us!” link. Maybe reorganize our landing page so that link is easier to find.
Action items
- KT will reach out to Curtis Walker
- Establish internship/postdoc emails of interest [DC]
- Discuss Discourse w/ Pythia EWG -> Pangeo meeting to discuss presentation
- Get on Pangeo Developers Meeting Calendar? (Oopsie?)
2022-08-29: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / John Clyne
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Ryan May / Unidata / dopplershift
- Rich Signell / USGS / @rsignell-usgs
Quick Updates:
- Please sign in on this page above (and add any quick updates here)
- Note: Drew will be passing along a signup sheet for folks to volunteer to lead each EWG meeting
- OWG report (see notes from most recent OWG meeting below)
- AMS abstracts are due in two days (8/31)!
- Pangeo hackathon @ AMS: still under discussion
- UCAR EDEC group (see below from JC)
- KT: ATM433/533 has started!
- JC: Project Pythia annual report approved by NSF! Thanks, all!
- JC: Members of Pangeo (Max Jones et al) have reached out about hosting ML focused notebooks on Pytha (cookbooks). John and Kevin attending. Meeting calendar invite for September 1 at 11 AM ET. Zoom coordinates: https://us02web.zoom.us/j/89565931140?pwd=ZjBZNmlhNERoRXdWdytSRTBWa3FYZz09
- JC: UCAR EDEC group reached out about possible collaboration with Metropolitan State U (Denver): interested in Python and AI/ML material for their undergrads
- RS:
- I’m interested in helping to develop curriculum or training to help onboard people to cloud-native geospatial, concentrating on environmental simulation data. So I’m wondering whether to: (1) work on docs for packages I care about, (2) generate cookbooks for Project Pythia, (3) contribute to the NASA TOPS curriculum, (4) start an ESIP Cloud Pathfinder project for USGS, (5) write an O’Reilly Book on “Cloud Native Geospatial with Python”, (6) participate in ocean hack week, (7) some combination of above…
- This 12 min Ted Talk I learned about from one of Julia’s tutorials (a URL at the bottom of a slide) about how to ask for help (and now not to ask for help) blew my mind: https://www.youtube.com/watch?v=h-TIRIxhq6E
- I have a workflow to convert ECMWF ERA5 API requests to cloud-optimized data on S3. It accesses the CDSAPI using Dask, extracting NetCDF3 files that turn into chunks in NetCDF4 files. These NetCDF4 files are compressed using BitInformation and then kerchunked (and CF-ized) into a virtual dataset that can be accessed using the Zarr library using Intake.
- KT: Sounds like a good basis for at least a couple cookbooks!
- BR: Good opportunity to focus on the implied edge cases here, where the data is not quite in analysis-ready form
- DC: Consider making a blog post
- JK: See xarray discussion which is related to these types of edge cases
- Regridding: Ryan Abernathy reported at the Pangeo weekly meeting last week he has a new flexible, fast regridding algorithm that uses only xarray and geopandas (new capabilities in geopandas). Basically creates polygons around each grid cell in the source and target grids/meshes and does exact polygon geometry calcs. Promised to post his example in the Pangeo Discourse, so keep an eye out there.
- JK: need a response re: PR 316 (see Action Items below)
- BR: Grad-level GenCirc course is in Jupyterbook; perhaps some material could be contributed to cookbooks
- RF: using Pangeo CMIP6 material for research; will create and/or modify existing Pythia Cookbook CMIP6 notebooks to include xesmf, xmip
Agenda & Meeting Notes:
- Discussion on “how to reach Pythia” (notes from last meeting, missed again)
- Including the challenge of having three separate orgs!
- https://github.com/ProjectPythia/projectpythia.github.io/issues/267
- JC: this is concise/direct: a good, clean easy way to have visitors find and contact us
- BR: the link takes visitors to a GitHub discussion page (but it’s one discussion page amongst several other of our GH pages with discussions) … and we have something similar, under our “Join us!” link. Maybe reorganize our landing page so that link is easier to find.
- Succinct brochure or slide deck about Pythia?
- RS: suggests leveraging Pangeo Discourse: create separate subforum or tag
- BR: we’ve had some high hopes to use GitHub discussions (DC: this was part of our more general education-related focus on using GitHub), but may be obscured by the large # of repos/orgs within Pythia. If we decide to move to discourse, we need to spend some time closing/resolving our existing discussions on GitHub
- BR: have just one GitHub discussion on our top-level Pythia org, and have a link on our main page, a la Fatiando’s, that goes directly to it.
- DC: how can we point folks to a “real human/humans” at Pythia?
- JC: on a Contact Us page, break it down into: 1) Discussion Forums, 2) Email to the Pythia UCAR group 3) Attend one of our weekly meetings 4) Schedule a separate “Meet us” meeting
- We already have an “email us link” so it’s ok to generally give folks the ucar.edu email group address
- Similarly, we already have all of the “Pythia Humans” listed on the landing page
- Once again, this argues for the creation of a short paragraph that encapsulates what folks should do depending on the nature of their “Contact Us” query
- Project Raijin Contact Us page is an example of a more concise Contact Us page
- Further discussion will take place at a forthcoming OWG meeting
DC: check out FOSS4G-related discussion and content: https://github.com/ProjectPythia/projectpythia.github.io/discussions/273; https://github.com/pangeo-data/foss4g-2022/blob/main/tutorial/afterword/pythia.md
Action Items:
- Review PR https://github.com/ProjectPythia/pythia-foundations/pull/316
- Tweet out our updated Monday weekly meetings (Done, thanks to JK!)
- Clean up existing discussions that we have now across our Pythia repos
- Make our Contact Us link easier to find / more prominent on our main page
- DC will go over the above two items and discuss more at next OWG meeting
2022-08-23: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / @ktyle
- Drew Camron / Unidata / @dcamron
- John Clyne / NCAR/ @clyne
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
Agenda (with notes from the meeting):
- EDEC discussion. Notes here
- Can support in a variety of ways
- Seeking funding
- Developing training
- Organizing hackathons, workshops
- Incorporate Pythia into existing NCAR|UCAR training, postdocs, internships (SOARS, SIParCS)
- Connect with the NCAR Geo REU program
- AMS abstracts (at least for EIPT, Python, Education) deadline is now next Wed. 8/31
- AMS side room request due 3 Oct if unable to get Pangeo hackathon space from AMS Python
Action items
- KT will reach out to Curtis Walker
- Establish internship/postdoc emails of interest [DC]
- Nudge AMS & AMS Python, Pangeo re Pangeo+AMS hackathon
2022-08-22: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent (featuring Pabu and Raava)
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT: start of semester today at UAlbany, little time since last IWG
- AMS Abstracts Due Wed. 8/24! Python Symposium + EIPT deadline extended to 7 Sep
- New OWG meeting schedule (just bumped up one week)
- JC: there are currently two, duplicate Google Calendar invites for this meeting. One will be canceled. Please accept the other. Ultimately, this will fix the calendar problem noted last week noted by Brian. R.
Agenda (with notes from the meeting):
- Thanks to BR for spearheading Cookbooks’ Binder links change to MyBinder from the test 2i2c BinderHub
- Prototype instance remains up (but may be periodically offline without notice as I continue to test things)
- MyBinder remains painfully slow, to the point of being unusable
- Look into Pangeo Binderhub in the interim … it does require a GitHub account.
- JupyterLite is worth exploring … issue is remote data access … you are limited to what is available in a browser/JavaScript framework such as JupyterLite. Add JupyterLite as a badge, side-by-side with a Binder badge? Currently Cartopy is not supported by JupyterLite, but hopefully getting closer.
- Pangeo oceanographic cookbooks remain incomplete pending PangeoForge recipe approval
- Automation for maintaining infrastructure across all Cookbook repos
- Centralize more of the Cookbook infrastructure into a shared repo?
- Pros and cons relative to distributed repositories with bots that open PRs on individual repos
- Discuss Open Infrastructure Issues and PRs
- Is this resolved by PR 64?
- RM: probably more useful to be notified that there is a problem … perhaps include a code block with stack trace
- DC: Edit cell metadata to add “raises-exception” flag
- This PR is unassigned … right now, cookbooks do not have anyone automatically assigned to them. What should we do about this?
- Who is/are the long-term maintainer(s) in general across all cookbooks?
- In the case of this PR, this is an update to an existing cookbook. Go with the practice of having the original author(s) be notified, a la how it’s done when someone submits a revision to a conda-forge recipe?
- Is there something akin to stalebot that could avoid having PR’s/Issues sit unattended? https://github.com/marketplace/stale
- Keep your GitHub notifications open!
- PR #316 (Foundations) How to Review a PR
- We have dozens of Infrastructure issues spread across multiple repositories. Some are prioritized, and some are not. Some assigned, some not. Some are probably no longer relevant.
- How do we manage these better and make sure important work is getting done with the limited resources we have?
- Can we do some housekeeping?
- Triage issues and assign priorities?
- Make assignments for high priority items?
- Find a way to manage the dispersed issues from a single point? E.g.. a multi-repo project board?
- Can we incorporate some Agile project management? E.g. a dedicated Project Owner, sprints, continuous releases, etc?
Action items
- Change configuration setting in notebooks to not allow errors and see what breaks
- Note any cells in any notebooks that are expected to fail and need the “raises-exception” flag in the cell metadata
- Document how to find and change the raises-exception flag in the edit metadata cell
- Examine stalebot
- KT: do a first pass at triaging open issues/PRs and discuss at next IWG 9/19
- Investigate JupyterLite … perhaps side-by-side with a Binder tag.(Julia will see if James Morley might have the bandwidth for this)
2022-08-18: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Rich Signell / USGS / @rsignell-usgs
- Max Grover / Argonne / @mgrover1
- Ryan May / Unidata / @dopplershift
Quick Updates & Open Floor:
- Please sign in on this page above (and add any quick updates)
- Keeping the rotating meeting chairs - please let us know if you have issues with this :)
- BR and KT can only attend first ~20 minutes today
- EWG and IWG meetings seem to have disappeared from the google calendar
- Need to copy this over - add all the other days
- Lisa is the meeting owner - John working on getting this sorted out
- Monday 1 PM MT is the new time :)
- MG: Running into issues of getting environment secrets into cookbooks - enables interfacing with data portals
- JC: NSF annual report is ready for review here and will be submitted next Friday. Please use “suggestion” mode for any changes. Thanks all!
- KT: Fall 2022 ATM433/533 class begins Monday - students will contribute cookbooks as part of their class project
- Will be a cookbook! Several teams working together, should have more content!!
- 11 new cookbooks 🚀
- Rich Signell: Retiring from USGS Oct 2023, moving to phased retirement in January 2023 (½ time). Phased retirement requires mentorship activity. Planned activity is to create content to fill in gaps in the full stack of tools/best-practices for environmental simulation data workflows, considering doing this by contributing to Project Pythia. With weekly meetings tackling real world use cases with a team of 10 USGS early career/mid career folks.
- Able to go to phased retirement - spend half time on mentorship activities
- Filling gaps with cloud-centric workflows, and adding cookbooks + components of education
- No updates from Rob, Drew, Ryan
Agenda (with notes from the meeting):
- General Pythia update during community python session
- Julia planning on submitting
- NVIDIA RAPIDS lightning data cookbook
- Kevin T considering submitting an abstract here
- Abstract on radar cookbook during tools
- Would be good to have an update here
- Inviting John to next OWG meeting, EDEC recommendations
- Project Pythia Workflows with Rich
- “Modern simulation stack”
- Tried downloading data from ERA5, lots of dask, interactive viz, kerchunk things!
- No foundational materials on:
- Dask, Holoviews suite, STAC, etc.
- Great place to foster collaboration
- Cookbook to start with
- Highlighting aspects of how you might convert some collection of data (netCDFs, grib, etc.) into cloud optimized dataset
- How to use kerchunk + zarr
- Good place to start :)
- Cross reference, mention in workshop materials
- Plan is to fully convert most of training materials to Pythia cookbooks
- Foundations is more focused on what you need to get started with the stack
- Cookbooks is workflow focused, not necessarily foundational
- Might have a place for package-focused cookbooks that fall outside of foundational materials
- Dask content would be great!!
- Show when things go wrong…
- Delayed functions, etc.
- Real world use-cases
- Near top of priority list
- Room for resources to tailor for specific projects, groups
- What are techincal/community needs required?
- Helping within community - nothing much USGS-specific
- Only USGS-specific things are HPC/Cloud configurations
- Would be great to work with AWS/other cloud providers with open compute
- One of the original goals was helping people move to the cloud too
- Big question - if you could train a cohort, where would you start?
- Could have people take Kevin’s class :)
- Materials focused around where can/should people start?
- OpenDataCube testing learners
- Ways to evaluate people’s learning progress?
- Including the challenge of having three separate orgs!
- Succinct brochure or slide deck about Pythia?
Action Items:
- Nudge “contact pythia” discussions for asynchronous activity [DC]
2022-08-16: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / @ktyle
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
Agenda (with notes from the meeting):
- NASA TOPS meeting 8/11 highlights
- ~200 attendees, tough to set up breakout rooms in Teams
- Barely got through the introduction stage
- Will hold another meeting
- Conference this fall at Langley (registration deadline has passed though)
- Pythia abstract for Education + Outreach
- Radar cookbook in data viz and discovery
- Kevin - possible abstract on lightning data + Rapids
- ARM github tutorial and radar cookbook, open science workshop
- Climatematch academy
- TOPS
- Earthcube workshop support
- Makepath
- Pangeo
- Invite JC to next meeting, skim EDEC recommendations
2022-08-11: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- James Morley / UVU / @jnmorley
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Kevin Tyle will be our next chair of the IWG
- BR to lead today’s meeting during transition
Agenda (with notes from the meeting):
- Prototype instance is up (but may be periodically offline without notice as I continue to test things)
- Plan for allocation request to Jetstream2 for a “permanent” Pythia Binderhub
- Need to estimate what resources we’ll need: will liaise with Pangeo Binderhub maintainers
- Lightweight at the moment, not much memory, may not scale to many users (for now)
- Having our own BinderHub under our control:
- More responsive and performant
- Persistent storage – maintain state of notebooks
- Increased user time
- Can strategically prebuild docker containers to guarantee fast launch for users
- Migrate to new Pangeo binder service?
- Move them all back to mybinder.org for now! (HIGH PRIORITY)
- KT: SciPy sprint resulted in submitted PRs for pangeo-forge recipes related to oceanographic data that are needed in some of our Cookbooks…. Still waiting for them to emerge from PR purgatory
- Automation for maintaining infrastructure across all Cookbook repos
- Centralize more of the Cookbook infrastructure into a shared repo?
- Pros and cons relative to distributed repositories with bots that open PRs on individual repos
- Cookbook health checks and failing notebooks
- Meeting formats: open discussion
- BR will have a teaching conflict every week starting August 25 – will be missing first half of every meeting.
- We will look for a new meeting time (JC + admin staff)
- Discuss Open Infrastructure Issues and PRs
- We have dozens of Infrastructure issues spread across multiple repositories. Some are prioritized, and some are not. Some assigned, some not. Some are probably no longer relevant.
- How do we manage these better and make sure important work is getting done with the limited resources we have?
- Can we do some housekeeping?
- Triage issues and assign priorities?
- Make assignments for high priority items?
- Find a way to manage the dispersed issues from a single point? E.g.. a multi-repo project board?
- Can we incorporate some Agile project management? E.g. a dedicated Project Owner, sprints, continuous releases, etc?
Action items
- Redirect all Cookbooks to mybinder.org for now to fix broken links (BR)
- Open Issue about shared Cookbook infrastructure
- Send JC our schedules (BR + KT)
- Look for a meeting time with fewest conflicts (JC)
2022-08-04: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julien Emile-Geay / USC / @CommonClimate
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Deborah Khider/USC ISI/@khider
- Ryan May / Unidata / @dopplershift
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Met with Rebecca Hacker, early career and development group, advanced study program, serves postdocs, build university partnerships
- They had a number of recommendations for raising awareness and helping us with tutorials, workshops, administrative teams, registration, resume building, diversity components, advertising channels
- Share info with OWG? Or have in weekly meetings?
- Grow our community and plan for future events
- NCAR SOARS program, incorporate Pythia in content
- OWG first, then bring to larger meeting – invite JC Tuesday the 16th
- John’s notes here.
- Have used Pythia materials for education and are doing similar work for the paleoclimate community.
Agenda (with notes from the meeting):
- LinkedEarth+PaleoCube discussion:
- Cross-referencing and supporting gallery materials
- Uses Pythia’s notebooks, but presented differently; modify for engagement and problem solving
- Interactive code cells built on two-way communication with a binder instance, has limitations
- Infrastructure of interactivity provided by EarthCube TAC, based on this repo based on this project
- Ryan suggests pyscript / jupyterlite / pyiodide explorations
- Identified large hurdle of basic python usage and syntax resulting in tutorials above
- OWG report (Max/Drew/Julia/Kevin T)
- AGU abstracts submitted!
- Focus on reaching different audiences
- No longer the EWG? Just a Pythia meeting
- Celebrate Merged PRs!
- Discussion boards and “How to Find Us” (NOTE, WE DIDN’T GET TO THIS TODAY)
- Including the challenge of having three separate orgs!
- Succinct brochure or slide deck about Pythia?
- Discuss Open Issues and PRs
Action items
- Put together example cookbook within next month
- How to best interact with Pythia
- For next meeting, clarify communication channels for contributors and collaborators
- Infrastructure
2022-08-02: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- James Morley / UVU / @jnmorley
- Spencer Carella
- Dylan Stewart / @dylanrstewart
- Kevin Tyle / @ktyle
Agenda (with notes from the meeting):
- Discuss with Dylan and Spencer from Makepath
- Add to ed024 re community-building success and engagement
- Add to ed025
2022-07-28: Quarterly Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Max Grover / Argonne / @mgrover1
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Kevin Paul / NCAR / @kmpaul
- Dylan Stewart / makepath / @dylanrstewart
- James Morley / UVU / @jnmorley
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Thanks and best wishes to Kevin Paul!
- JC: Writing assignments for NSF Project Report due Aug 12
- JC: Feedback to Julia on AGU abstract by Friday, July 29?
- JC: We should discuss AMS abstract at next week's meeting
- JC: Elena Romashkova, a new CGD associate scientist, will be joining us in Sept.
- Thanks James Morley!
Agenda (with notes from the meeting):
Kevin Paul transition
- “Binder on HPC” → “Binder for HPC”
- Kevin Tyle will take over this project, and it will be based on a cloud-based deployment of Binder on JetStream2
- We will back-burner the HPC BinderHub work that Kevin Paul started (here) for now and return to it if necessary.
- Justification: More and more HPC systems are moving their traditional HPC configurations to dual HPC-cloud or even more pure cloud setups. Thus, we believe that using the more canonical cloud-style deployment model for Binder in such an environment is the right approach to pursue.
- Some things to consider with this shift:
- Data should be “local” (or close) to the compute. For small datasets, streaming data from a more remote location is fine (e.g., THREDDS or Globus). For large datasets, compute should happen near the data. But it doesn’t matter where Binder is located, or even where your JupyterLab instance is running! What matters is where the Dask workers are running! We should explore new ways of launching remote Dask clusters. Then we can run our notebooks on MyBinder and launch Dask clusters on the remote system.
- Dask Gateway could provide this mechanism; not sure how Dask Gateway works with authentication
- I would like to look into this in my free time, but it depends on my capacity
- Other IWG transition topics/questions
- projectpythia.org now owned by Joan Fisher here at NCAR; still trying to figure out how Joan can actually pay the fee, but the next $12/year payment isn’t due until January 2024
- New IWG chair: TBD. Hope to have someone in place by next IWG meeting.
Where we are now as we approached the end of 2nd year of our EarthCube award
- All major deliverables met with the exception of “real time display of metrics”
- Some work remains to improve cookbook support
- Foundations: done!
- Cookbooks: A solid start. Need to continue to expand gallery, hopefully with help from external contributors
- Advanced or specialized topics (e.g. Dask, metpy, geocat): discussion needed
- Pythia platform - aka “binder on HPC”
- We appear to have made a course correction here.
- This will need to be a focal point for year 3
- Pythia Portal: 10,000 views last year
- Foundatations: 23,000 views last year
- External contributors (PRs submitted): 5
- Twitter followers: 405
- Youtube video views: ~2,500
- Well placed in terms of documentation (contributors guides) and infrastructure to grow community of external contributors
- We have an active outreach group that has been highly successful in building awareness of Pythia
The tentative roadmap for year 3
- Grant deliverables we still need to meet:
- Display of metrics on web site
- Coverage of advanced or specialized topics (e.g. Dask, MetPy, etc.)
- Cookbooks or a new chapter after foundations?
- Expand our cookbook gallery content and supporting infrastructure
- Organize a community “event” (e.g. workshop, hackathon, tutorial)
- Grow the Pythia community of users and contributors
Discussion topics
- How are we doing? How do Pythians feel about the project?
- Don’t lose site of need for sustainability after year 3. Need a “big event” to kick this off.
- RM: echo need for focus on sustainability. May look at pushing metpy material to pythia. Unidata will be hosting event next summer. We should coordinate.
- KT: should start planning event by next quarterly meeting. Look for opportunities for smaller, regional scale meetups.
- What is or is not working well?
- MG: be more proactive in attracting the wider pangeo community
- KP: structure of meetings may make it difficult for people to feel like they can drop in and join. Should meetings be more ad hoc? Figure out how to get people in meetings to feel more included.
- BR: need space for more free flowing discussions. Consider alternating ad hoc and agenda driven meetings.
- DC: Better advertise who can come.
- BR: maybe we need to rebrand the meetings and have and advertised focus for that meeting.
- RM: name “working group” might be scaring people off.
- KP: should we change quick updates to “sharing” or “show and tell”? This would provide opportunity for discuss for new people.
- What is one thing that you would change?
- Are there opportunities that we should pursue that would help achieve our goals?
Action items
2022-07-21: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Kevin Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR @clyne
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KTyle volunteered to facilitate this meeting and Drew to take notes
- Julia will facilitate next meeting, KT will take notes (DC if KT absent)
- SciPy sprints demonstrated recipe construction process
- If approved, these will become available openly (in progress)
- Kevin T is working on putting a recipe together
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- SciPy interactions highlights
- Continued interest in pangeo gallery migration, integrating with pangeo-forge
- Continued integration with pangeo, including consolidating outreach efforts
- scientific-python.org likely better fit for running pyaos mailing list, including encouraging their discussion forums
- JK will be attending NASA TOPS kickoff meeting Monday 8 am MT and report back
- Funding for both AGU and AMS
- AGU deadline Aug 3 - JK will lead coordination
- AMS deadline August 23
- JK has agreed to join AMS scientific python committee
- NSF Project Report - check for writing assignments, due Aug 12
- Kevin Paul transition - what needs to be done
- KT and KP met 7/20: move forward with Binderhub on Jetstream 2 as primary hub (i.e., backburner the hpcbinderhub approach
- Jetstream 2 direction more widely applicable
- Xsede partners and data availability
- Desirability of “on-prem cloud” infrastructure similar to jetstream 2
- Accomplishes project goals of public access to data otherwise only accessible behind HPC gateways, co-efforts with data moving to public pipelines eg globus
- Use annual report to demonstrate existing hpcbinderhub functionality and justify prioritization of focus on jetstream approach
- Metrics - Project Pythia Metrics
- IWG Chair? - TBD by co-PIs
- projectpythia.org - Google Domains maintenance/management
- Has been transferred to Joan Fisher. John C and Julia have admin permissions.
- KT will be added as a domain manager
- Domain set to expire 14 January 2024
- James Morley practice SIParCS talk - 15 minutes
Action items
2022-07-19: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Agenda (with notes from the meeting):
- Pythia blog post on there??
- Kevin, Damien, Scientific Python org working together to get an atmos section on the discourse forum
- Opportunities for joining efforts with Pangeo + Scientific Python
- NASA TOPS Curriculum meeting
- 8 AM MT Monday July 25
- Registration - register
- Julia attending
Action items
- DC, KT nudge for previous outreach efforts
- Work on blog post summarizing Pythia’s presence @ SciPy in Scientific Python blog (Max)
- Attend the Pangeo meeting - expand our outreach group
- Brainstorm AGU sessions to submit to
2022-07-15: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- James Morley / UVU / @jnmorley
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Notes today: JC
- JC: PIs are discussing what to do about Kevin’s departure. Stay tuned.
- KT: BinderHub now working on Jetstream2!!
Agenda (with notes from the meeting):
- Announcement from Kevin Paul:
I will be leaving NCAR. My last day is July 29. After that, I will be taking a new position at NVIDIA.
My hope is that I will still be able to continue to contribute to open source projects, as I know many of my colleagues at NVIDIA already do. So, I hope that I will be able to continue to contribute to Project Pythia and Pangeo, in general. However, I do not know at what capacity I will be able to contribute.
Hence, I will be stepping down as the chair of the IWG. I leave it to the team to decide who should replace me.
Please note that while I will be leaving NCAR, I will still be reachable on GitHub with the same username.
Some items I think that the team needs to consider with my departure:
- Who can take ownership of the Google Domain (projectpythia.org)? Taking ownership means having your name, address, and email address associated with the domain. It also means providing a payment method (credit card) to pay the fee for the domain every year ($12). This is not urgent. $12/year will not break me. - JC and JK will connect offline on this
- Binder for HPC: I have written more about this below, and I am happy to give a demo of what I have so far at a future meeting, but someone from the team should consider contributing to or taking over this project. The current code can be found at https://github.com/ncar-xdev/hpcbinderhub. Naming a lead for this project/effort is also not terribly urgent, though we do have a schedule to keep. - one more IWG meeting before KP leaves, set up a separate meeting for IWG if that isn’t e nough
- A demonstration using Docker-Compose has been developed. Documentation can be found in README.md files throughout the repository.
- Working features:
- BinderHub authenticates users with the existing JupyterHub
- BinderHub builds images using repo2docker with the repo2podman plugin engine (i.e., it builds Podman images that are HPC friendly)
- BinderHub pulls images from a local Docker registry
- Some pieces are still missing:
- Newly built images are not pushed to the local registry
- Images cannot be launched with the existing JupyterHub spawner
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- So many from the hackathon in Cookbook repositories!
- Cookbook infrastructure: auto generating yaml file
- Previewing and testing of cookbooks
- Open issues in each cookbook repo if the template has changed since they were created?
- JM fetched changes from template, added it as a remote and pulled in changes, created a merge conflict and had to manually address them.
- Does GitHub have support for change between repositories
- Just provide diffs between template and revisions to template
- Use versioning of the template?
- Infrastructure vs content
- Working on binder hub on jetstream2
- Have a functional binder hub on jetstream2. Early release.
- 2i2c binder hub used during hackathon
- SciPy is going great! More fodder for Cookbooks, 3D visualization package called pyVista with Xarray backend. Learned about ipywidgets.
- Presented yesterday. Tech issues during demo, tour of Pythia not as extensive as he’d like it to be.
- We will have options: jetstream2, 2i2c, Pythia Platform. Testing is on GitHub Actions, can we move to a binder hub?
- CMIP6, CESM LENS, and Physical Oceanography material moved from Pangeo Gallery
- Still some data access issues
- Data being moved to a publicly accessible
- Or use new data (RF looked into this)
- Pangeo forge (cloud-optimized gridded dataset recipes, specify what data you want and where it lives, and request it), code sprint at it on Sat. KT may try to migrate the data during the sprint.
- More radar content from MG
- Intake cookbook content by JM
- Discuss review process for Cookbooks
- Getting Cookbooks#33 merged
- Cookbooks#35 capturing barriers and potential solutions, discuss - Closed
- (BR opinion) We should encourage people to contributing new CHAPTERS (recipes?) to existing Cookbooks, as opposed to defaulting to creating entirely new Cookbooks. It will be an easier pathway for new contributors and avoid unnecessary proliferation of very thin Cookbooks.
- Do our Contributors Guides make this clear? Do we need more improvements to those Guides?
- “Submit a new Cookbook” button on gallery page. Should add button for removing cookbook - low priority, add a line to contributor’s guide
- Document an Infrastructure Guide - helpful, meeting with Kevin
- What does only KP know how to do? - sphinx-pyhtia-theme
- Potential solution being explored here, changes to the conf or thumbnail in a Cookbook
- Use GitHub Actions whenever a change is made to the main branch or update daily - do we have daily builds in place yet?
- Cookbooks are currently manually added to a txtfile (cookbook_gallery.txt), which takes info from the Cookbook’s _config.yml. Updating the gallery yaml with the “Submit a new Cookbook” button opens a PR that can be reviewed by Project Pythia members. - working on this
- Discuss Open Issues and PRs
Action items
- IWG discussion topics:
- Seek volunteers (or guinea pigs) to cycle leading EWG and running notes
2022-07-07: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Kevin Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- James Morley / UVU / @jnmorley
- James Munroe / 2i2c / @jmunroe
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Rotating chairs for EWG meetings! Drew C will chair today.
- KT will volunteer to chair on 7/21 (BR will be absent on PTO)
- Previous runner of meeting takes notes at next meeting?
- Congratulations on a successful hackathon!!!!
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- SciPy deliverables and presence
- Ktyle presenting on Project Pythia Wed afternoon
- Birds of a feather session by Pangeo group
- Bring stickers!
- Live demonstrations
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- So many from the hackathon in Cookbook repositories!
- Cookbook infrastructure: auto generating yaml file
- Previewing and testing of cookbooks
- Open issues in each cookbook repo if the template has changed since they were created?
- HRRR AWS cookbook by KT
- Working on binder hub on jetstream2
- 2i2c binder hub used during hackathon
- We will have options: jetstream2, 2i2c, Pythia Platform. Testing is on GitHub Actions, can we move to a binder hub?
- CMIP6, CESM LENS, and Physical Oceanography material moved from Pangeo Gallery
- More radar content from MG
- Intake cookbook content by JM
- Discuss review process for Cookbooks
- Getting Cookbooks#33 merged
- Cookbooks#35 capturing barriers and potential solutions, discuss
- (BR opinion) We should encourage people to contributing new CHAPTERS (recipes?) to existing Cookbooks, as opposed to defaulting to creating entirely new Cookbooks. It will be an easier pathway for new contributors and avoid unnecessary proliferation of very thin Cookbooks.
- Do our Contributors Guides make this clear? Do we need more improvements to those Guides?
- “Submit a new Cookbook” button on gallery page. Should add button for removing cookbook
- Document an Infrastructure Guide
- Potential solution being explored here
- Use GitHub Actions whenever a change is made to the main branch or update daily
- Cookbooks are currently manually added to a yaml file (cookbook_gallery.yaml), which takes info from the Cookbook’s _config.yml. Updating the gallery yaml with the “Submit a new Cookbook” button opens a PR that can be reviewed by Project Pythia members.
- Discuss Open Issues and PRs
Action items
- IWG discussion topics:
- Seek volunteers (or guinea pigs) to cycle leading EWG and running notes
2022-07-05: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- James Morley / UVU / @jnmorley
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
Action items
- Reach out to James Munroe about hackathon experience
- DC, KT nudge for previous outreach efforts
- Scientific Python outreach BOF session at SciPy 2022
2022-06-30: Infrastructure Working Group - CANCELED
Meeting link:
Attendees: [Name / Institution / GitHub Username]
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Post-Hackathon Discussion
- What is still in progress?
- What still needs to be done?
- Infrastructure for new cookbooks? What is still needed?
- Automated previews of new cookbooks is hard. ReadTheDocs, Netlify, and Vercel all can do it, but each new repo needs to be manually configured/setup. There appears to be no easy way to, for example, set up “previews for all repos in an organization.”
- On a related note, we have multiple “Netliify” admins (Julia, me, and Anderson). There is no way to “collaborate” on all of the different Netlify sites without paying for an upgraded level of service. Should we consolidate all of our sites with 1 admin?
- Discuss Open Issues and PRs
Action items
2022-06-23: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / John Clyne
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- James Munroe / 2i2c / @jmunroe
- James Morley / UVU / @jnmorley
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: I had a meeting with members of NOAA’s Center for AI. They are developing AI training materials, and share many of our same goals. Next steps: invite to an infrastructure meeting to exchange ideas.
- JC: Meeting with NCAR’s Education, Engagement, and Early Career Development team to discuss how they might help us with penetration into universities.
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Earthcube - John and Max presented the posters
- General Pythia, Cookbooks
- Mind the Gap - same time as Earthcube, Kevin engaged with community here
- Quite a bit of awareness already, people are teaching classes with Python + geosciences
- Instructors already put together material - good to invite some of them to submit cookbooks with this content
- Machine learning cookbook? Chat with Sue Haupt from NCAR about this
- This needs some testing and refinement!
- Hackathon Prep - June 27-29, 2022
- New subdomain for cookbooks?
- Current status of build and test infrastructure on GitHub actions (BR)
- There are ~15 registrants, ~10 in-person
- There are a number of people with little prior pythia experience (e.g. GeoCAT team members, student interns)
- One hour planning session on Monday morning - who attends?
- How do we organize?
- Can we identify Issues for people to work on?
- Can we solicit ideas for things people want to work on?
- Can we support group efforts?
- Handle hybrid?
- Discuss Open Issues and PRs
Action items
- RF: Open issues for hackathon and mark with milestone
2022-06-21: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- James Morley / UVU / @jnmorley
- James Munroe / 2i2c / @jmunroe
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Presented general pythia + cookbook posters
- Post posters on the twitter account
- Side by side, pngs
- Add to the drive and talks page
- James popping in to see what Pythia outreach is all about
- Education + outreach focus right now - cookbooks
- Last week - overlapped with Earthcube (Wednesday - Friday)
- 2nd workshop, NSF sponsored
- 2 largest groups - university people, private sector, some students
- Broke out into groups
- Part of programming + data viz subgroup
- Folks teaching dynamics classes
- A lot of people already doing Python + data science
- Strong awareness of Pythia
- Folks already have nice set of notebooks for their classes
- Source of future cookbook material
- Github presence from ideas from Mind the Gap
- Kevin creating repo org for now - lots of interest
- Devote portion of time at education working group for this
- People using the content
- How to copy over + host content?
- Coming up soon - try to emphasize cookbooks
- Cookbooks - how to pitch these?
- Different pitch for different people
Action items
- MG/KT - Add posters to drive
2022-06-16: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Meeting was canceled due to conflict with the EC meeting this week
Agenda (with notes from the meeting):
- Hackathon Prep - Review planning document - June 27-29, 2022
- Discuss Open Issues and PRs
Action items
2022-06-09: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- James Morley / UVU / @jnmorley
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Kevin Paul / NCAR / @kmpaul
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Facilitator for today’s meeting?
- JC: Earth Cube annual meeting next week. Thanks Kevin T. et al for putting together our poster. Please get feedback to Kevin T. ASAP (I’d like to send poster to printer by COB today). Link to poster
- BR: Welcome back Abigail Bodner, who will present updates on plans for a Climate Academy (see below)
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Efforts to reach out to projects with similar work, EDSBook and Data Umbrella
- Data science content focused around reaching underrepresented groups
- Webinars focused around python ecosystem, would be good group to reach out to
- Next week is Mind the Gap Workshop 2.0
- Lots of talk about curriculum development, good place to bring Project Pythia to attention of attendees
- Mind the gap website
- Everyone who will be attending the hackathon (in-person or virtual, full-time or part-time) please fill out the registration form.
- Albany travelers all squared away?
- Monday, June 20 is an NCAR holiday (Juneteenth)
- BR: can we post an agenda somewhere public? Needed for travel accounting purposes
- RF: in contact with Kristi at NCAR, arriving June 19
- Heather Craker, GeoCAT team
- Other TBD?
- Review planning document
- Hackathon planning session in advance of event?
- We will devote the WG meeting two weeks from today for hackathon planning
- Abigail Bodner on Climate Academy
- ~ 10 min presentation followed by Q&A
- Climate Academy as a first instance of new “Sciencematch” initiative growing out of Neuromatch
- Sciencematch provides technical and logistical support
- Including funding – though CA will advise on climate-specific opportunities
- CA will focus on course-specific responsibilities
- Course design, curriculum, hiring TAs, etc
- Foundations book and Galleries are valuable resources
- Build on existing content and connect with other climate programs
- Team experience with building these kinds of resources
- Looking to recruit a core team by summer 2022: vision and general framing of curriculum
- Identify climate-specific funding opportunities, advise Sciencematch
- Recruit members to produce content (fall)
- Produce tutorials (winter 2023)
- Recruitment Spring 2023
- First summer school summer 2023
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- Hackathon registration form! (everyone who hasn’t done it)
- JC+BR: draft agenda
- Comments on EarthCube poster to KT
2022-06-07: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Drew Camron / Unidata / @dcamron
- James Morley / UVU / @jnmorley
- Kevin Tyle / UAlbany / @ktyle
- James Morley
Agenda (with notes from the meeting):
- PyArt: reviewed Max’s draft; Max will incorporate suggestions
- Foundations: Kevin will make draft available Wed. AM
- Connect with authors and/or Turing Institute folks, seems like a lot of work could be shared
- Explore Data Umbrella opportunity for representing Pythia or contributing to
Action items
- Drew will reach out to the Env Data Science Book / Turing Way folks*
- Kevin will reach out to the Data Umbrella folks*
- Add Env Data Sci Book to resource gallery
* pending discussion with EWG this Thursday
2022-06-02: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Orhan Eroglu / NCAR / @erogluorhan
- James Morley / UVU / @jnmorley
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KT: provisioning VM’s on Jetstream2; goal is to deploy a Binderhub instance
- JC: Meeting schedule for June 17, 9am MT, with NOAA Center for AI to discuss using Pythia to support their educational/training needs.
- JC: EarthCube annual meeting poster deadline: June 10. Who is creating the Pythia poster? Also need a printed poster. John will present. ~100 registrants.
- JC: EarthCube sustainability report available here
- JC: Fate of Earth Cube Council for Funded Projects (CFP) is TBD as earth cube program winds down.
- JK: Working with Lily Kailyn. Walking web tree spreadsheet of Foundations
Agenda (with notes from the meeting):
- Hackathon Updates - June 27th-29th
- CISL admin team should be reaching out to Albany today to start making travel arrangements
- Possible additional (remote) invitees:
- Elena Romashkova (Max G., version 2.0)
- Dan Adriaansen, NCAR/RAL
- TODO: Create an issue on the Portal
- Cookbook Check-in / Status
- Anything not in the hackathon planning document?
- Let us know that your PR was merged here!
- Foundations and Portal and Radar-Cookbook quick links
- Foundations 289, 290, 293
- Portal 252, 254, 256
- Radar Cookbook 5, 7, 8, 9, 10, 11, 12
- Julia gave an overview of the recent changes to the portal
Action items
- KP: Create an issue on the Portal with links to the GH ProjectPythiaTutorials org
2022-05-26: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Max Grover / Argonne / @mgrover1
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JK - Rotate facilitators to empower more voices in meetings?
- standing quick update to volunteer to facilitate the next meeting
- Drew and Robert to do at some point
- JC: Possible collaborations with NOAA Center for AI and NCAR/RAL Python Affinity Group. Setting up meetings for next week.
- MG: Updated the radar cookbook with overview + new read from AWS section in example
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Use a bot to close old PRs on Sandbox GitHub repository
- Reply and thank them for engaging?
- Navbar link order? Cookbook refactoring (added badges back)
- #256 and #291
- Automate across conf locations? Still needs to be done to each cookbook
- JupyterBook folk are working on this now! We can use a consistent configuration file across all repos - wait on this for them
- Consensus on this order, change cookbook template
- Apply gallery theme to both pages, iterate more on CSS for resource gallery for larger screens. 2 vs 1 column
- Travel update
- Health and safety requirements, and building security
- Draft planning document
- Advertise to Pangeo community?
- Cookbooks previews action
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- Assign facilitator for next meeting
2022-05-24: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / Argonne / @mgrover1
- James Morley
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Julia giving tutorial on Thursday at 3PM eastern / 1 PM Mountain
- Can use some help in the chat (probably installation issues)
- Tweet out a reminder about tutorial
- Tweet out a call for Cookbook ideas - once we have one in place
- Drew mail James a sticker stack
Agenda (with notes from the meeting):
- Overview and Radar cookbook during same session
- NASA TOPS Curriculum Team
Action items
2022-05-19: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR @clyne
- Ryan May / Unidata / @dopplershift
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Brian Rose / UAlbany / @brian-rose
- James Morley / Utah Valley University / @jnmorley
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Welcome James Morley!!
- JC: Hackathon update: We have a room (June 27 - 29), but we’ll need to explore other travel support options. More discussion below
- JC: Exploring collaboration with NOAA Center for AI
- Seminar yesterday, expressed interest in leveraging Pythia!!
- JC following up
Agenda (with notes from the meeting):
- Are we inviting any external (to UCAR) attendees (beyond U. at Albany)?
- Ryan Abernathey? Max? Anderson?
- Heather (UCAR employee but remote), James?
- Keep it small and targeted
- Registration form coming soon
- Room has a chrome box and only supports Google Meets
- Do we need additional breakout rooms
- We should have additional breakout space (John’s conference room, others?)
- Other considerations?
- June 27th-29th
- What infrastructure do we need for the Cookbooks to unblock progress?
- Use projectpythia.org domain for all cookbook sites
- Template - MG creating a template
- Where is the content hosted?
- Host it under our domain – easy
- Adapting the content to fit our template
- Requires some rewriting and reorganizing – harder
- Testing and execution of cookbooks - hard & critical!
- Need CI on Pangeo servers, health checks
- This isn’t just a website to host static pages, but a resource
- What needs to happen before hackathon?
- What is missing from Max’s cookbook?
- Health checks?
- Can run from everywhere
- Can build and test via GitHub actions
- Some cookbooks that run on HPC may not have internet, how to do CI testing
- How to test for successful execution? How to notify when it fails?
- Goal to prototype this for one notebook before hackathon?
- Binderbot!
- How to check for accurate output, not just completion?
- One notebook hosted on correct domain KP + MG
- Add CI build and testing - BR + MG
- Add badges that show status checks to gallery JK (model after Pangeo gallery http://gallery.pangeo.io/index.html)
- Contributor’s Guide (rough draft before hack, but also polished product of)
- Template
- A 2nd notebook to test contributor’s guide and template by a different author BR (Chris work with MIRROR2 reanalysis data of polar heat fluxes)
- Hackathon prep meeting, agenda formation etc
- Create asynchronous Google doc with goals and issues
- Still want filtering - option to not display failing ones, domain - JK
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- BR made cookbooks temporary page: #241
- JK changed syntax/documentation to be consistent and specify “resource gallery” over “gallery” in #284 and #246, and fixed GitHub “open a resource” workflow to match #19
- JK added cookbook navlink in #285 and #247
- Updates to contributor list: Added James to contributors #250, Anderson’s affiliation #252, and Max’s #254
- Discuss Open Issues and PRs related to Cookbooks
- #243 - Cookbook gallery generator - awaiting review
- This is a first step, it can be customized more and then needs refactoring
- Can add more sample cookbooks to cookbook_gallery.yaml
Action items
- Reach out to Ryan Abernathey
- JC find additional rooms and send hackathon form
- Add CI testing to cookbook - MG/BR
- Add filtering to cookbook gallery
2022-05-12: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Trying to nail down a room for June hackathon. CVP (CISL Visitor Program) applications are moving through system. (external people allowed in meeting rooms?)
- JC: Presented Project Pythia overview at DOE/ARM Open Science workshop
- MG: Much of the Pythia material is being used at the DOE/ARM workshop
- BR: Thanks Julia for chairing today’s meeting, I am attending a different workshop
- BR: I promised one last read-through of the GitHub content for consistency before we call it done. Still on my to-do list (this is a quick non-update)
- JK: Project Pythia foundations material during AMGeo/Earthcube hackathon
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Lots of presentations this week (open science, hackathons, AMGEO)
- Lots of tweets scheduled
- Looking forward to opportunities
- Add CI of building book on Windows Mac and Linux (once a week, not every PR to check for health issues)? – bring up during IWG
- Identify some baby steps towards hosting first cookbooks?
- JC: Trying to nail down a room for June hackathon. CVP (CISL Visitor Program) applications are moving through system. (external people allowed in meeting rooms?)
- Don’t buy flights yet - hopefully will know soon
- What needs to be done for it to be successful?
- Advertise outside of team? - this event prepares us to open up, so no advertising yet. 3 days with team.
- Summer 2023 Pythia event - brainstorm during hackathon, extend past end of Earthcube grant.
- JK added “Cookbooks” label to issues’
- Try to identify infrastructure needs in advance to start the project (next IWG meeting)
- JC: need a cookbook in place first to highlight gaps.
- MG: cookbook runs, environment builds, need gallery – and submitting early next week Cookbook Repository
- BR: Top Nav bar change word “gallery” because we’ll have multiple galleries. Change to “resources” and add “cookbooks”
- Open some issues or discussion items
- Once template in place, migrate Pangeo cookbooks over
- Invite Ryan Abernathey to hackathon, loop him in.
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- Open issues to move Cookbooks forward
- BR: Final review of GitHub content
2022-05-10: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Julia - finished scheduling all resource gallery tweets (weekly through next Feb)
- Doesn’t include Pythia foundations content - separate tweet per section once polished
- Look at other categories of Tweets
- Twitter Schedule
- Kevin supported the Argovis hackathon pre-workshop using Pythia resources; seemed to be a success!
- Max - IOOS/GLOS workshop shared resources with Ocean community (NOAA and private industry)
- John Clyne presenting at RMSAR open science workshop
- Julia - AMGeO/EarthCube 2022 Interactive Workshop this week, hackathon support - https://amgeo.colorado.edu/
- Julia - present Project Pythia at ESDS forum
- Late June hackathon (in Boulder?) hackathon
- Meet James next Monday the 16th!
- 3 issues opened by phil-blain - who read through and gave good feedback. Ping them on the PR and ask if they want a sticker. - Drew Camron
- After the grant??? How to maintain
2022-05-04: Infrastructure Working Group
CANCELED
2022-04-28: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / clyne
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Chris Cardinale / UAlbany / @cjcardinale
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Quarterly meeting TBD
- Was supposed to be this meeting, when to reschedule? Line up with hackathon?
- KT: ATM350 sticker distribution! (class wrapping up, described Pythia to them)
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Pre-summer conference prep
- Poster for Earthcube
- Talk for SiPy (July 10-17)
- Mid/late summer hackathon + pre-SiPy hackathon (the same one works?)
- Things to Tweet about!
- MetPy/Unidata AMS short course
- Lots of new PRs merged recently, what’s left to do?
- GitHub Gang of Four?
- Task for James Morley
- Invite ARGO-Viz people to look at our GitHub content (They have a GitHub hackathon coming up)
- Looking forward to closing issue
- BR and RF do another read-through
- Week of June 27? Seems to have the fewest conflicts
- Scope and goals for the hackathon?
- Format: remote, hybrid? (BR coming to Boulder?)
- A few weeks before SciPy - get community buy-in by contributing to cookbooks, so one goal is to have enough cookbooks (and infrastructure/template) in place
- Cookbook focus!
- Try to have infrastructure before hackathon
- How ready will we be for contributors?
- Point of in person, NCAR still mostly remote (by choice)
- Need to act quickly to secure rooms if in-person at Mesa Lab
- Aim for 3+ days (MTW)
- BR will check with UAlbany research administrator re: travel budget
- Inviting people to learn vs inviting people to work
- Summer 2023 community event, training and development
- Strengthen connections to GeoCAT? MetPy, partner efforts
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- Re-read GitHub content, final review for now
2022-04-26: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- SciPy Birds of Feather proposals due 6/15
- TAC Hackathon Friday 4/29
- Poster prep for EC Meeting
- Scipy talk prep
- Pre-scipy hackathon
- Early plans to merge Pythia efforts at AMS Student Conference; no short course
- Advocate for low-stakes feedback on cookbooks
- Non-conference engagement?
- Tie twitter presence and medium post to early summer cookbook development
- Keep in mind expert reviewers for cookbook development and presentation
2022-04-21: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Ryan Abernathey / LDEO / @rabernat
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Pythia PIs, please visit and complete the list of peer-reviewed publications reported to the NSF available here
- JC: Where in the repo should digital assets such as source files for logos, ppts, etc. be kept?
- KT: Submitted trial allocation request to Jetstream2 / XSEDE
- KT: (sidelight: absolutely awesome Cloud-Native Geospatial Outreach workshop earlier this week: video playlist being populated here)
Agenda (with notes from the meeting):
- Discussing the future of the Pangeo Gallery and Project Pythia
- Pangeo is a large umbrella organization covering many different projects, and looking for ways to better sustain its efforts
- Specific Idea: What will happen with the Pangeo Gallery?
- Proposal: Retire the Pangeo Gallery and port material over to Pythia
- Links:
- Pangeo Gallery is a Sphinx cite with lots of example notebooks to view statically
- Intended to contain *real* examples, not just overly simplistic ones
- Many notebooks have binder links, but had to turn off Pangeo BinderHubs due to bitcoin mining
- mybinder has lots of people/resources to monitor this, but mybinder doesn’t allow dask cluster
- Pangeo Binder has more ram and can launch Dask clusters
- Solves the problem of assuring that the notebooks are “healthy”
- Provides a CI utility that runs the notebooks inside Binder and then pushes out the built Binder
- Aggregating all different binders using git submodules pointing to external repositories
- It is very minimally maintained.
- Is there a path where we can merge the Pangeo Gallery content into Pythia?
- Pythia Cookbooks seem to be exactly what the Pangeo Gallery is trying to provide, though the cookbooks are still in their infancy
- But Pangeo Gallery has a lot of infrastructure that can be useful for the Cookbooks
- On what resources do we expect cookbooks to run?
- Pangeo Gallery is a proof of concept and can be made more robust
- Funding history for the Pangeo Jupyter Hubs is based on grants that are time-limited, and future funding is uncertain
- Can NCAR keep the JupyterHubs maintained or funded (2i2c)?
- Safe to say there will be JupyterHubs, but it will be more of a federation…
- Even mybinder is limited on funding and could use help
- Want to share everything we have, regardless of whether it can only be run on specific resources, and we may not be able to maintain a general purpose resource for everyone to use
- We should try to keep all content in Pythia galleries in a healthy state
- Not in scope of Project Pythia to figure out how to provide open compute resources to the world; we just need to provide the CI solution to keep the content healthy
- use JHub API to ensure that notebooks work on the given hub (any hub)
- ACTION ITEM: Test out a simple CI workflow for the radar cookbook based on the BinderBot example
- Another idea: use a GitHub self-hosted runner to provide exactly the resources/configuration you need
- Is there a way for external contributors to easily provide content without needing to know how to build a cookbook?
- Need to think about how people can contribute content that meets our standards
- …which is exactly the Pythia goal.
- NEXT STEP: Let’s try moving over some subset of the Pangeo Gallery notebooks into cookbooks
- What can we do that adds enough value that entice people to contributing?
- Discuss Open Issues and PRs
- Use Pangeo Hubs?
- PRIORITY:
- Need to pull data from Google Analytics API
- Needs more thorough testing
- PRIORITY: MEDIUM
Action items
2022-04-14: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Paul / NCAR / @kmpaul
- Max Grover / Argonne / @mgrover1
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Max - Put together a first cut of the “Radar Cookbook”
- JC - Supporting AMGeO Hackathon Pre-Hackathon Office Hours: May 10th (Tuesday), 12pm-3pm MDT. Need a volunteer - JNK
- Julia will staff. John will connect Julia with the AMGeO folks
- Ryan Abernathey will join next week (confirmed for the 21st, at least!) to discuss pangeo cookbooks
- Jetstream2 is starting up - KT will request an allocation to kick the tires in his role as XSEDE Campus Champion
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- ARM/ASR abstracts due Friday - anyone interested in presenting on Pythia?
- Planning on having tutorials with Pythia content, would be great to have an overview of Project Pythia too
- May 10-14
- Thoughts on a workshop at AMS? Student workshop? Can we work with the Unidata team on this?
- Put on agenda two weeks from now
- No deadline for “student workshop”
- Possibly include Pangeo peeps to help with cookbook migration
- Hybrid, in person, remote?
- SIParCS intern: projects to work on
- Go through all Pythia materials, spin up on everything while also making notes about improvements
- Open PRs to address any improvements
- E.g. open PR on Markdown
- May get one or more Unidata interns to parallel James in going through Pythia
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- BR: will start the process of collecting people’s availability for a summer hackathon
2022-04-12: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Kevin reaching out to Damien, Scott about PyAOS mailing list
Agenda (with notes from the meeting):
- Should reach out to Damien about coming to an Education Working Group meeting
- Max helping with IOOS hackathon, ARM/ASR Open Science workshop tutorials
- Action item - bring up idea of having the student workshop be hosted by “Project Pythia”
- Use the current content, could have interns help present
2022-04-07: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Ryan May / Unidata / @dopplershift
- Max Grover / Argonne / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: TAC hackathon office hours for “Argovis” group. Friday, April 29, 2-3pm MT. Need someone to staff.
- KT: Jetstream2 to be launched in next few weeks
- MG: Created the “Radar Cookbook” under Project Pythia Tutorials
- JC: argovis team (Donata Giglio, CU) is interested in contributing some of their material as cookbooks
Agenda (with notes from the meeting):
- Discussing the future of the Pangeo Gallery and Project Pythia
- Reschedule?
- Walkthrough new Radar Cookbook
- General cookbooks discussion
- Should cookbooks be walled off? How can we clearly label cookbooks/notebooks that require special credentials?
- Can we filter on free-access notebooks?
- What are cookbooks? Should they be general purpose (i.e., broad audience) or should they be specific use cases (i.e., narrow audience)?
- Maybe we shouldn’t be responsible for deciding what content is “of broad interest” or not?
- Perhaps we shouldn’t be too focused on trying to define what content a cookbook should have
- What is a cookbook?
- a JupyterBook
- adheres to a certain template
- makes an attempt to link to Pythia Foundations content
- attempts to be openly accessible
- A cookbook is an example of a tutorial and example; additional content that extends the Foundations content
- Need to keep track of the maintenance costs for these cookbooks (i.e., we own them)
- Should there be a categorization of content that exists between Foundations and specific examples?
- Perhaps the cookbooks contain their own “foundational” content, too?
- Keep in mind that the existing Foundations took a year to get to its current state!
- Perhaps we can’t make a decision, yet, until we get some cookbooks to look like
- A gap that we may want to consider addressing includes Dask, interactive plotting tools (Bokeh-based like hvplot), …all are kinda “advanced Foundations content”
- TODO: Amend the template to include sections like “Foundations” and “Examples”
- TODO: Add some language on “what is a cookbook?” (see above)
- KP: Look at theme of cookbook template and match topbar
- I actually don’t see a problem in the version deployed on GH Pages…?
- SIParCS Internship starting soon!
- Time to start thinking about good projects for the intern!
- Building more cookbooks?
- TODO: Create an issue to start discussion internship project ideas
- Discuss Open Issues and PRs
- Use Pangeo Hubs?
- PRIORITY:
- Need to pull data from Google Analytics API
- Needs more thorough testing
- PRIORITY: MEDIUM
Action items
2022-03-31: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- SIPARCS: Computing resource request must be submitted by end of today
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Thoughts?
- We had some discussion about where we expect cookbooks to be hosted. The current template will publish the cookbook to the individual user’s github pages space. At some point we would like to have a hosted gallery of cookbooks as part of the Pythia site
- Deferring any decisions about this until after our conversation next week with Ryan Abernathey about the fate of Pangeo Galleries
- We’ve lost a bit of momentum here… Can we get past the finish line?
- GitHub content revisions
- Julia is on PTO this week, is the lead on these revisions. We’ll assess after she’s back.
- Still in draft
- Drew will be working on this after the current MetPy release push
- See the discussion from last year
- Re: logo, John will inquire with NCAR design team about a Foundations logo that is consistent in style with the main Pythia logo
- Brian might tinker with a simple design that just adds the word “Foundations” to the existing “pillars” logo
- Another option is to outsource logo design to fiverr
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
- To be assigned to our SIPARCS intern, will be a good warm-up exercise for learning how to submit PRs to Pythia repos.
- Devote time at EWG or OWG specifically to talking about Cookbooks?
- BR expressed the view that we should “Finish Foundations” specifically so that we can “clear the decks” to have more time to devote to Cookbooks at future EWG meetings.
Action items
2022-03-29: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Reached out to Jetstream folks, working with Unidata on this for Science Gateway Jupyterhubs
Agenda (with notes from the meeting):
- Should reach out to Damien about coming to an Education Working Group meeting
- Create a pythia-cookbook-template
- Look into pythia-cookbook-gallery
- Delete existing pythia-cookbook repo - create pythia-cookbook-template
- Look into pythia-cookbook-gallery
- Forward on Open science discussion for Thursday - ESIP/NASA
2022-03-24: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- John Clyne / NCAR / @clyne
- Julia Knet / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: EarthCube TAC hackathons: Tentative “office hours”, May 10th, 12pm - 3pm MST. Trying to get more details.
- Stay tuned! Will need a volunteer
- KT: pinged Jeremy Fischer @ Indiana University re: XSEDE/Jetstream2 status
- For Pythia Platform usage / Binder on HPC
- BR: Ryan Abernathey’s request to talk on 4/7/2022 about Pangeo Gallery / Pythia Cookbooks
- Good to sync on Pangeo BinderHubs and what maintenance issues exist with the Pangeo Gallery
- Pangeo Gallery is a prototype of the Pythia Cookbooks, so we need to understand what the requirements are
Agenda (with notes from the meeting):
- Getting feedback / less passive request for feedback (form)
- None. Currently in a waiting pattern with Anderson’s departure.
- Status of Binder on HPC:
- repo2apptainer is in a working state, but it needs testing on HPC systems
- container orchestration options need to be investigated on available HPC platforms (e.g., Cheyenne and Casper), but slurm and PBS container launch capability exists (just don’t know how it works)
- Running a BinderHub-like service on HPC: Is this required? If so, how to do this?
- full test on Cheyenne/Casper of JupyterLab launch from a link is required
- Discuss Open Issues and PRs
- JC: We have lots of resources than what is currently listed on the landing page, such as:
- tutorials
- repositories
- datasets
- cookbooks
- Need to update the entire landing page, now that we have a better main banner
- 2nd section banner is text-heavy and not complete; Get rid of text and present a grid/panels of resources; more graphical and inviting
- More than just updating the landing page; need to provide some descriptive content
- Should include Foundations, Gallery, Cookbooks, Tutorials
- Xdev Tutorials:
- Maybe also link to GitHub organizations (multiple!)
- Maybe get ideas from https://quantecon.org/
- This can be done incrementally:
- Step 1: get missing content on page with new text/links
- e.g., paragraph on landing page and link to Xdev Page
- Step 2: address the aesthetics
- Step 1: HIGH
- Step 2: MEDIUM
- Use Pangeo Hubs (already have some verbal approval from Ryan Abernathey)
- PRIORITY:
- Need to pull data from Google Analytics API
- Needs more thorough testing
- PRIORITY: MEDIUM
- James Morley will be joining us for CISL SIParCS internship to work on Pythia project
- Cookbooks:
- Max is willing to prototype Cookbook gallery (existing repo)
- Start in a fresh repo
- Need to figure out whether all Cookbooks should be under the Pythia umbrella
- Goal is to create material that meets a certain threshold of quality
- Cookbooks are our working examples, so should be owned by Pythia
- Shared responsibility across core devs and people who submit cookbooks
- Let’s not guarantee that everything works all the time, but we aspire to keep everything working and up-to-date
Action items
2022-03-17: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Max Grover / Argonne / @mgrover1
- Rebekah Esmaili / STC+JPSS/ @resmaili
- Amy Huff / IMSG at NOAA NESDIS STAR
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Contributing a "Atmospheric Satellite Data Cookbook"
- Follow-up from AMS meeting with Amy Huff and Rebekah Esmaili
- Background
- Working with Aerosol community, need more Python educational materials
- Workshops, wrote up a book, AMS workshop
- Amy has support for training?
- More scientist focused, more entry level
- Looking to transition to more advanced
- Gave talk at training and outreach session
- Amy part of NOAA - work with end-user community
- Developing an educational program working on satellite data - aerosol, smoke products
- Drew and Max reached out about working on these cookbooks!
- General Cookbook Discussion
- A cookbook would be distinctly different from the Foundations content, such that it covers both:
- More advanced topics
- More specific use cases
- Enable an easier way for contributors to "plug their content in"
- What are examples of possible “cookbooks”?
- Satellite data
- Radar data
- Cloud optimized data formats
- Processing data at scale (Dask, Ray, etc.)
- Station data
- CMIP6
- Reanalyses (ERA-5, CFSR)
- What would be helpful as a collaborator?
- Objective - increase the use of the data, promote correct use of those products
- Connecting with the effort, reaching wider range of end-users
- Plenty of nuances working with particular data, need to improve documentation/use cases
- Larger community!
- Centralized platform
- Build on the content that exists in Foundations
- Connecting STAR/NOAA with this community - open source, open science practices
- Example of a radar cookbook (ARM = Atmospheric Radiation Measurements, indicating what would be developed under their funding/new material)

- How do we build these books?
- Based around juptyer-books
- Utilize some sort of pythia cookbook template
- Basic infrastructure, notebook template
- Where should users be able to run these?
- Not a lot of computation available here - might not want this…
- Pangeo Hub
- Institutional Jupyter Hubs
- Example - satellite data cookbooks
- Files are too large, can’t run on Binder
- Provide the Python code in notebooks, show users how to download the data on their own using Python
- VIIRS
- 2-3 ~100-150 MB files to cover subset of CONUS
- ~1 GB of data to cover entire CONUS
- A bit smaller than that (e.g., 6-10 MB for CONUS)
- Variability in the size of the domain (Full Disk, CONUS, mesoscale)
- Note that 1TB of data costs $23/mo on AWS S3 (additional access and data movement costs could double this)
- Pangeo working on Dask-Binder instances
- Should probably collaborate with them
- Could look at pooch
- General idea - some of these cookbooks will require larger resources
- Need to make sure we support both things running on Binder, and running locally/individual HPC clusters
- Where can users find these books?
- Key difference between regular gallery - linking back to Foundations material
- Do we want each cookbook to be its own standalone book?
- Might need to create a new subdomain
- One big book?
- Content separated by chapters
- Tie binderhubs to the different books
- Ex. python environment for a given repo/book
- One huge environment to rule them all?
- Or individual environments for each cookbook?
- Like the idea of individual repositories
- Won’t have something completely figured out at the start, should start prototyping…
- Recipe in place, see what this looks like
- Can revisit how we can make this process easier
- How can the next group just drop something in here?
- No maintenance put in here…
- Cookbooks are how-to, which is different from tutorial
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- Moving forward on the radar data cookbook prototype - Max
2022-03-16: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / Argonne / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Earthcube abstract submitted
Agenda (with notes from the meeting):
2022-03-10: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Max Grover / Argonne / @mgrover1
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: EarthCube TAC hackathons: Scheduling “office hours” for hackathons. We’ll need to try to “support” these in some way.
Agenda (with notes from the meeting):
- Getting feedback / less passive request for feedback (form)
- Display a banner at the top? Make it toggleable and maybe only display it during critical times
- No infrastructure PRs were merged since the last IWG meeting
- https://github.com/andersy005/repo2apptainer
- Apptainer: formerly known as Singularity
- Could be useful to launch on different HPC centers for Radar Cookbook (for example); or on different university HPC clusters for classes
- This facilitates “moving compute to the data” acknowledging that most data is produced on HPC centers - ideally should be the same everywhere (cloud or HPC)
- Use system-specific abstractions like Dask Gateway and Intake
- Orchestrating container launching on an HPC machine needs some investigation (can launch a container via slurm or pbs, for example)
- Cookbooks discussion in 2 weeks
- What infrastructure do we need for a Cookbook gallery?
- Maybe alternate between Platform issues and other infrastructure issues, if needed
- Good thing to have fewer infrastructure issues/PRs open
- Discuss Open Issues and PRs
- Use Pangeo Hubs (already have some verbal approval from Ryan Abernathey)
- PRIORITY:
- Need to pull data from Google Analytics API
Action items
2022-03-03: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Chris Cardinale / UAlbany / @cjcardinale
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- Rebekah Esmaili / STC, JPSS / @resmaili
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Earthcube abstract due next week
- Contributing a "Atmospheric Satellite Data Cookbook"
- Follow-up from AMS meeting with Rebekah Esmaili (on March 17th, next EWG)
- Max has explicit funding to develop “Radar Data Cookbooks” from DOE/ARM
- Design discussion Tuesday
- Conversation two weeks from now?
- BR is away on 3/17. Can someone else host the EWG?
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Thanks Kevin for submitting the SciPy abstract!
- Other conferences – Mind the Gap
- KT will be part of this meeting, can present on Pythia content
- Earthcube - same week as Mind the Gap
- Abstract
- JC will attend EarthCube, can present for Pythia
- KT will adapt the SciPy abstract to the EarthCube call (due Mar 8, 2022)
- Will ping others to take a look, and then submit
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- #9 PPT/tutorial-template adding prereq section to README template
- #225 foundations - “improve GitHub landing page”
- #227 foundations - Clarify how to find the Binder link
- Foundations Matplotlib content #230 #233 #235 #236 and #238
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
- Improve Portal landing page and About content #215
- After a good asynchronous debate, we have landed on some wording for the landing page that lets users now what Pythia is while also making clear the connection to Pangeo (responsive to Issue #214)
- Preview: https://6220ffe70d64701b8a669ccf--projectpythia-org.netlify.app
- Are we happy with this?
- Do we like the look of the “About” buttons?
- Are we satisfied with the contents of the revised About page
- #206 Collect Pythia presentations in one place
- Markdown content is in that issue
- Add overarching "how to get involved" page #148
- “GitHub Gang” met to discuss on Friday 2/25
- Revision work is underway.
- Getting rid of “Under Construction” #231
- Should we make all Pythia logos easily available (e.g. in a GitHub repo)?
- The google drive is not open to public
- Some versions of logo are NOT in the google drive anyway
Action items
- Open Discussion item re: Cookbooks and necessary infrastructure (MG)
- Open issues for specific items addressed in GitHub discussion document
- Take the Unidata Markdown content and adapt to Pythia format (DC)
2022-03-01: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / Argonne / @mgrover1
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- SciPy 2022 abstract submitted last week
- SIParCS interviews today!!
Agenda (with notes from the meeting):
- Great to be able to use/modify Project Pythia content!
- Where to put the recording?
- Seemed quite a bit ARM-specific, willing to put on ProjectPythia
- New playlist?
- AMS Board on Data Stewardship
- Wants to create Pythia-esque tutorials
- Luke Madaus (Jupiter Intel) Jared Marquis (UND) and Kevin T will have a meeting soon; will invite them to a future EWG after that meeting
- BDS would like to have a resource gallery, whose content includes workflows, reproducibility, cloud-hosted datasets; tutorial videos would be especially useful
- Satellite Data Cookbook - respond!
- Amy Huff - cookbooks
- Good to have a set of questions related to what a cookbook should look like
- Open science workshop - DOE/ARM
- Max is working on a blog post about the highlights from AMS-Python conference
- Will post on the Project Pythia Medium Account
Action items
- Invite Amy to Thursday’s meeting (Max)
- Add cookbooks small discussion to the EWG agenda (Max)
- Add Earthcube abstract to EWG agenda (Drew)
- Abstracts due March 8 (next week)
2022-02-24: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / Argonne / @mgrover1
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- John met with the first TAC hackathon team (Deborah Khider, et al). Of the seven training modules available, they were only interested in 3 Pythia ones. Meeting with the second hackathon team this Friday.
- Helping with github tutorial for ARM/DOE Monday
Agenda (with notes from the meeting):
- Let us know that your PR was merged here!
- KP: Appears that no infrastructure PRs were merged since the last IWG meeting
- Clarifying relationship between Pythia and Pangeo - STATUS
- “Binder on HPC” via nbgitpuller
- Ideas? Could add a third-party package that creates the kernel after nbgitpuller pulls down the notebooks and content
- TODO: Create an issue on this?
- Ngbitpuller addresses the issue that binder (mybinder) doesn’t provide persistent state. Nbgitpuller also keeps state synched, while keeping local modifications made by user. Nbgitpuller also helps with launch speed.
- Discuss Open Issues and PRs
Action items
- Provide @brose feedback on #180 on foundations site: See https://executablebooks.org/en/latest/gallery.html
2022-02-17: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Chris Cardinale / UAlbany / @cjcardinale
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Update on TAC hackathons
- Feedback from TAC team and status of PR 227
- Friday meeting with first hackathon organizers.
- KT: SciPy 2022 abstract to be submitted by tomorrow
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- PyAOS-Pythia Coordination Call report out (see 2/15 notes below)
- Rename PyAOS email list?
- Meet later in the day with Damien and Scott
- Rotating meeting time?
- Take ownership of PyAOS census by AOS
- Getting Started with GitHub content update
- Just waiting on final approval
- It’s time to launch our comprehensive content review, https://github.com/ProjectPythia/pythia-foundations/issues/214
- Small review team should read through individually, make notes on inconsistencies, repetitions, and gaps.
- Come together to reach consensus on revision plans and divide tasks
- JK, BR, JC? Who else? KT
- Clarifying relationship between Pythia and Pangeo
- Ryan Abernathey’s comments here regarding Pangeo website
- Clarifying this on Pythia sites (portal, Foundations) – currently no Pangeo mention
- This is an oversight that we should fix quickly.
- Plan? - Portal Landing Page and About Us, include Pangeo icon
- “We are the educational initiative of the Pangeo community”
- Pangeo needs to link to us too
- BR will edit the landing text on Pythia
- Discuss Open Issues and PRs
- JK plans to take a look at these two, possibly take ownership to get them finished
- Consensus is that the subject is too complicated and not foundational enough, doesn’t belong in Foundations book
- Excellent subject for a future Cookbook
- Consensus is that this is not “foundational” material
- Good subject for a future Cookbook on data formats
Action items
- Read GitHub content (all, but especially BR, JK, JC, KT)
- JK - look at Matplotlib PRs
- BR - schedule meeting for GitHub content review, hopefully before next EWG meeting
- BR - edit the portal landing page, include a link to Pangeo
- BR - investigate moving the 3/3 EWG meeting to later in the day to accommodate Damien Irving
2022-02-15: PyAOS-Pythia Coordination Call
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Damien Irving / CSIRO / @DamienIrving
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Scott Collis / Argonne National Lab / @scollis
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Scott (and the AMSPy program committee) suggests Pythia be the “flagship” title for connections in this space and AMS Python planning and more
- Pythia committees on conferences?
- Communities of instructors and ownership of lessons e.g. Python-AOS carpentries lessons
- Damien asks about contributing to Foundations, and if it’s so far set-in-stone
- Current state of Foundations and future considerations for cookbooks
- PyAOS Census to be planned and hosted by Pythia, aiming for biggest push during AMS 2023
- Kevin offers to host pyaos mailing list via UAlbany listserv
- Having carpentries material easily mirrored to sphinx/jupyterbook for consistency in presentation and deployability
Action items
- Kevin and Damien will work on moving the PyAOS listserv to albany.edu
- Damien and Scott are invited to upcoming EWG meetings (3/3 specifically though any day is ok!)
2022-02-15: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- SciPy 2022 Abstract Deadline extended to Fri. 2/18
Agenda (with notes from the meeting):
- PyAOS call with D. Irving and S. Collis today at 3PM MST/5PM EST
- AMS Board on Data Stewardship virtual meeting last Monday: interest in Pythia
- EarthCube call for abstracts due March 8 - June annual meeting. Looking for notebooks
Action items
- Tweet tweet
- Tweet about feedback form
2022-02-10: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / John Clyne
- Rob Ford / UAlbany / @r-ford
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
- Anderson Banihirwe / NCAR / @andersy005
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Any unfinished discussions from the quarterly meeting?
- SciPy 2022 Abstract Deadline: tomorrow! Link to draft
- JC: putting together internal GitHub training at NCAR. Hope to incorporate our new GitHub content and get feedback
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Kevin Tyle is planning to submit a poster abstract for SciPy 2022 meeting, due tomorrow
- Provisional “yes” to submitting/requesting a BOF and lightning talk, further highlights here
- Send stickers to Albany!
- Review Damien Irving PyAOS mailing list proposal. Reference: 210, 208
- Anything more to discuss?
- Zoom chat with Damien/Scott Collis Tuesday 2/15 3PM MST / 5PM EST
- Need time to discuss?
- 223 (KT needs to incorporate review suggestions and merge)
- Then need to update the general intro … also delete the “Advanced topics” for now?
- Reviewing Quarterly Meeting Goals - Infrastructure
- Resource Gallery and site searching
- Not sure if we need a dynamic site for this or if we can just do this with JS
- Searching other sites? web crawl… Or search just our sites?
- Use existing good search capabilities as well as possible
- Low priority (already have effective functionality with gallery filters)
- Ability to launch interactive Jupyter session from a button on our content pages
- Binder or just link (if not Binderized)
- Features/limitations of Binder
- Public BinderHub (mybinder.org) is slow to launch and only provides a single running Jupyter instance (no Dask parallelism)
- ephemeral (no saving state after closing the instance)
- no authentication / login (only public data and no per-user customization)
- Not tied to a JupyterHub that you can visit ex post facto
- Pangeo/2i2c BinderHubs can also be slow to launch but provide Jupyter instances with Dask backends that have fixed scalability (limited number of Dask workers)
- requires authentication / login (per-user customization)
- saved state between use
- tied to a JupyterHub that you can visit at any time
- Public and Pangeo hubs are launched on a specific cloud infrastructure
- e.g., AWS, GCP, Azure
- There is no HPC version of Binder!
- What is needed for “Binder on HPC”?
- Doesn’t need to be Binder! It can be just a link/button that launches a Jupyter instance on a remote HPC machine
- Does the JupyterHub named-server API allow this? Kind of, but it needs a “username” on the JupyterHub in order to launch the new Jupyter instance…
- Would need either authentication and user profiles on the Pythia websites
- Or it would need a “user prompt” after clicking the button to gather information needed to login on the remote JupyterHub (i.e., no longer single-click)
- Notebook would need to reside on the remote HPC system; and it would need to be in a user space with write permissions
- Jupyter kernel would need to reside on the remote machine, too and available for all users to read
- How to improve launch time
- use pre-build Docker images (no conda/pip installation step)
- Have available instances running ahead of time (requires knowing what kind of instance will be needed…i.e., handy to have only 1 kind of instance / 1 kind of “kernel”/environment)
- Can you pay extra for faster launch time?
- Should be faster than cloud, in principle (little/no virtualization), so need to see a working solution before knowing
- Accelerate the binder launch to improve user experience on our material/sites
- Make examples that use HPC resources possible (e.g., “Binder on HPC”)
- Use the cloud for Pythia specifically (for now), which means using public data and repositories
- e.g., auth can be done via GitHub
- Maybe ask the Jupyter folks if there is anything moving in this space first (reach out to Fernando et al.)
- Maybe go with nbgallery? If execution of the notebooks is the most important thing, then nbgallery could be ideal
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
- KP: Contact Fernando about Binder on HPC
- AB: Continue work exploring Binder on HPC solution
- KP: New Issue: Define how best to containerize our own content
- KP: New Issue: How should we move forward with Pythia BinderHub?
2022-02-03: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne/ NCAR/ John Clyne
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @ brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Any unfinished discussions from the quarterly meeting?
- KT: report from 2/2 TAC Hackathon Planning meeting; website is https://throughput-ec.github.io/ec-workshops/
- JC: The Annual EarthCube meeting will be June 14 - 16 in San Diego. Abstracts are due March 8. Details here.
- JC: Thoughts on Damien Irving’s PyAOS mailing list proposal. Reference: 210, 208
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- EarthCube call for abstracts due March 8 (conflicts with MindtheGap, AOFD meeting), volunteer to review notebook submissions - JC
- Notebooks, talks, how to represent ourselves, poster session
- OWG make a suggestion for next 2 weeks
- SciPy abstracts due Feb 11th
- Review Damien Irving PyAOS mailing list proposal. Reference: 210, 208
- Homework for everyone: take a careful look at https://pyaos.github.io and see if there is content that we can wrap into Foundations.
- Email is legacy, support but don’t put a lot of energy into it
- Plan for polishing GItHub content, work on continuity
- Last chapters to review and merge
- Do it in a hackathon once last sections are in place
- JK, BR, JC take notes, then come together with consensus and divide tasks
- BR will ping us when it is time to move forward on this front
- Reviewing Quarterly Meeting Goals
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
Action items
2022-02-01: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- No more OWG with MG - check in before he leaves about accounts
- Quarterly Meeting went well
Agenda (with notes from the meeting):
- JK update Twitter spreadsheet and schedule tweets for resource gallery
- MetPy tutorial next Wednesday the 9th, schedule a practice for Tuesday
- EarthCube call for abstracts due March 8 - June annual meeting. Looking for notebooks
- SciPy abstracts due Feb 11th, tutorials due Feb 15th, aim for a lightning talk, look for a “birds of a feather” education group
Action items
- Tweet tweet
- Tweet about feedback form
- Meet with Damien to bring information to larger Pythia group
- Email Virginia to send bias training to DC and BR
2022-01-27: Quarterly Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany @ktyle
- Julia Kent / NCAR / @jukent
- Kevin Paul / NCAR / @kmpaul
- Max Grover / NCAR / @mgrover1
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Brian Rose / UAlbany / @brian-rose
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
Kudos to all
What we’ve accomplished
- Portal infrastructure deployed and operational (and looks damn good!)
- Pythia Foundations content
- Growing recognition and interest from the community
What we still need to do
- Pythia Platform
- Advanced content
- More community engagement, more usage
IWG report out - Kevin P.
- Pythia Portal is relatively stable and complete
- The Resource Gallery is usable and easily modifiable, using the JS filtering technique (Kudos to Julia for her work on this!)
- We have our own JupyterBook-compatible Sphinx theme, which is released on PyPI and conda-forge!
- CI / Teams + round-robin reviewer selection / Useful Issue templates all working and in place, providing automation to facilitate community contributions!
- We have a lot of our content Binderized, making it interactive for the user!
- Next Steps
- Still need true tag-based search of Gallery contents!
- Do we need a DB + dynamic web host for the gallery?
- Can we do this with just JS on a static site? And how?
- Perhaps this is a lower priority for now?
- Pythia Platform research needs to begin in earnest:
- Need to investigate a solution to Binder on HPC!
- If we can provide the Binder experience fast using our own dedicated hardware and generalize it to work on any on-prem cluster, we have a solution that anyone can use.
- Also investigate usage of Binder on JetStream
- Need to investigate how to speed up the cloud-based Binder loading process
- e.g., what does it take to make it “instantaneous” or close to it? How much would a service like that cost?
- e.g., can we activate thebe for an entire book, instead of activating for each individual page
- Need to investigate how to provide users with persistent storage space for their educational content
- Would require authentication!
- Metrics from Google Analytics for last quarter:
- In the last 90 days, the Portal averaged around 50 "users" per week for a total of 694 users.
- Of these 694 sessions on the Portal site:
- 436 were direct visits,
- 123 were found from online searches,
- 29 were directs from Twitter,
- 9 were from Medium.com, and
- 6 were from AGU poster links.
- During these 694 sessions, the most popular pages were:
- 1100 visits were made to the main page,
- 482 visits were made to the Resource Gallery,
- 124 visits were made to the "About Project Pythia" page,
- 27 to the Contributor's Guide,
- 5 to the Code of Conduct, and
- 2 to the Team page.
- In the last 90 days, the Foundations averaged around 100 "users" per week for a total of 1100 users.
- Of these 1100 sessions on the Foundations site:
- 464 were direct visits,
- ~420 were found from online searches,
- 173 were directed from Twitter,
- 12 were from Medium.com.
- During these 1100 sessions, the most popular pages were:
- 894 visits were made to the main page,
- 336 visits were to the "NetCDF and CF: The Basics" page,
- 322 visits were to the "Introduction to Pandas" page,
- 239 visits were to the "Introduction to Cartopy" page,
- 194 visits to the "How to use this Book" page,
- 167 to the "Overview" page,
- 155 visits to the "Getting started with GitHub" page,
- 154 visits to the "Cartopy" page,
- 150 visits to the "Xarray" page,
- 132 visits to the "zero to Python" page,
- 129 visits to the "NumPy" page,
- 125 visits to the "Getting started with Python" page,
- 116 visits to the "Getting started with Jupyter" page,
- 110 visits to the "NumPy Basics" page,
- and the rest were <100 visits.
- Need data on how long people are staying on each page (or at least how many / what percentage of visits were “click throughs”)
EWG report out - Brian
- Last major section (git and github) finally settling into place (will still need polishing)
- Between Foundations, advances in the Gallery, Tutorials infrastructure, … we have much content to be proud of
- We have built some nice websites. It’s been a labor-intensive process.
- I see a crucial need to start building infrastructure and organizing around community-contributed advanced content
- Max Grover’s articulation of the “Pythia Cookbook” vision here is spot on. We can (must) accelerate growth of Pythia content by making it easy and attractive for people to contribute cookbooks to a well-organized gallery.
- I think these efforts are deeply coupled to the Pythia Platform initiative. For Pythia to have lasting value, we need to create something bigger than just a repository where people can upload their analysis notebooks. What the world needs is easier pathways to fully reproducible workflows on real data.
- Pythia Cookbooks will make their mark by being “plug and play” reproducible.
- Priorities for the short, medium, and long term:
- Short (weeks to couple of months)
- Finish the build-out of Foundations
- Polishing of GitHub material
- Get other works-in-progress reviewed, revised, merged
- Follow up on AMS momentum, especially Damien’s suggestions around merging PyAOS content with Pythia
- Set goals for summer intern
- Medium (this calendar year)
- Pivot Foundations work toward maintenance
- Develop a clear vision of the Pythia Cookbook
- Get that vision written down in a way that encourages participation and submission
- Work closely with IWG to tie Cookbooks to the emerging Pythia Platform
- Long (life of the EC grant)
- Nurture a community organization structure to ensure that Pythia content has a life beyond the EC grant
OWG report out - Julia
- 6 months since first OWG meeting
- STICKERS!
- Twitter
- Advanced Matplotlib (64 in attendance, 223 views)
- Object Oriented Programming (85 in attendance, 145 views)
- Intake-ESM (54 in attendance, 89 views)
- 234 Youtube subscribers
- Goals – Add Youtube link to portal, promote foundation book content in tutorial, create page with each tutorial’s Binder link, title and ending video cards
- Use of new GitHub organization to host all tutorial repositories
- Upcoming: Metpy (Feb 9), Thinking w Xarray (March 9)
- Drew (slides)
- Kevin T (slide deck)
- Max (slides)
- Liaise with AMS Python Conference (S. Collis, Chair) + PyAOS mailing list (D. Irving, Chair)
- Damien Irving - damien.irving@unsw.edu.au
- Scott Collis (scollis@anl.gov)
- AMS 2023 theme is on Data … could be a good opportunity to leverage Pythia
- Submit abstracts for SciPy
- Other conferences to have Pythia representatives
- Gathered interest in a satellite data cookbook
- Rebekah Esmaili (NOAA JPSS)
- Encourage (and measure) more use for more short-form feedback
- Design long-form feedback plans
- Project 7. Project Pythia Content Development (anyone want to be more involved on this project?)
- Project 8. Python data analysis & visualization and Jupyter notebook development for unstructured grids data
- ClymateAcademy (work with Abigail Bodner)
- Outreach science groups
- Ocean Hack Week?
- STEM outreach?
- Reach out to professor’s, share material, guest lectures
- Max advertising Pythia in a Meet the Experts talk with high schoolers
How are we doing?
- What is working well and what isn’t?
- Are there logistics or procedural changes we should consider. E.g.
- Standing meeting structure and frequency?
- How do we get more people taking advantage of the tremendous infrastructure and content that we’ve provided?
Open discussion
2022-01-20: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Max Grover / NCAR / @mgrover1
- Kevin Paul / NCAR / @kmpaul
- Brian Rose / UAlbany / @brian-rose
- Chris Cardinale / UAlbany / @cjcardinale
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- John: TAC hackathon update:
- Training resources (draft) now live: https://throughput-ec.github.io/ec-workshops/
- Notebooks, Conda, and GitHub sections link directly to Pythia content
- Finalize content by 1/26
- Hackathon training events kick off in Feb
- Can someone attend the TAG logistics meeting, Feb 2, 12pm MST? (Kevin T can)
- JC: Quarterly meeting next week during our normal EWG time slot.
- Max’s last day coming up soon (Feb 11). Are there open PRs owned by Max that we should prioritize (assuming Max has any cycles :-)
- All clear to stay involved, not funded
- Dask and Matplotlib Content
- Not co-mentor for Pythia content SIParCS internship
- Quarterly meeting instead of IWG next week!
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Pythia presence at student conference workshop and other networking events
- KT, MG, and DC talks – shoutouts on Twitter
- Feedback form
- Any last minute things you want to prioritize?
- Ping DC to add note to his talk
- Condensed ppt slide “how to get involved”
- Highlighted Pythia’s GH content and KPaul’s XDEV talk in summary/resource slide
- JC - pointers to all past talks on Pythia site? (AGU, AMS, etc)
- Discuss Open Issues and PRs
- If you didn’t use the “Content” label, it might be missing from these lists!
- Discuss Open Issues and PRs
- Priority shifted to a ppt slide for AMS
- Max: May we to re-assign this to someone else with Max leaving NCAR.
- MG - Higher priority and more attainable
- Lots of open PRs, missing any piece of this?
- Clean up the lead-in and make consistent with the now included lessons
- Worth to go back and read over it all as one piece, make stylistic choices consistent so it feels like one book. Do this AFTER the content is all there.
- Set up a GitHub milestone that tags all relevant issues and PRs. - BR
Action items
- “How to get involved” ppt slide - DC and KT
- Review GitHub PRs to merge by the 26th
- GitHub milestone - BR
- Open new issue for coalescing all presentation material (all public presentations) - JC
- Matplotlib content - MG
2022-01-18: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
Action items
- Create quick tweet graphic for communicating Pythia AMS presence
- Tuesday 10:45 am CST - Kevin Tyle in education conference:
- Session: Teaching Meteorology Online: A Retrospective and a Forward Look
- Presentation: Leveraging Project Pythia in an Advanced Geoscience Analysis & Visualization Course
- Monday 4:30 pm CST - Drew Camron in python
- Drew, Kevin create getting involved slide for virtual presentations
2022-01-13: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Orhan Eroglu / NCAR/ @erogluorhan
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KP: New theme released on PyPI and Conda-Forge (2022.1.13)
- JC: meeting with the publication manager of Scientia to discuss “feature your work in the annual earth and space sciences edition of our science communication publication, Scientia”
- JC: Meeting with EC TAC committee next week for preview of materials for TAC hackathon. Gang of GitHub 4 (Julia, Brian, Kevin T, Rob, John) scrambling to submit “Intro to GitHub content”
- JC: Is it just me, or is everyone getting hammered by notifications from our repos? :-)
- See the number of merged PRs below! We’ve been active!
- K Paul: good idea to add a section in intro to github about best practices for controlling notifications. Easy for a newbie to get overwhelmed. Within the notifications section, you can filter to what you really want to see at a given time (e.g. filter on Pythia). One strategy: turn off email notifications. KT will add this to the github-advanced-settings chapter.
- Add a page/chapter on how to customize your GitHub notifications to help people manage their notifications
- JC: Quarterly meeting in two weeks during this time slot
Agenda (with notes from the meeting):
- Feedback on the new theme / changes needed:
- Foundations 62: WIP: First draft of self-generating cover art on landing page [DRAFT]
- Low Priority, in progress
- Open Infrastructure Issues:
- Low Priority
- Maybe have a page with a list of contributors, too?
- Portal 157: Direct users to all Project Pythia content gallery from landing page
- Done? No. We want a direct link to the gallery that is already filtered!
- Maybe add a link off of the portal to the filtered gallery directly. How to do that with JS?
- Maybe a separate gallery for just Pythia content?
- TODO: An issue to review the tags on all gallery content
- TODO: Investigate how to quick link to pre-filtered gallery
- High Priority
- Portal 126: Automated addition of Foundations material to the Gallery
- Portal 117: Pull local theme into separate package
- TODO: Done. Close this issue.
- Is this done now with new theme?
- TODO: Close
- Auto-merging of maintenance PRs would be nice
- Low Priority until more contributors join
- Foundations env is the “everything env”; and portal env is just for building the portal
- We could separate “build”, “testing” and “dev” environments
- We could use dependabot to update our dependencies, if we switched to requirements.txt
- Low Priority
- Datasets 20: What procedures do we need for adding or modifying Pythia datasets?
- Quarterly meeting is 2 weeks from today (no IWG meeting in 2 weeks!)
- Conflicts with AMS meeting!
- Maybe move the quarterly meeting?
Action items
2022-01-06: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Abigail Bodner / NYU / @abodner
- Rob Ford / UAlbany / @r-ford
- Nicole Corbin / Unidata / @ncorbin-id
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Max Grover / NCAR / @mgrover1
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Volunteer to take notes today? Kevin
- Five applications submitted for the TAC hackathons (four will be awarded). Need to address Intro to GitHub content
- TAC hackathons will go forward
- We are on the hook for Intro to GitHub content (see issue above)
Agenda (with notes from the meeting):
- TODO items for AMS? (RIP)
- AMS is happening, but the opportunity to hand out stickers is limited (purely virtual, as of this morning!)
- Drew is planned to give the update; will sync with John
- OWG report (Max/Drew/Julia/Kevin T)
- Not much to report
- Let them know what needs to be done before AMS
- Maybe schedule a hackathon for anything that comes up
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- How to use this book PR merged!
- Ryan A nudged Abigail to talk with us
- (The “y” is for Python!)
- Pitch is recorded (we will discuss how/whether to share this with the group)
- 100% virtual summer education program
- Motivated by neuroscience community that went 100% virtual during COVID
- Goals
- Learn basic python skills
- Explore principles of climate science by analyzing data
- Apply method to collaborative projects among pods of students
- Lectures conducted 2x/day for different time zones
- TA sessions across global time zones
- TAs work with students in/with a Jupyter Notebook that is built/designed with lecturer
- Uses Gather.Town for 24/7 collaborative environment
- Week 1 covers climate science content + Pythia Foundations content
- Week 2 covers past/present/future climates + advanced analysis methods (including ML)
- Week 3 covers group projects
- Want to collaborate with Pythia; possible funding opportunities
- Want to leverage Binder/Colab to run notebooks
- Can we bring international connections?
- Can we bring new lecturers or TAs?
- Lots of opportunities for collaboration with Pythia here!
- Currently has no funding, but have been talking to folks at Google and Nvidia that seem interested in making this happen; but looking for help with funding and governance
- Would like to steer away from small grants and go after bigger funding so that we can make the service free to as many people as possible
- Future plans: Will organize a meeting with all interested parties to brainstorm ideas around organization and address funding
- Discuss Open Issues and PRs
- Is the top navbar good for both sites?
- Should not differ on different sites
- Maybe we don’t need it? Remove it?
- Or make it a fixed link to same as logo link
- Are these links the ones we want?
- Should the GitHub icon (top-right) point to the org or the repo(s)?
- Currently points to the org
- Individual pages in the Foundations book have a separate github icon that points to the repository (and clearly labeled by hover popup)
- What should be in the bottom nav menu (in the footer)?
- Should there be common content on both sites?
- It should all be common… But we don’t need this menu.
- Things that should be there:
- About Project Pythia
- How to contribute
- Contact us
- Email alias
- Links to GitHub Discussions pages
- Should there be site-specific content?
- Nothing should be site-specific?
- Site-specific content should be in the extra_navbar content
- Should it be more customizable? (i.e., write a Jinja/HTML template for each site?)
- With the Sphinx Pythia Theme 2022.1.6 release, you can use a custom HTML template for the menu, if we want to make something more stylish (the NSF acknowledgement now uses a custom template which is specified in the footer_items theme option)
- Can we make the feedback form more prominent in the bottom menu?
- Discuss Open Issues and PRs
- Let’s schedule a meeting soon where all hands interested can join to discuss and possibly plan a hackathon
Action items
- Schedule a brainstorming meeting for GitHub content
2022-01-04: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Julia: report from AGU
- Drew: have one person who left feedback
- Drew: working on getting involve content, have this out before AMS
- Julia willing to help with this too!
- Kevin: getting started with Github content
- Drew: student workshop will advertise for Pythia
2021-12-23: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
- John Clyne / NCAR / @clyne
- Ryan May / Unidata / @dopplershift
- Nicole Corbin / Unidata / @ncorbin-id
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- AGU report out
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- ~20 new followers after AGU
- Using the new theme for Foundations:
- Debates on whether top nav bar should be THE SAME across all sites, and what it should contain
- Should clicking on the logo take you to Portal home, or the home of the site you’re currently on?
- Depends on what you think is “home”
- Users probably don’t care about different subdomains etc. Just want to navigate easily and not be taken by surprise
- Can put additional link in Book sidebar back to title page of the book
- “Static” nav bar across the site: Project Pythia logo always takes you to portal home.
- Resources could be accessed through a menu?
- Interim solution:
- “Static” nav bar across all sites
- From left to right:
- Everything else (e.g. team page) in footers
- Broken “extra_footer” is being fixed. …There are difficulties with multiple theme inheritance, it turns out.
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
- #199 Update “how to use” with conda env instructions
- Ready to merge? RM will review shortly…
- #127 Video walk-through is waiting on this.
- Video will wait for new theme to be deployed
- Video will also cover giving feedback
- Title is now “Pythia Foundations” everywhere on Foundations and Portal site
- Can we close this? YES.
- Waiting for comment from JC
Action items
- KP: update the nav bar and footers on portal theme and Foundations
- RM: review #199 (DONE)
- BR: look into why reviews for #170 didn’t appear from ReviewNB.
- JC: take a look at #168 Dask dependencies, close if appropriate.
2021-12-21: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
Action items
2021-11-18: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Rob Ford / UAlbany / @r-ford
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KP: New theme imminent! (sphinx-pythia-theme.readthedocs.io), on PyPi and conda-forge
- MG: looking at Binder alternatives for Jupyterhubs @ NCAR
Agenda (with notes from the meeting):
- KevinP presented on Monday about Pangeo & Project Pythia
- Foundations 62: WIP: First draft of self-generating cover art on landing page [DRAFT]
- Low Priority, in progress
- Open Infrastructure Issues:
- Portal 184: PR template that reminds users how to open in draft to avoid triggering premature review requests
- Ryan will open a PR on this with a simple alert message
- Medium/Low Priority
- Maybe have a page with a list of contributors, too?
- Portal 157: Direct users to all Project Pythia content gallery from landing page
- Needs 156 first
- High Priority
- DUE BY AGU
- Portal 146: Add support for hosting video tutorials and associated notebooks created by Xdev
- Portal 126: Automated addition of Foundations material to the Gallery
- Is this superseded by #156 and #157? Yes and no.
- How to automate? Not sure.
- Should be more fine-grained (i.e., don’t just add whole book)
- Low Priority
- CLOSE FOR NOW (can be reopened)
- Portal 117: Pull local theme into separate package
- Auto-merging of maintenance PRs would be nice
- Low Priority until more contributors join
- Just merged it today (Portal PR166)
- TODO: Closed
- Already have a primary environment for building the book (which runs the notebooks)
- See thread on Foundations 127
- Should we have one environment that includes the build of the book and the running of the notebooks? Or should we have two separate environments for the two use cases?
- Use metapackages? Do we want to have to support our “own stack”? Probably not the best thing to do...
- We are trying to encourage our contributors to learn how to manage their own environments, so it's reasonable to assume that they will manage their dependencies themselves
- Requires clearly indicated what packages are required for each notebook
- Can hide (collapse) the “install these packages to run this notebook” cell on the Jupyter Book
- Datasets 20: What procedures do we need for adding or modifying Pythia datasets?
Action items
- See inline TODO items above
2021-12-09: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- John Clyne /NCAR /@clyne
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany @ktyle
- Ryan May / Unidata / @dopplershift
- Kevin Paul / NCAR / @kmpaul
- Chris Cardinale / UAlbany / @cjcardinale
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR: volunteer to take notes today (Kevin T)
- AGU next week! (Julia is on her way now)
- John giving lightning talk in EC town hall. Feedback on slides requested by COB today. Must be uploaded tonight.
- EC office is including Project Pythia on its list of AGU talks.
- Kevin P. is presenting virtually in hybrid poster hall, although it’s still listed as in-person. Poster session Monday 12/13 5-6 PM CST.
- EWG 12/23 is still planned, although some folks may be on holiday.
- IWG 12/16 will proceed as planned although some folks will be @ AGU
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Julia is bringing stickers to AGU
- Brief feedback form, developed at last week’s hackathon; Drew will verify if it is being disseminated at AGU.
- Let us know that your PR was merged here!
- Foundations and Portal quick links
- Contributor’s Guide for ProjectPythiaTutorials Template repo
- Portal #187 - Foundations Content to Gallery: can now filter gallery by “Pythia” to quickly find our content
- Foundations #190 Adding the Google feedback form to the footer of each foundations page AND to the contributor's guide.
- Foundations #169: Outline for GitHub content
- Discuss Open Issues and PRs
- Discuss Open Issues and PRs
- #191 and #192: drafts of first couple of GitHub lesson
- 191 just updated; some of previous version to be incorporated into 192
- #163 is being closed as issue has been resolved by updates to sphinx/jupyterbook upstream
- #167: Title of the “Foundations” book will be Pythia Foundations. Now need to go through content so that this title is consistently used.
- #180: Add Pythia Foundations to the Gallery of Jupyter Books
- #127 (video supplement): outstanding issues about instructions for downloading and running individual notebooks. (We discussed this at the last IWG meeting but not clear if we reached an actionable decision??) . Right now we have instructions to launch via Binder, and instructions to do a full clone/build, but we need a middle ground where someone wants to download and run one or more of the Pythia Foundations’ notebooks.
- Ryan had suggested just using the YAML file as the target for a conda environment build.
- Action: verify that environment.yml is up-to-date (which it should be since it’s tested every night) and then develop the relevant documentation for how one could do the conda build. Don’t worry about creating a separate, stripped-down environment.yml.
- John envisions a typical use case would be someone who wants to run a notebook as-is, but with their own data source
- KP plans to push a new Sphinx book theme (for Portal as well as Foundations)
- #184: (Low-medium) Update contributor’s guide to discuss items such as opening a PR as a Draft vs. a “traditional ready-for-review PR. John will continue work on this. Suggest, but do not require, that contributors pose PRs initially as drafts.
- Rather than updating contributor’s guide, provide a PR template that discusses draft vs regular PR
- #148: Add overarching “how to get involved”. Drew hopes to have this ready by start of AGU. GitHub itself presents a barrier to people who want to reach out and contribute. How can we make it easy for potential contributors to do so and not fall into the mindset of “unless I am well-versed in GitHub I can’t contribute”? For the person who wants to contribute something, provide a quickest route for someone who wants to participate (work this into GitHub lesson 1?)
- #116: Rewrite & update the “About” content so it’s not so wordy.
- BR keeps saying he’s going to do something on this, but it hasn’t gotten done.
- Would be “nice” to have this done by AGU, but not top priority
- Goal: Develop a set of EarthCube-related & branded workshop materials for folks in the Earth Sciences domain
- Donata Giglio (CU Boulder), PI, Simon Goring (U. of Wisc), co-PI
- Applications due Dec 22, Workshops to start in January, and must be completed by April 8.
- Pythia provides:
- Introductory content for conda (done), Jupyter Notebooks (done), GitHub (WIP)
- consulting support during 4 pre-Hackathon workshops (via slack, etc.)
Action items
2021-12-07: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
Agenda (with notes from the meeting):
- I fell behind on posting foundations content… can post more frequently during AGU? - MG
- Still to post
- 4 Foundations content
- 3 Resource gallery
- Additional posts
- Monday/Wednesday - Foundations content (Max)
- Tuesday/Thursday - Resource gallery (Kevin Tue Julia Thu)
- Pin “Pythia at AGU” to the top
- How to get involved section?
- Reach out to Drew to see where this is at
- Do we have a QR code for the Pythia feedback form?
2021-11-23: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Agenda (with notes from the meeting):
- > 220 followers
- Foundations content
- Max is responsible for rest of November
- Drew is responsible for rest of November
- Status of how to get involved?
- Drew should get this done before AGU
- Would this be next week?
- Do we have the infrastructure in place?
- General pre-AGU stuff too
- Cookbooks to work on
- HRRR data
- CESM2-Large Ensemble
- Surface observations
- Embedded on the portal site somewhere
- Questions to ask
- Legible and understandable?
- Did you complete the content?
- Did you learn?
- Limit the number of questions
- Open ended feedback
- Open discussion on the Portal
- Want to have this done by the end of the hackathon
- QR code for posters and slides!

- Helping with the data help desk
- Tweet at the Data help desk, encourage people at AGU to check out the resources
- Schedule hackathon - Max
- Put together feedback form, post on discussions - Julia
- Tween to visit KP and JK at AGU (links to content)
2021-11-18: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Rob Ford / UAlbany / @r-ford
- Max Grover / NCAR / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KP: Note that AGU iPosters are due Dec 7!
Agenda (with notes from the meeting):
- Update on Tutorial Content Location
- Not using a single repository to hold all tutorials. Why?
- No guarantee that the software stack used for each tutorial is or will remain compatible with each other
- Searching through a big repository is not as easy as searching through an organization of repositories (in a repo, all you see are directory names; in an organization, you see repo names and short descriptions)
- Using releases/tags with each new tutorial added only guarantees that the new tutorial added with that release will work, and previously added tutorials could break due to dependency changes needed for the new tutorial
- Using different branches in a single repository can achieve the same functionality as multiple repos in a single organization, but it is harder to search and more difficult for people to learn about.
- Created a new GitHub Organization (https://github.com/ProjectPythiaTutorials)
- Each tutorial will be its own repository in this organization
- Each tutorial will be treated as a “snapshot in time” and will not be keep up-to-date with changes in the stack used/described in the tutorial
- Looking into archiving tutorials after a fixed time after their use (1 month? 6 months?), which will force them to be “snapshots in time”
- CI can be used to monitor the health status of tutorials in each repository
- Run nightly or weekly CI to execute the notebooks in each repo
- No way of health-checking textual content, so need a way of indicating how much content “can be checked” and how much “cannot be checked”
- Can indicate a warning when the “current versions” of software needed for the tutorial do not match the tutorial versions
- In addition to tutorial repos, there will be 2 additional repos in this organization:
- A GitHub Pages repository with a simple Jupyter Book showing all of the tutorials
- A tutorial template repository
- We will keep the organization membership invite-only for now
- Invite contributors to transfer a repository to the ProjectPythiaTutorials organization, and then create a “repo-level team”
- Policy related to PR review requests: What are reviews for?
- Reviewing a PR takes time, and I routinely get pinged to review PRs that are not in a “ready to review” state.
- GitHub sends a notification to any reviewer on a PR when any new commits are pushed to the PR branch.
- I get review notifications for draft PRs, too.
- There is a difference between using reviews to ask “Please give me permission to merge” and “Please see if this approach is a good one”.
- What policy should we use to indicate that a PR is ready to be reviewed now?
- Can we prevent automatically asking for reviews on draft PRs (i.e., notify reviewers only when it is converted from draft state)?
- By default, draft PRs do not automatically ask for reviews/reviewers
- Start all PRs as draft PRs
- @-mention people to get “Please take a look at this work in progress” for draft PRs
- Don’t convert PRs to “regular” status until build/tests pass
- TODO: Add new policies to contributor guide
- Found a solution using JS and HTML input checkboxes
- Currently trying to figure out why it is not working
- Needs a minor tweak to layout (center vertically in navbar)
- TODO: Let’s fix and merge now and revisit later if desired
- Foundations 62: WIP: First draft of self-generating cover art on landing page [DRAFT]
- Low Priority, in progress
- Open Infrastructure Issues:
- Medium/Low Priority
- Maybe have a page with a list of contributors, too?
- Portal 172: Manually add Pythia content to gallery
- High Priority
- Also good fodder for Hackathon
- Waiting for affiliations tags to be merged in first (Issue 156, PR 160)
- TODO: Update button to auto-generate PR -- doe
- DUE BY AGU
- Portal 157: Direct users to all Project Pythia content gallery from landing page
- Needs 156 first
- High Priority
- DUE BY AGU
- Portal 146: Add support for hosting video tutorials and associated notebooks created by Xdev
- Portal 126: Automated addition of Foundations material to the Gallery
- Is this superseded by #156 and #157? Yes and no.
- How to automate? Not sure.
- Should be more fine-grained (i.e., don’t just add whole book)
- Low Priority
- High Priority (see Portal 178)
- Can easily be done with JS content “filtering” based on classes, but it might not be “pretty”
- Portal 117: Pull local theme into separate package
- Auto-merging of maintenance PRs would be nice
- Low Priority until more contributors join
- Just merged it today (Portal PR166)
- TODO: Close it
- Already have a primary environment for building the book (which runs the notebooks)
- See thread on Foundations 127
- Should we have one environment that includes the build of the book and the running of the notebooks? Or should we have two separate environments for the two use cases?
- Use metapackages? Do we want to have to support our “own stack”? Probably not the best thing to do...
- We are trying to encourage our contributors to learn how to manage their own environments, so it's reasonable to assume that they will manage their dependencies themselves
- Requires clearly indicated what packages are required for each notebook
- Can hide (collapse) the “install these packages to run this notebook” cell on the Jupyter Book
- Datasets 20: What procedures do we need for adding or modifying Pythia datasets?
Action items
- See inline TODO items above
- Add new policies to contributor guide
2021-11-15: Ad Hoc Tutorial Content Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Julia Kent / NCAR / @jukent
- Kevin Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Outstanding issues from conclusions in last meeting:
- Wanted all tutorials in a single repository containing many directories (one directory for each tutorial) and a single environment file for all tutorials
- Version/tag repository for each tutorial release
- Consequences of sharing a single repository
- Binder build time
- Some tutorials require specific versions, inconsistent across versions
- Tutorial content being forced to stay out of date or break
- `conda update -all` and things start to break
- Being able to build the environment is a priority!
- Separate organization -- ProjectPythiaTutorials
- Owns multiple repositories (could be so many)
- Nice for users, might be a headache for developers
- Can have template (below)
- Multiple environments files in single repository
- One per directory / tutorial
- Can this work for Binder? (MyBinder only respects one environment file)
- CI builds, health checks on each tutorial in repository
- Create a GitHub template repository
- Templates and Binder set up ready for people
- Separate location with corresponding Binder link and Markdown page
- Pageo on the Pythia Portal
- Point Binder to unique branch
- Same repository but with different branches, acts as on repository
- Main branch is template
- Impose branch protection rules
- Not as navigable, would like you’re going to the repository
- Orphaned branch
- Standard branch naming scheme (dates)
- Can be hidden
- Potentially moving away from flat Binder
- Easier to remain on Binder for old ones with separate repositories (or can GitHub trigger on certain branch names)
- Want to optimize Binder, make it more robust and operable (want it to work on HPC? -- should HPC look more like cloud)
- How to make Binder work more quickly and more customizable
- Pay for more sophisticated hardware, based off of Pangeo Binderhubs with dask capabilities
- Fast spin up - 3 to 5 minute spinning wheel
- Cost prohibitive ? (test on Jetstream2?)
- Repo to Docker, more performant
- Look at Thebe and how it spins up notebooks in Jupyterbook
- Go forward with new organization and multiple repositories
- Each tutorial has its own environment, which is a “snapshot in time”, not necessarily preserved content
- Too much work to keep tutorials up to date perpetual concepts -- maybe down the road this can be a goal.
- Cookbooks are a better place for maintaining / updating content
- When alerted, add note that things are out of date
- Will 100% match video!
- Archive repository, doesn’t change URL or hide. Indicate if something is out of date or being replaced.
- As soon as a tutorial is given, archive it? After a set amount of time
- Will be easy to find the most recent tutorial
- One org not multiple (can search in one place)
- Keep cookbooks within Project Pythia organization
Action items
- Create this organization - ProjectPythiaTutorials - KP
- Create template repository (Readme, directory structure, Binder, CI actions)
- GitHub landing page (tutorials.projectpythia.org, e.g.) that uses Jupyterbook, how to contribute, etc
2021-11-11: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Nicole Corbin / Unidata / @ncorbin.id
- Chris Cardinale / UAlbany / @cjcardinale
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR: volunteer to take notes today? - JK
- KP: Met with Leah Wasser from the CU EarthLab about collaboration (see below about discussion)
- BR: introduce Chris Cardinale to the group
- JC: Quarterly meeting Dec 9, 2021. Please provide John input on agenda items appropriate for quarterly meeting (e.g. forward looking, general)
- JC: SIParCS projects open (https://www2.cisl.ucar.edu/siparcs) - Project Pythia Content, Using/Optimizing Binder as Pythia platform, GeoCAT Examples Project
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- How to share at upcoming conferences
- Pythia cookbooks button-down
- Have a pre-AGU hackathon, interest? Week after Thanksgiving?
- AMS : Kevin Tyle, Drew Camron, Brian Rose, John Clyne
- AGU: Kevin Paul, Julia Kent
- (Mostly infrastructure PRs merged this last 2 weeks - celebrate at IWG?)
- Foundations and Portal quick links
- Don’t be shy - link us to your merged PR here :)
- Improve the review timeline!
- Submit by Nov. 30
- Skeleton of this PR continues to develop: see current rendering
- Proposal reference here
- KT incorporated comments, mock up in link
- Build failure recently popped up
- Fix build issue first
- Discuss Open Issues and PRs
- Add photos and up to date Team page #180
- #177 is superceded by #180, can be closed when #180 is merged
- Add Chris to team
- MERGED
- Add support for hosting video tutorials and associated notebooks created by Xdev #146.
- Status? Should this stay open now that the new Tutorials repo has been created?
- To be discussed on 11/15 Ad Hoc Tutorial Content Meeting
- Add overarching “how to get involved” page #148
- Outline done, still in progress
- GitHub content #169 (already listed above)
- How to Use This Video script is ready for comment #127 -- also need a discussion about where to keep video creation materials (project files, graphics, etc)
- Use Google Drive, can be public
- JC inviting everyone
- Repurpose clickable links at the end of Youtube videos
- Fine tune the script before the video.
- Update Xarray content #98: Is this done? Close
- Discussion with Leah Wasser from CU EarthLab
- Leah agreed that their current content is not Pangeo focused (older, not much Xarray and Dask -- numpy focused)
- Currently they don’t have bandwidth to update material
- Content is meant to be supplemental material for CU courses (not stand-alone educational material) - so not duplication of efforts, worth having in both places
- What they are really looking for is analytics (AI/ML) content focused on geosciences
- Will meet again after Jan 1 to reconnect and discuss more
- https://github.com/ProjectPythia/pythia-foundations/discussions/162
- https://www.earthdatascience.org/
- Use our infrastructure for related project, interest from both parties
- 2 weeks from now is Thanksgiving
- Push quarterly meeting to January?
- Dec 9 EWG
Action items
- Plan Cook-Book Hackathon - OWG
- Review and approve GitHub content outline (#169) - BR, JC, KP (and anyone else)
- Review script for video supplement (see #127) JC, BR, anyone else
- 11/15 Ad Hoc Tutorial Content Meeting (discuss #146)
- Add overarching “how to get involved” page #148 - DC
2021-11-09: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
Agenda (with notes from the meeting):
- Need to get stickers to Julia for AGU
- Meet up next week
- Student conference workshop - Max, Drew, Kevin helping with this
- Pandas and datetime coming up soon :)
- Resource gallery
- Potentially post on Mondays too
- Blog post being shared for the Python tutorial
- HRRR data
- Would be good to have a hackathon for the cookbook
- How to get involved - Drew working on this
- KP has reservations about path we are choosing
- A little too easy with meeting - issue with long build times of env
- AMS career fair booth presence
- Share table with SOARS, not a ton of staff
- Approaching in-person outreach
- Would be derived from the how to get involved
- Requirements
- Overview of project
- QR codes to the foundations material and main portal
- Discuss internship opportunity
- Sticker bribery
- Follow to make sure they get one :)
- Make sure that Pythia Twitter is in presentations
- Take advantage of #AMS2021 / #AGU2021
- Share the cookbooks with Foundations links before AGU/AMS
- Week after Thanksgiving would be good hackathon time
- Bring this up on Thursday
- Helping out with this again - Max
2021-11-4: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Ryan May / Unidata / @dopplershift
Quick Updates:
Agenda (with notes from the meeting):
- Council for Funded projects (11/3) report out - JC
- TAC hackathon proposal awarded.
- Target date for GitHub content, 11/30/2022
- Other Pythia deliverables: virtual tutorials for up to 4 hackathons
- Copy of proposal here
- Send link to PR #169 to TAC for feedback?
- EarthCube Early Career Travel Grants
- The 2021-2022 EarthCube Early Career Travel Grant Application is open to attend a conference or workshop (virtually or in-person) between November 1, 2021 and March 4, 2022 with a due date for your application of 3 weeks before the start of your conference.
- Please find the application linked here, which includes details about the grant.
- Research Software Alliance (ReSA) Presentation- The Research Software Alliance (ReSA): Empowering the research software community worldwide. Daniel Katz and Michelle Barker
Sign up now for the Jumpstart Your Sustainability Plan mini-course! It’s free, and it’s virtual. December 6-8, 2021
- Volunteer for the Virtual Data Help Desk at AGU 2021: Volunteer as a data or software expert for the Data Help Desk at AGU 2021 (Dec. 13th-17th, 2021).
- Funding for CFP office ends in March
- Are we taking full advantage of Earth Cubes outreach “services” -
- Video tutorial update - Julia, Max, Kevin T?
Notes
DC: we have stickers and can mail them out
2021-11-01: Ad Hoc Tutorial Content Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / John Clyne
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Max Grover
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Common Place and Environment
- Identical Download and Installation Steps
- Dedicated Repository
- Be able to run on Binder
- Version controlled (be able to access version before each new tutorial)
- Why do we want a separate repository?
- Ownership, consistency, silo-ing if someone leaves or deletes account, lack of control, long-term maintenance and version control
- Be able to fix things when they break
- Easier for binder compatibility
- Easier to jump in and have prep done
- Consequences moving forward?
- Template for tutorials?
- Add a note if certain environment doesn’t work
- Sticky old versions to side if something doesn’t work in an up to date environment
- Use continuous integration services
- Kevin Paulwould be the first to test this with OOP?
- Learning goals, summary
- Designed to help the instructor and do some of the leg work, not meant to be onerous and discourage people from contributing
- Individual directories for each tutorial containing notebook and data directory and a readme markdown (to cover content not read during live tutorial recording - i.e. links to videos worth watching before hand - similar to the ESDS blog posts)
- Each directory for each topic (can have more than one tutorial or session in each directory)?
- Run into issues when tutorials covers multiple topics
- One folder per tutorial
- Landing page ReadMe with Pythia styling and table of tutorials
- Steps would be”
- Add folder with tutorial notebook, data directory, and markdown
- Edit markdown for overall repository to list new tutorial (add content to table)
- If need to update main environment -- pin version release -- OR is it a new release for every single tutorial (or every tutorial series)
- Common data directory lends to data reuse across tutorials, less memory.
- Pythia datasets? What were the reasons for the data repository being separate?
- Nice to have the data locally and cloned with the repository (one step for updating)
- How to leverage the notebooks in the Foundations book
- Reduce redundant work between the 2
- Some differences in content when designed for videos
- Lower barrier for getting them in
- Different teaching styles - some people won’t want to type out their entire explanation and just give a skeleton of the work
- Don’t want too onerous for the tutorial creator
- The foundations book is more prescriptive in formatting and the review process
- Pythia datasets better once we have 50+ tutorials with data
- Location for Past Content Table?
- Extant material - no current massive reformatting. It will be scattered.
- Add a landing page that identifies old content (in new tutorial readme and on portal?)
Action items
- Create repository - done
- Use mamba for solving environment - done
- Continuous integration services - ask AB
- Binder link - MG - done
- Create a template
- Create a README for entire directory
- README for data directory
2021-10-26: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Have them - sending email to verify we can open them
- Total number - 500 - how to split these up
- 100 for Albany
- 100 for Julia for AGU
- 100 for AMS
- 100 for NCAR/UCAR
- 100 for different giveaways (backup stash)
- Sticker distribution - how does postage work? Checking with Sheri
- Mail to Albany
- Giveaways - best to coordinate with AMS/AGU
- Asking about getting stamps and envelopes
- Student workshop
- Short Course
- What is the overall focus of the material appropriate for a post?
- Introduction - introducing some problem
- Prerequisites
- Introduction of new libraries/concepts
- Dig into the example
- Visualization/conclusions
- Pythia-Cookbook repo?
- Formatting
- How to integrate with Binder
- Flush this out at the hackathon
- Create a repo for this
- Link to foundations content
- Add to the resource gallery
2021-10-07: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC: Still waiting on notification for TAC hackathon award. Had to rework the proposal a bit. Asked for GitHub material to be due later (3 weeks to 3rd week of Nov). Actual hackathons wouldn’t start until calendar year 2022
- BR: Julia was able to visit Albany; gave a nice overview of Pythia and the educational efforts in ESDS (shoutouts to GeoCAT and Xdev at NCAR); hybrid in-person/Zoom seminar
- KP: AGU presentations/talks given for sessions on Pangeo and Pythia have been accepted
Agenda (with notes from the meeting):
- Needs a minor tweak to layout (center vertically in navbar)
- Jupyter-Book pins pydata-sphinx-theme at <0.7.0rc; Can we fix this?
- Open infrastructure issues:
- 173: Move the Resource Gallery to an external website
- Is this what we want?
- Separating gallery to its own repo was a way around the limitations of #125; but if we can do this with JS, then we shouldn’t separate out gallery
- Close? For now, yes. Let’s go forward with a JS approach to #125.
- 172: Manually add Pythia content to gallery
- High Priority
- Also good fodder for Hackathon
- Waiting for affiliations tags to be merged in first (Issue 156, PR 160)
- 157: Direct users to all Project Pythia content gallery from landing page
- Needs 156 first
- High Priority
- 156: Add affiliation filtering content
- Will be closed by PR160
- What affiliations do we want?
- Automate the list of affiliations from tags in gallery config
- Allow multiple affiliations for each item in gallery
- High Priority
- 146: Add support for hosting video tutorials and associated notebooks created by Xdev
- TODO: Set up a meeting for future discussion (Julia to set up)
- 126: Automated addition of Foundations material to the Gallery
- Is this superseded by #156 and #157? Yes and no.
- How to automate? Not sure.
- Should be more fine-grained (i.e., don’t just add whole book)
- Low Priority
- 125: Multiple gallery filters
- High Priority
- Can easily be done with JS content “filtering” based on classes, but it might not be “pretty”
- 117: Pull local theme into separate package
- 45: Clean up portal look and feel
- 20: What procedures do we need for adding or modifying Pythia datasets?
- NOTE: Watch this repo, too!
- 157: Previous and next page links are formatted differently in new Xarray notebook
- 128: Revert Google Analytics Workaround Once pydata_sphinx_theme Fix Released
- Related to PR160
- Low Priority
- 119: Protected branch for main?
- Auto-merging of maintenance PRs would be nice
- Low Priority until more contributors join
- 92: Notifications for build failures?
- High Priority
- TODO: Anderson will take care of it
- 85: Documentation for the book preview workflow
- Just merged it today (Portal PR166)
- TODO: BR will bring up at EWG meeting next week
- 74: External links template available from Foundations
- 56: Pythia Foundations Environment
- Already have a primary environment for building the book (which runs the notebooks)
- Low Priority
- 54: Internal vs External Links
- Low Priority
- For future discussion
Action items
2021-10-14: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- Drew Camron / Unidata / dcamron
- Rob Ford / UAlbany / r-ford
- John Clyne / NCAR / clyne
- Max Grover / NCAR / mgrover1
- Nicole Corbin / Unidata / ncorbin-id
- Kevin Tyle / UAlbany / ktyle
- Kevin Paul / NCAR / kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR: Volunteer to take notes today?
- JC: No news on EC/TAC hackathon proposal. If awarded, I would ask for extension to Oct 31 deadline for GitHub content.
- Supposed to hear by Oct 6, hopefully hear soon...
- Good to ask for this extension
- JC: John was invited to give a lightning talk on Pythia during Earth Cube town hall meeting at AGU
- Application for Pythia content intern submitted to SiParCS program
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Delay in shipping… not in time for Julia’s visit, got stuck in Kentucky
- Won’t be here this week…
- Plan on distributing at AGU/AMS
- 16 “claps” (likes) on our post
- 229 views, average of ~3 minutes
- Most traffic coming from Twitter (~50%)
- Kevin working on post related to HRRR data
- November is the #30daymapchallenge event!
- Up to 142 followers - gained ~50 over past few days (EarthCube posts helped)
- 38 Tweets so far
- Improved engagement (~10%) as of recent
- John giving a lightning talk for Earthcube - going to distribute stickers there too!
- Suggest binder launch in jupyterlab content #153
- Discuss Open Issues and PRs
- High priority - need to find someone to do this
- Would be good to label as Hackathon
- Drew willing to help with this, good to get done at Hackathon
- Good to find someone else to help too!!
- Pull headshots from GitHub?
- Thinking we should people’s GH profile pictures
- Good hackathon task - can get done in a couple of hours
- BR self-assigned this months ago and hasn’t done it…
- Target the hackathon to do something about this
- Issue with JupyterBook…
- Good to accept before this is fixed
- Rob working on incorporating the suggestions from Kevin
- Ready for review 😁
- Reviewers?
- Would be good to tag Anderson on this too
- Value from people not as familiar with the package/tool
- Top priority - last missing section… (hackathon main priority)
- Group to get together before hackathon - over next week
- John
- Kevin T
- Ryan
- Brian
- (Kevin P)
- Drew (if convenient)
- BR and KT are supposed to take this on, using material from KT’s class (mostly repurposed from Carpentries)
- Plan on putting together a rough cut (try to get done)
- At hackathon, review this, distribute
- Can hopefully extend - sort of arbitrary
- Just want a draft by this date; not necessarily hard deadline for polished product
- Work to get into Foundations is mainly templating
- Would be good to add to hackathon content (more fitting into format)
- BR would work on this at hackathon
- Tall order to take Jeckyll-built content to JupyterBook
- Take quite a bit of time - not necessarily one afternoon
- Should we just link to that material??
- At some point, draw some line
- Should have basic material
- Know enough to contribute
- Could add valuable content on pull request
- Hackathon soon (would be good to have one before AGU/AMS)?
- Thoughts on having it in two weeks?
- Send out an availability poll
- Max will send this out
- If time works out, would be good to have this on same day as working group
Action items
- Schedule next hackathon, tentatively before end of October - Max
2021-10-07: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- JC:
- Will be late today because of conflict with the NSF EC CFP meeting!
- Will hear today whether the TAC’s hackathon proposal will be awarded. If it is, we are on the hook to provide "Intro to GitHub” content by ~Oct 31 (and help support up to 4 hackathons with user support via slack, etc.)
- Kudos to the OWG: This article was published yesterday: https://www.earthcube.org/project-pythia
Agenda (with notes from the meeting):
- SIParCS Internship Proposals:
- The deadline is tomorrow! Who is submitting?
- Project Pythia Content Development
- CISL Mentor: Julia Kent
- Co-Mentors: Max Grover, Brian Rose, Kevin Tyle
- This will be funded by Pythia
- Max will submit this later today
- Link to submission doc
- Binder optimization for speed + HPC (i.e., Pythia Platform)
- CISL Mentor: Kevin Paul
- Co-Mentors: Anderson Banihirwe
- Historically, Binder implementations have been optimized for *cost*
- Should IWG focus on the portal and EWG on foundations? Or should IWG focus on “infrastructure” labeled issues and EWG focus on “content” labeled issues?
- Working groups are not associated with individual repos; but with education/content vs infrastructure
- Stick to content vs infrastructure division
- Some overlap:
- Submitting data (which involves both infrastructure and content)
- What is the procedure for submitting / modifying / etc the datasets?
- Put an issue in the dataset repo to discuss this
- DASH repository can host large datasets, too
- Current infrastructure is not tied to the repository (is general and won’t need to change if we use DASH)
- Update to CONTRIBUTING.md
- Followup: Remove RTD builds and config; remove ci/requirements.txt file
- Need another issue/PR to update docs on how to use the previews
- Needs a minor tweak to layout (center vertically in navbar)
- One typo to fix. Might need additional verification, but I think it’s good to go.
- Some changes have been suggested
- Needs 156 first
- High Priority
- Will be closed by PR160
- What affiliations do we want?
- Automate the list of affiliations from tags in gallery config
- Allow multiple affiliations for each item in gallery
- High Priority
- Is this resolved with the merging of PR167
- KP: Remove requirements.txt and readthedocs.yml files and close? Yes.
- Is this superseded by #156 and #157? Yes and no.
- How to automate? Not sure.
- Should be more fine-grained (i.e., don’t just add whole book)
- KP: Create an issue to manually add all of the Foundations book content as items in the gallery
- How bad is it to do this statically?
- # Domains: 8
- # Formats: 6
- # Packages: 24
- 8 + 6 + 24 = 38 total number of options (without affiliations)
- Total number of possible pages:

- I think that might be too much for static pages… Need javascript
- High Priority
- Do we need an external contributor to help write the JS?
Action items
- KP: Issue about datasets and procedures in pythia-data repo
- KP: Issue to create gallery.projectpythia.org with a dynamic site
2021-09-30: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Max Grover / NCAR / @mgrover1
- Ryan May / Unidata / @dopplershift
- Nicole Corbin / Unidata / @ncorbin-id
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR: Volunteer to take notes today?
- JC: Potential SIParCS mentors met today - more discussion needed
- JC: Orhan, Anissa, Alea, Michaela (GeoCAT team) added to “pythiaeducation” group for PR reviews
- JC: Moving ahead with TAC proposal to support jupyter notebook hackathons (more below)
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Confirmation on the stickers

- Blog post being added to the Pangeo Medium publication
- MG - helping with the Data Help Desk next week
- Recording of our “resource” (portal) is due October 5
- One pager also due October 5
- One pager is in the google drive
- Logo due October 6
- These would be used in future help desks too
- Helping with moderating Twitter and such
- Yay! Thanks Julia! Thanks all reviewers!
- Lots of infrastructure improvements and fixes!
- Already on Week 6
- Book is not entirely up to date, update the book week by week
- Pythia Content
- Only have one (quality) Pandas notebook currently in Foundations
- Figured it was good to break things up a bit more
- “Pythia-ized” content built around the NY mesonet dataset
- Working on something that is a bit more local
- These might be merged into foundational content…
- Would like to put one together for both the class, one in general
- Want to get more active feedback
- What’s on our pages?
- Currently have “open an issue”, “suggest an edit”
- Is this sufficient?
- Where would this Albany NY Mesonet content belong?
- Educational content → go into the foundations book
- Nearly ready for re-review
- Tremendous demand for Dask training!
- PR still in draft for now
- Plan to have it ready for review by next EWG
- Adds the tutorials videos as individual cards
- Should be ready to merge - need one more approval
- KP will take a look, get this merged
- KP + JK worked on dealing with cards with long lines
- Folded into that PR too :)
- Moving forward on this 👌
- Conversation ongoing regarding how to deal with our own content and the portal
- NC and JC working together on this
- Plan on working on this after October 15
- Medium priority tag
- Would be helpful to have a video on getting started with Jupyter
- 2 GIFs put in some of the content, Alea would be good person to help with this
- Not as high of a priority since we have some video, but not necessarily a perfect match
- Might be good to highlight sooner
- Include “click this”, detailed instructions walking through Binder
- Transferring this discussion into GH
- Medium priority tag
- Transfer this to the discussion board
- Discussions seems like a better place for this
- Moved to Discussions (Idea)
- Mentioned during the previous hackathon
- Not sure how to label here
- Tagging Anissa to see where we are here
- Medium Priority
- Brought up throughout the past year…
- Think about what is “too advanced” for the foundations
- Ex. Dask might be the last “chapter”
- Excluding MetPy in this since it is more domain specific
- New section
- Core content, python packaging, advanced
- More content other than foundations
- Could do none of these things
- Don’t have time for everything - could open opportunity for people to contribute
- How do we support something like this?
- Max will open a discussion item about this - some sort of “Pythia Cookbook”
- Build the possibilities here - no place to put it now
- SIParCS intern discussion
- NSF EC Technical Architecture Committee proposal to support jupyter-notebook based workshops is moving forward
- Led by Donata Giglio, CU
- $0 award to Pythia
- SOW
- contribute conda env setup, intro to jupyter, and intro to GitHub modules.
- Provide up to four training sessions on above topics.
- Provide tech support to workshops on Pythia topics
Action items
- DC: open PR for binder link in Jupyterlab content
2021-09-28: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Do we absolutely need to have the partner logos on there? If not, I think we should stick with the original design. If so, we can add the “official” partner logos to the bottom of the sticker
- Might need to get a larger sticker to make it actually legible
- Drew will follow up with Ryan to make sure it is okay not to include
- Make sure that the “flyer” has the correct branding and logos on there
- Thoughts on posting from the Project Pythia Medium account on the Pangeo blog?
- Would still have our official account, utilize the 700+ followers they already have
- Good to engage with this community, provide some regular content here
- Going to be added as a writer to the Pangeo medium blog
- Future blog post ideas (every other week)
- Example doing something cool with Python
- Have some prerequisites at the top
- Quarterly meeting updates
- What has gotten done over past few months
- HRRR data on the cloud - post Wednesday October 13
- CESM2-LE data on the cloud - post Wednesday October 27
- Radar on AWS example
- METAR from thredds post
- RDA via THREDDS using ERA5 data
- Encourage teachers to invite their students to post to the page
- MG - sorry about not keeping up with “interaction” posts
- Could we post these on Tuesdays instead? Fridays are a bad day for me, plus typically less interaction on these days…
- Monday - retweet MetPy Mondays
- Tuesday - “interaction” posts (asking questions, polls, etc.)
- Wednesday - Resource gallery highlight
- Thursday - foundations content highlight
- Friday - retweet the ESDS blog post, or post it via Pythia?
- Use your best judgement - focus on education, examples, blog posts
- The degree to which participants find the training favorable, engaging and relevant to their jobs
- The degree to which participants acquire the intended knowledge, skills, attitude, confidence and commitment based on their participation in the training
- The degree to which participants apply what they learned during training when they are back on the job
- The degree to which targeted outcomes occur as a result of the training and the support and accountability package
- Your comfort level
- How do you use this content?
- Personal learning
- Curriculum for a class
- Workshop
- Hackathon
- Evaluation of each section (ex. Pandas, NumPy)
- Scale of 1-5 on content? What do we want from this?
- Beginner to Advanced
- Seen before or brand new
- Satisfies what your were looking for → need more content
- Entire content sections (ex. Dask)
- More advanced or more introductory content for each section
- Would you be interested in being involved?
- Things to bring up in the EWG
- Confirmation on the stickers
- Blog post being added to the Pangeo Medium publication
- Foundations content
- Still need more advanced sections… e.g. expanding pandas
- Thoughts on first “cookbook” with the NY Mesonet? Generally station data?
- Might be best with ASOS data
2021-09-23: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / clyne
- Kevin Paul / NCAR / kmpaul
- Brian Rose / UAlbany / brian-rose
- Drew Camron / Unidata / dcamron
- Orhan Eroglu / NCAR / erogluorhan
- Julia Kent / NCAR / jukent
- Rob Ford / UAlbany / r-ford
- Max Grover / NCAR / mgrover1
- Anderson Banihirwe / NCAR / andersy005
- Ryan May / Unidata / dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Project Pythia stickers - JC
- Unidata may be able to purchase or help with purchase with discretionary funds; will look into it
- If clearly not an option for involved parties, what would we do?
- One more backup option, JC will look into that...
- Volunteer to take notes today? - KP
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Fixed preview mechanism for foundations!
- Should we hire an intern? Two?
- Need project ideas by Oct 8!
- There is money in our (UCAR) award for internships
- Not sure if SIParCS internships for 2022 will be in-person or hybrid or virtual
- Might be a good idea to push on the Pythia Platform (binder) or a project on vetting/developing educational content
- Set up a meeting for interested parties who want to write a project idea
- Invitees: BR / KP / JC / JK / KT? / MG
- Unidata has its own internship program that might be able to find help, too
- Material to help with EC hackathons:
- Intro to Jupyter - close to merge
- Setting up conda envs - ??? have something minimal in place now
- Git and GitHub - ??? sounds doable by end of Oct
- Can we have this material in place by the end of October?
- Could we also provide training?
- Virtual
- Time commitment is how long it takes to walk through our content
- Audience: hackathon participants…? geoscience
- Is there a risk of our material being too AOS focused?
- Do we need to find solid earth, etc. folks to join us?
- May happen prior to hackathon (not sure about timeline)
- Some of this is fixed, it looks like
- CODEOWNERS file needs better refinement:
- Need the teams to be fixed
- Need better refinement in files
- Move to Netlify? Can build the environment on GitHub Actions and then push to Netlify (guarantees that the build env is the same for deployment and preview)
- No objections. Let’s do it.
- AB: It’s a complicated process, but it should be transferable from foundations to portal. Most of the process is documented.
Action items
- DC: Will look into paying for stickers
- JC: Set up a meeting for SIParCS project discussion
- RM: Fix CODEOWNERS file for portal
- AB: Transfer Netlify previewing to portal
2021-09-16: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Deborah Khider / USC ISI / @khider
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR /@clyne
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- We are still exploring options for paying for stickers - JC
- Can EarthCube buy them directly?
- Council for Funded Projects (CFP) Proposal for EarthCube governance funds to develop EarthCube-related notebook-construction workshops/hackathons - JC
- Volunteer to take notes today? - JK
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Other forms of Contribution that aren’t strictly technical content/infrastructure
- Test users for materials and giving feedback
- Ambassadors (teachers using in coursework, sharing on social media, plug material if you found it helpful)
- Feedback on first Medium blog post (MG)
- Link to draft
- Sorry I can’t attend today - but any feedback would be appreciated! Feel free to leave comments in the doc here, or reach out (mgrover@ucar.edu)
- Where to direct people as a “newsletter”
- “Get Involved” Flyer should be consistent with site and blog, just a screenshot PDF with QR code
- is it “external”, or should it also feature Pythia-owned content? (currently doesn’t)
- What are our broader design goals for this gallery (to guide infrastructure dev)
- Related to https://github.com/ProjectPythia/pythia-foundations/discussions/118
- Comprehensive (w our material) or External (sends people away) - BR
- Make finding our content as easy as possible! No matter where they are looking. Put our own content first on landing page, and again in the gallery - KP
- “External Gallery” was just because we didn’t have content yet, but now we do have some that we want accessible through the gallery - KT
- 100% agree, but distinguish what is a Project Pythia project-- - JC
- Consensus is that yes we should include it in some way
- Add a fourth tag “pythia or not”, also include other institutions to be fair (filter by Unidata)
- Add 2D search with script - IWG issue
- Update contributing guide, add content to links gallery once it is done.
- Looking for feedback
- IWG issue with a dataset that is too large to host on GitHub
- BR and RM to give general feedback and data file problems
- RF: Thoughts on layout of the notebooks?
- Content from Anderson’s Xarray tutorials
- One is a short example on using Xarray - separate from other template
- Belongs in an example notebook gallery?
- Merge this is, and create a later issue for moving some content into example notebook gallery
- Planning on getting back to this next week
- MG not here right now
- Awaiting one more review (DC will get to this today)
- JC - Nice to have an accompanying video to go along with it
- DC - video not at risk of going out of synch
- DK - why `cells`and other sticky points on teaching Jupyter
- BR - add video to the landing page of Jupyter, let people know that it is related but slightly different.
- DC - fan of pulling out pieces of video
- Make older content match template, awaiting one review
- Portal, Held up by ReadtheDocs build (possible fix w IWG PR154)
- Pip will only read `requirements.txt` file, other solutions in the future but apply this band-aid of PR154 and worry about refactoring later.
- Portal, Held up by ReadtheDocs build (possible fix w IWG PR154)
- Issues Related to a PR: -- Safe to Skip?
- Closing with PR131, above
- Mentioned in PR107, above
- Mentioned in Draft PR62, above
- Mentioned in PR147, above
- Related to Discussion 118, below
- Potential Duplicate/Related Issues:
- Portal,
- Related to #97, above
- Portal
- Duplicate of #63, above - close this one?
- Review High Priority Issues
- Unprioritized/Unassigned Issues
- Community submitted issue requesting plotting beyond Matplotlib
- JC - we already plan on doing this
- Explain to the user’s why we teach matplotlib instead of other libraries, flesh out 3rd sentence a little bit more
- Change title to covering other plotting libraries
- Keep issue open
- Review/Assign Medium Priority Issues -- Skip?
- Low Priority Issues -- Skip?
Action items
- ALL: feedback to Max on draft of blog post
- JK: Add affiliation tags (Unidata and Pythia) and add to filter drop-down
- Once there is a page to link to, link to it from landing page
2021-09-14: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- Worked with John to iterate on graphic to use for stickers (need url there)

- Thoughts?
- Anything branding related, media and marketing group would need to approve at UAlbany… UCAR got audited, no go so far…
- Going to see if Unidata could order!
- Putting together doc for this today - will send it onto ya’ll for feedback
- Status on adding Twitter link to Pythia Portal?
- Still an open issue
- Julia taking a quick look tomorrow, Max can help too!
- Everything is scheduled for the next two weeks
- Good to get the October content scheduled
- Drew is scheduling foundations content
- Kevin will be scheduling content for October by our next meeting (Sep 28)
- Go ahead and schedule engagement on Fridays - Max
- Metpy Mondays
- Max retweeting from accounts like this rest of the month of Sep
- Kevin helping with this in October
- Contributors to the foundations content, infrastructure
- University communities
- Various conferences
- Hackathons
- Workshops
- Two talks on Pythia
- Have some sort of “git involved” flyer
- Page on how to be a part of this
- Adding content (learning Python)
- Using/testing it as a learning resource
- “Pythia ambassadors”
- Teachers using this
- People utilizing at workshops (ex. OceanHackWeek, Pangeo)
- Attention grabbing way of getting stickers
- Flyer - QR code to get involved
Ex. 
- “Advent of code” exercise during the conference
- Finishing up the medium blog post (first one)
- Schedule rest of the tweets for September and early October
- Drew - bring up idea of section on the main site - how to get involved
- Focus more on providing feedback
2021-09-09: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne /NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Deborah Khider/USC ISI/@khider
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / Qerogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Can someone volunteer to take notes during this meeting? - JK
- Reminder: Earth Cube call for participation in Virtual Data Help Desk at GSA 2021
- To volunteer, please complete this form, as soon as possible by Friday October 1st, 2021.
- Stickers: we may have a problem. Purchase may not be allowable under NCAR’s contract.
- KT: Maybe UAlbany could handle the purchase? Possibility (~$150)
- JC attending NSF EC call for notebook hackathons brainstorming meeting later today. Stay tuned
- AMS abstracts status ?
- KT has submitted to the Education Conference’s “Teaching Meteorology Online: a retrospective and forward look” session
- JC submitted an overall update on Project Pythia, Drew taking the lead?
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Biweekly update standing item to match biweekly meetings?
- Weekly opportunity to pass ideas on to OWG
- Review open issues and PRs
- .py files are turned into links, can work around with ``
- Style guide - should paths always be in code style?
- Remove linkify - should be intentional on code writers part
- Awaiting approval from KP as chair of IWG
- Pydata sphinx to be releasing their own fix, then we can revert our own manual workaround.
- OE will keep eyes out for this
- Low Priority
- Nightly builds, things were broken and failing every night but nobody saw it. How to alert IWG?
- AB offered to set something up, give him the go ahead.
- Medium Priority
- It pulls from the organization level (ProjectPythia), not from the repository
- Remove link from foundations site, leave on main portal site.
- Low Priority
- Keep separate environments in each repository so that each repository can be as simple as possible, head off build issues
- Environment just for users to download for running notebooks without having to build the sites. (developer vs user environment)
- Hard to test and maintain 2 environments
- Wait to see if this becomes an issue/is needed, get feedback from users
- Cartopy, URL for shape files changed, so if you don’t have cached files and have to update environment
- Low Priority
- Different behavior or appearance for internal links (one color, change the page) vs external links (different color, open in a new tab)
- Low Priority
- When PRs are submitted to the foundations book, get a render of the book so people don’t have to build the book themselves.
- AB made this work. BR would like to learn from this behavior and see it documented
- Medium Priority
- CodeOWNERS - kicked in PR
- What to do with videos?
- Community contributions here
- Ipykernel in display_name, Ipykernel is benign
- KT to move forward on this
2021-09-02: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Max Grover / NCAR / @mgrover1
- Kevin Tyle / ktyle / @ktyle
- Deborah Khider/USC ISI/@khider
- Ryan May / Unidata / @dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Can someone volunteer to take notes during this meeting?
- KP: Has anyone seen https://quantecon.org?
- Have concept of landing page down, more than one book has been in discussion
- This other group has published book at large scale, with theme
- Cheat sheets
- Lots of books
- Good organization/naming convention
- This looks like the organizational structure we want...
- KP: I will be on vacation next week. Can anyone take point on the IWG meeting next Thursday?
- Who wants to help? :)
- John signed up!
- Accept one pagers and demos
- Event: Oct 11 - 13
- To volunteer, please complete this form, as soon as possible by Friday October 1st, 2021.
- Info about GSA
- Complete that form, good way to connect with people about our project!
- MG: Sticker request submitted
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- Met on Tuesday
- Schedule our tweets
- Working on first Medium blog post
- Might be good to include on the portal itself too
- Reaching out to Earthcube once we get the Medium blog post good to go
- Ways to gather feedback on the content
- Make sure to send people who want to test the content our way!
- Kevin is getting structured feedback from the class
- AMS abstract submission. Draft is here. (now due Sept. 8)
- Submitting to building communities in Python session
- Get feedback by 5 PM MT on Tuesday
- Presenting author?
- Brian planning on travelling for committee obligations
- Kevin planning on submitting one for Education
- Drew mentioned he’s willing to present
- Developing a Jupyterbook version of the online class!
- Parallel of what’s on Blackboard + Pythia Project
- Link that is setup is what anyone can view
- Borrowing heavily from Pythia theme
- Schedule
- Student feedback - starts week of Sep 13
- Git related for now…
- Intent is to use Pythia Matplotlib notebooks
- Design way to get feedback from students
- Through October - parallel of Pythia content
- Carrot + stick with class participation grade to make students take providing feedback seriously
- Make this openly available beyond UAlbany
- Would host on cloud based platform at some point
- Currently integrating git in part of the curriculum
- Slightly modified version of the Software Carpentry git tutorials
- Would be nice to integrate Software Carpentry tutorials/lessons into Jupyter Book format
- Trying to leverage sphinx-exercise to implement the Q/A aspects of the Carpentries
- Q: What’s licensing for Carpentries material?
- A: CC4, so they are willing to share as long as attribution is given
- Assess how much of Carpentries material could fit into the Foundations Book
- MG: What is the plan in terms of Dask in the Foundations? Is it okay to include?
- Intro to Dask + Distributed computing
- Thinking section on Xarray + Dask (what we have currently in PR)
- Section on Dataframes with some sort of observations (Ex. METARs?)
- Compute some sort of climatology?
- Even if you are just working on your laptop. Lazy evaluation, multi-core use...and that’s separate from Dask Distributed
- Is there a “too complex” line that we shouldn’t cross for “new students”?
- Things that go in Foundations are useful across disciplines
- Don’t really think that complexity should be a concern
- Do we need a heading in our Foundations for “advanced topics”?
- It needs to be somewhere, but where to put it is unclear.
- See a basic role for a “what is dask” and “why use dask” in the basic material, but more advanced “how to use dask” might be somewhere else
- Use it as an opportunity to teach “lazy computation”
- There is space for more advanced Dask usage in the Foundations book. Not sure what the best path is, but Dask is so useful that it's worth noting.
- Might be hard to figure out what that path is without a working example of parallel computation in our material (examples, book pages, etc.); e.g., using Pangeo BinderHub with Dask
- Think there is a need for better documentation and better examples of Dask for parallelism
- JC: NSF EC call for notebook hackathons - Sept. 3. please complete this brief interest form by September 3rd.
- There’s funding here, but the amount is unclear
- Maybe there’s an opportunity to have a hackathon on how to use the material
- Could we provide to other people a hack space on how to write Jupyter Books?
- JC will fill out the interest form and then we can finalize our ideas later
- AR/VR focus...not discipline specific, but could have overlap with geoscience
- Is there any existing material online that we can add to our gallery?
- Not Python-based applications
- Seems out of scope for Pythia
- MG: Can we set up a prototype for “external data cookbooks” (ex. Satellite, CESM, etc.)
- Or would this be better served through blog posts with pre-requisites/links back to the Foundations content?
- [didn’t get to this]
Action items
- Max - submit form that John mentioned
- JC - will fill out NSF EC notebook hackathon interest form. Done
- ALL: review AMS abstract, make suggestions
2021-08-31: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
Agenda (with notes from the meeting):
- Need to obtain the high quality svg, work with John to order this ASAP
- Action item - chat with John about getting this ordered (Max)
- Julia making sure we are advertising tutorial series
- Contact Nicole about creating an intro/outro slide/image for this
- Action item - reach out to Nicole about intro image/slide (Drew)
- Action time - reach out to Earthcube about advertising social media (Drew)
- Kevin teaching it in his class
- Determining who else?
- Spinning up blog post series
- Relevant application of resources for example (e.g. Ida viz)
- Consider “community office hours” by Pythia; bring up with pangeo and others?
- Reach out to PyAOS mailing list
- How to structure engagement
- Unidata governing bodies
- Tribal colleges - Jeff Weber helping with this
- Figure out what we want from feedback
- What we want to find out
- How to reach out
- Who to reach out to
2021-08-26: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
- Rob Ford / UAlbany / @r-ford
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Stickers :)
- How many would we want to order?
- 2”x2” sticker prices (stickermule)
- 100 - $73
- 300 - $115
- 500 - $151 *** Seems like the right amount!
- AMS and AGU as opportunities to share/give
Agenda (with notes from the meeting):
- Stickers
- Reaching out to NSF EC for promotion in newsletter
- Top post (twitter) has had 6000 impressions and 55 link-clicks (~1%)
- Working on Jupyter content (moving from presentation to notebook)
- Goal to have PR by next week
- Working on Google Analytics issues
- Using the default JBook theme, extending pydata-sphinx-theme, which is where the problem lies (PR is currently failing tests)
- Manually added a custom template to book
- Will work with Kevin to figure out how to fix
- Maybe through a draft PR (KP: +1)
- Working on Xarray content
- trying to move datasets into Pythia and using them
- Started some Dask content
- Porting from Anderson’s tutorials
- What do we need to push these through?
- Need to push Pythia datasets through before moving forward with Xarray/Dask
- No new PRs in foundations repo are getting reviewers automatically
- Put team names in the CODEOWNERS file
- Can we determine a team for review based on tag?
- Tags are associated with issues, not PRs
- We might need to visit open PRs to select reviews
- Video content from Xdev and GeoCAT
- NEED to get that into Pythia
- Make videos easy to find
- Can we move the notebook-specific tutorials into Pythia?
- Can we add embedded video for each tutorial to Notebook template?
- Videos walk through the notebooks (sometimes multiple notebooks), and so the videos should be “alongside” the notebooks
- MetPy Mondays stopped sharing notebooks because the videos are static and the software is constantly changing
- Notebook “health checks”?
- Many newer tutorials have content that is based on the new notebook template, but older tutorials are not based on template and need to be “ported”
- Maybe we have a “Pythia Tutorial Collection” or Gallery
- Acknowledge that these might have overlap with the Foundations Book, but separate content...make it easier for people to contribute to Pythia without needing to meet too high of standards
- Maybe less cohesive
- Maybe a blog post instead?
- Maybe need to pull down videos if they are no longer valid
- Don’t underestimate the value of tutorial videos that are even old
- Collect material, even if its old (and maybe don’t put the old stuff first)
- Pythia Platform (binder) can provide some solution
- Can freeze environment to match video environment to keep notebooks working with videos
Action items
- Max: Get 500 stickers for AMS and AGU conferences
- Ryan: Create a CODEOWNERS file
- Julia: Gather the notebooks from the tutorials for Pythia
2021-08-19: Education Working Group
Meeting link:
- On gather.town for today’s Pythia hackathon
- See calendar for gather.town link
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Kevin Paul / NCAR / @kmpaul
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
- Rob Ford / UAlbany / @r-ford
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Hackathon in progress!
Agenda (with notes from the meeting):
- Working on advanced xarray notebook
- Issue
- Intro to Jupyter - overview of parts
- Kevin Paul’s tutorial on Jupyter - JupyterLab has enough content
- Jupyterhub at some point?
- Sample of a subsection file
- KP: Is this asking for a tutorial content on how to use the JupyterHub? Or is this asking for tutorial content on how to deploy a JupyterHub? JK: both? This seems like a later goal - having a separate spin-up document for every method of deploying Jupyter. JupyterHub was just an example. KP: Deployment is too advanced, link off to it (Jupyter and Pangeo both have content on deployment and extending deployment to dask etc)
- Working on “how to use the book”, at some point working on video of this
- Sharing a google document with the content - easier than markdown + git
- Make sure to assign yourself when you are working on something
- Working with Anissa on Matplotlib expansion
- Will add additions to Cartopy later
- PR on main graphic - plan is to work on this next week
- Issue for matplotlib expanded content
- Working with matplotlib addition
- Issue with Google Analytics - overlapping experience with GeoCAT project
- If no one in the group that has started working on this, willing to work on this :)
- If you use pydata theme, uses template to place this snippet in configuration
- Putting this in the wrong place currently…
- Could move the code snippet from body to header?
- Just got here - plan for the afternoon - tackle the cartopy notebook Michaela put together
- Here to help with cartopy related questions and issues :)
- Back from vacation, moved into the new building!
- Planning on integrating this content into classes
- First day of class is coming up on Monday
Action items
- Tag up time at the end? Debrief?
- Come back to the table at 2:45 PM MT
- High priority item not assigned
- Intro to Github - this would be good to get someone to work on
- Kevin Tyle is developing some of this content for his fall semester class. KT and BR will work on integrating this into Foundations.
2021-08-12: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Brian Rose / UAlbany / @brian-rose
- Rob Ford / UAlbany / @r-ford
- Max Grover / NCAR / @mgrover1
- Orhan Eroglu / NCAR / @erogluorhan
- Drew Camron / Unidata / @dcamron
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Assistance available from EC office with outreach (8/12 email). Forward to OWG?
- Annual report approved!
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T):
- Resource gallery - Wednesday
- Foundations - Thursday
- Retweeting MetPy Mondays on Mondays :)
- Engagement challenges?
- Challenge people looking at our feed - call to action
- Ex. how did you get started with Python in the geosciences?
- Google Analytics on Foundations Book
- Active tag but no recorded activity
- Post issue on JBook repo?
- Next Pythia Hackathon day (MG)
- Best date - Thursday August 19
- Perhaps from 9-3 PM MT? Seems to be the best time for most people
- Still hold the EWG meeting - tag up time!
- Come and go as it works for you
- #136 - Add twitter link to portal
- #125 - Multiple gallery filters
- #116 - Rewrite and update the “about” content
- - How to include Xdev Tutorial content to Portal
- - Migrating theme to Jupyter Book
- #97 - intro jupyter content; connect with JMTE existing content
- #98 - add xarray content adapted from xdev tutorial seminars
- #99 - intro github and external github resources
- #100 - add zarr data format content
- GitHub content for Xdev Tutorial is here:
- Jupyter content from Xdev Tutorial is here:
- Infrastructure Deliverables Review:
- allow externally-managed platform deployments (e.g., Binder and BinderHub) to link their content directly into the portal
- External Resources Gallery
- allow collection of metrics
- provide communication channels with community
- email: projectpythia@ucar.edu
- twitter: @Project_Pythia
- GitHub Discussions
- GitHub Issues/PRs
- Weekly Meetings open to public
- support user contributions
- GitHub PRs
- Resource Gallery submission button/form
- allow launching content on dynamic, customizable, interactive platforms deployed in the Cloud, where users can modify, tweak, and explore content
- runtime environments for each interactive contribution to the portal can be provided and maintained by the contributors themselves, rather than requiring that the portal hosts maintain the various runtime environments needed by each external contribution.
- Users will be able to deploy their own Pythia Platforms and add new “Binderized” Notebooks
- Define what the Platform is and how to “package” it?
- Improve Binder launch performance
- Can we invest money into our own BinderHub?
- Can we maintain Docker images for our content?
- Facilitate the development of an HPC-capable version (“Binder for HPC”)
- “Binderize” all of the first pass of content for the Pythia Portal
- External Resources include Binderized content
- JupyterBook satisfies Binderizable content
- Maintaining content (i.e., Notebook) “health”
- How do we perform health performance checks on all content?
- JupyterBook content is already using CI for nightly builds
- Health monitoring of External Resources might be harder
- Investigation and implementation of an “automated Binder” solution, where the step of “binding” the notebooks to their dependencies can be done automatically.
- AMS abstract submission (Sept 17 deadline)
Action items
- KP: Issue on JBook site about Google Analytics - DONE
- Apparently this is a known issue:
- KP: Add Hackathon label to Portal issues (above) - DONE
- KP: Add Drive links to appropriate GitHub Hackathon issues
- KP: Look into issues for unchecked Portal options and Platform bullets
2021-08-06: Outreach Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
Agenda (with notes from the meeting):
- One piece per week on Thursdays
- One resource per week on Wednesdays
- One month of posts scheduled, posts planned for over a year
- Will brainstorm next session
- Add Twitter link to Youtube About Section - Done!
- Add Twitter link to Portal
- Issue added to the portal repo
- Brainstorm for next meeting
2021-08-05: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Max Grover / NCAR / @mgrover1
- Ryan May / Unidata / @dopplershift
- Nicole Corbin / Unidata / @ncorbin-id
- Rob Ford / UAlbany / @r-ford
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- John would like to submit first year report this Friday.
- Need person-months-worked from PIs for all of their staff
- Pay attention to yellow highlighted-areas
- Abstracts submitted to AGU sessions IN030 and NS013
- IN030 - Open-Source Software, Notebooks and FAIR Software
- Submitted: “Project Pythia: A Resource to Help Geoscientists Navigate the Scientific Python Ecosystem”
- NS013 - Open-source software for near-surface geophysics and its applications
- Submitted: “Pangeo and Project Pythia: Helping Geoscientists Navigate the Scientific Python Ecosystem” (invited)
- FYI JupyterLab Real Time Collaboration officially launched as part of JupyterLab 3.1+
- The Albany Pythia team is in the middle of our big move, things are chaotic. KT is unavailable today.
Agenda (with notes from the meeting):
- OWG report (Max/Drew/Julia/Kevin T)
- First tweet!
- @Project_Pythia
- Medium blog created
- Next tasks to links resources together
- Tweet-storm to be scheduled Friday
- Next Pythia Hackathon day -- status?
- Review of open issues and PRs: set hackathon goals
- #97 - intro jupyter content; connect with JMTE existing content
- #98 - add xarray content adapted from xdev tutorial seminars
- #99 - intro github and external github resources
- #100 - add zarr data format content
- GitHub content for Xdev Tutorial is here:
- Jupyter content from Xdev Tutorial is here:
- AMS abstract submission (Sept 17 deadline)? - tabled for next meeting
Action items
- BR: contribute to annual report ASAP
- MG: schedule hackathon day.
2021-07-30: Outreach Working Group (Initial Meeting!)
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda (with notes from the meeting):
- Initial meeting: Define short-to-medium term goals and accompanying tasks of the Outreach Working Group:
- Twitter (short-term goal)
- @Project_Pythia as Twitter ID (@ProjectPythia is already taken); password has been decided
- If @ProjectPythia is willing to give up their handle, we can rename our Twitter to it later (Drew will send a DM)
- Establish list of authorized “Tweeters”: to start, include the OWG members
- Email address for Twitter account (projectpythia@ucar.edu)
- Logo - emailed to meeting attendees!
- Nicole has discussed initial tweets (and blog posts) with Drew to get things started
- Active engagement through challenges, puzzles, scavenger hunts, content searches, polls, etc.
Educating geoscientists in the Scientific Python Ecosystem and Cloud Computing for big data analysis. Sponsored by @NSF in collaboration with @UCAR @UALBANY
A community educational resource for the Python Geoscience Software and Data Ecosystem.
- Add twitter favicon to foundations and portal?
- Get portal “about us” updated with photos
- News tab on main projectpythia.org page (medium-term goal)
- How should we integrate this into the main page?
- Blog posts / Medium (“medium”-term goal)
- Pangeo uses (or has used) Medium for longer articles; should we do the same? Is there a cost? - No cost, Julia will obtain a Medium URL
- Call to actions
- Find things in foundations book or wherever else
- Challenges - coding!
- Weekly skill tests?
- Challenges - contributors submitting work from conferences, etc.
- Place a discussion item to solicit ideas for challenges/polls/quizzes etc. and include a link to a Google Doc that will contain these ideas
- Videos (Pythia Thursdays?): add intros/outros a la MetPy Monday
- Outreach efforts (short-term goal)
- Discussion topic open on the main Project Pythia GitHub page. Please add feedback and additional groups!
2021-07-29: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / John Clyne
- Max Grover / NCAR / @mgrover1
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Orhan Eroglu / NCAR / @erogluorhan
- Anderson Banihirwe / NCAR / @andersy005
- Rob Ford / UAlbany / @r-ford
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- PIs: please have your input on our NSF annual report in by Friday, Aug 6, 2021
Agenda (with notes from the meeting):
2021-07-22: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Rob Ford / UAlbany / @r-ford
- Nicole Corbin / Unidata / @ncorbin-id
- Julia Kent / NCAR / @jukent
- Deborah Khider/USC ISI/@khider
- Max Grover / NCAR / @mgrover1
- Alea Kootz / NCAR / @pilotchute
Quick Updates:
Agenda (with notes from the meeting):
- Deborah Khider (USC) on the PaleoCube project (new EarthCube grant) and interactions with Pythia
- Bring user workflows into the cloud in the Paleoclimate / paleoceanography space
- JupyterHub maintained by 2i2c to allow people access to analysis-ready paleo data and climate model data
- Hoping to intersect with Pythia on
- Infrastructure (Binder platform)
- Education and training
- Was not enough for students who didn’t have previous python
- Conference submissions related to Pythia:
- AMS: submission deadlines moved to 9/1/2021
- Submit something to “Teaching, Training, Outreach, and Building Communities around Python” session in the AMS Python symposium?
- Anything else?
- AGU: deadline is 8/4/2021 (< 2 weeks from now)
- KP: I am definitely going to this; I have asked for guidance on how to craft this presentation to this audience; Will share abstract as it is developed
- KP will ping Lindsey Heagy about submitting a general Pythia talk to IN030.
- KP will open a github discussion thread about this and put together a draft abstract
- Outreach WG: anything to report? (this will be a standing agenda item going forward)
- Initial discussion requesting organizations to target for outreach has been posted on main project Github
- Initial ad-hoc meeting should be scheduled to get things rolling (KT and MG co-chairs)
- Group members :) (also reach out to Drew C, not here today but might be interested)
- Kevin
- Julia
- Max
- Possibly Drew?
- Meeting at some point later next week: please indicate your availability at this When Is Good link
- Coordination with Jupyter Meets the Earth team on “Getting Started with Jupyter” material -- any updates?
- Open issues / PRs discussion:
- Content sprint soon via GatherTown?
Action items
- Will ping Lindsey Heagy about submitting a general Pythia talk to IN030 (AGU meeting)
- Will open a github discussion thread about this and put together a draft abstract
- Will bring up porting JMTE Jupyter Intro material to Project Pythia
- Reserve substantial time at next EWG for a triage of open issues and PRs so we develop action plan for the next Pythia hackathon day
- Try to schedule a next Pythia hackathon day sometimes after next EWG meeting on 8/5/2021
- KT and MG: set up the ad-hoc first meeting for OWG
2021-07-15: Quarterly Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Matt Long / NCAR / @matt-long
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Kevin Paul / NCAR / @kmpaul
- Max Grover / NCAR / @mgrover
- Orhan Eroglu / NCAR / @erogluorhan
- Drew Camron / Unidata / @dcamron
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Year end report due 8/31. John will take the lead and would like to submit earlier to avoid delays with other awards.
- Project Raijin funded by Earth Cube. John and Orhan PIs. Will leverage Pythia for delivery of educational content.
- Matt is considering hiring a postdoc partially supported on Pythia funds; the postdoc would work on analyses of ocean oxygen distributions.
- See “Agenda/Notes” from last week’s Education meeting … many of the topics discussed are good fodder for today’s quarterly meeting
- SciPy 2021 Conference is in progress this week
- Pythia is highlighted as a resource in this week’s SnowEx Hackweek (UW-EScience Institute / NOAA)
- Brian’s Pangeo Showcase presentation on Pythia from 6/23/2021 (slides and video): https://doi.org/10.5281/zenodo.5037185
- Kevin Paul was invited to present about Pythia at the AGU session NS013 ("Open-source software for near-surface geophysics and its applications"); Could be a good opportunity to get feedback and expand into the non-AOS geosciences
- If anyone has “near-surface geophysics” examples (e.g., notebooks), I could use them for this presentation.
Agenda:
- A very brief “state of the state” presentation - john
- Pythia presentations for upcoming conferences
- AGU abstract deadline: Aug 4, 2021
- AMS abstract deadline: Aug 3, 2021
- Which, what, who?
- Possible discussion topics for today:
- Where do we go next?
- How do we maximize the impact of training videos produced by Julia et al?
- Is our process working? Are there things we should consider changing?
Notes
On outreach
- KT: Maybe we should have a PR working group to focus on outreach and community building efforts? Outreach Working Group (OWG)?
- BR: What would the scope/mission of such a group be? Maybe have a standing agenda item at the EWG to have an update from this committee. Possible activities: start a twitter feed. Etc. ping other communities such as scipy
- KT: need a place on web site for up to date news and have fresh content
- KP: community building should be a priority.
- DC: We need to make sure when in a place where we have enough material to publish frequently. Agree should have standing agenda item at EWG and IWG meetings to update from OWG.
- ML: critical for us to make more outreach efforts at NCAR (via ESDS).
- ML: OWG should have an explicit charge to reach out to groups to provide feedback. We should get ESDS group on board.
BR: is the bi-weekly meeting working schedule?
Action: KT and MG will move forward with get an OWG set up.
2021-07-08: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Tyle / UAlbany / @ktyle
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
- Rob Ford / UAlbany / @r-ford
- Nicole Corbin / Unidata / @ncorbin-id
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- UAlbany Atm/Env department building move on 3 August (https://www.albany.edu/about/our-campuses/etec ); prep for this has occupied a significant part of Kevin T’s time
- KevinP: I’m sorry, but I cannot make this meeting today. I haven’t been able to work on this for the last week; busy with other things.
- JC: Quarterly meeting next week , July 15, usual time and place
Agenda (with notes from the meeting):
- We have been on a midsummer pause while folks are vacationing … let’s focus on some big-picture issues today.
- What remains missing from Foundational content?
- Dask/big data would be a good last topic and perhaps a stepping stone for an “Advanced Topics” book
- Now that we have an initial set of Pythia-templated notebooks, time to add new ones!
- Need to plan for how best to link to existing notebooks, gallery items, etc. …
- Think about adding in-notebook exercises to existing and future notebooks … similar to the Carpentries’ model.
- Nicole will provide guidance here (follow Bloom’s taxonomy)
- For a prototype, KT will update the Cartopy notebook to incorporate Carpentries-esque problems and exercises
- Link to relevant Xdev videos (and videos from other sources) in Foundation chapters
- On the subject of videos, felt that this is a better topic for the Infrastructure group. How can we capitalize on them? Where is/are the best place(s) for them?
- Publicity/community engagement to enhance the project’s visibility
- Pythia twitter account
- News items (*must* be kept fresh!)
- Think about having a rotation of Pythia personnel so it doesn’t fall on just a couple of folks
- Open issues, discussions, and PRs
- D. Carmon responds to D. Irving’s discussion topic; will keep a package index in mind for later.
2021-06-10: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
- Ryan May / Unidata / @dopplershift
- Erin Robinson / Metadata Game Changers & Openscapes / @erinmr
- Max Grover / NCAR / @mgrover
- Julie Lowndes / Openscapes, NCEAS/UCSB / @jules32
- Kevin Tyle / UAlbany / @ktyle
- Nicole Corbin / Unidata / @ncorbin-id
- Alejandro Coca / The Alan Turing Institute / @acocac
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- New portal theme is live: https://projectpythia.org
- BR: Pangeo Showcase talk yesterday, lots of enthusiasm and good feedback
- BR vacation time: seeking volunteer to chair next EWG meeting 7/8/2021
Agenda:
- Should our Gallery have links to our Foundations tutorials?
- Yes. It’s good to have cross-referencing
- Is there some automation possible for this?
- Searching through content of a JupyterBook, lots of uses beyond Foundations book
- Open an Issue to look at this
- Openscapes: to fill gap in supporting teams transitions to open science
- Moving NASA data to the cloud
- Pythia Foundations material as a base layer for this curriculum
- Partnering with Carpentries for instructor training
- Needs: material on Xarray and Dask
- Hoping for two-way interaction: use Foundations material for learning, but also contribute back
- EarthData Cloud Cookbook https://nasa-openscapes.github.io/ could be volume 2, Pythia Foundations is volume 1.
- Last sentence of current landing page paragraph: good candidate for snappier intro.
- Foundations content -- next steps
- Carpentries collaboration (reach out to Damien Irving?)
- https://carpentrieslab.github.io/python-aos-lesson/
- https://carpentries-incubator.github.io/geospatial-python/
- One nice thing about the Carpentries’ lessons is that they include exercises in the form of interactive coding, or, simply, thought questions. We should do the same.
- KT will reach out to Damien Irving, invite participation
- Think about sizes of lesson, so material can be taught in Carpentries-style workshop.
- Nicole Corbin specializes in assessment material, can help create material for Foundations
- Resources for developing Carpentries-style Lessons:
- Jupyter Meets the Earth collaboration (Fernando Pérez)
- Our Foundations “Getting Started with Jupyter” section is basically already written here: https://jupytearth.org/
- Fernando has been discussing with Kevin Paul moving this material over to Project Pythia.
- Material specifically designed for onboarding of geoscientists, excellent fit with Pythia
- KP needs to talk with Fernando and team about how this material will get maintained and where it will be housed (coupled questions)
- Material on data access, e.g. intake
- Where does dask fit in?
- Xdev is developing dask tutorials, too. We can link to that content.
Action items
- Open Issue about automating adding links to Foundations (BR)
- Kevin Tyle will reach out to Damien Irving
- Draft a PR with updated tagline for landing page + updates to About page (BR)
- Kevin Tyle will chair next EWG meeting
2021-06-17: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / kmpaul
- Brian Rose / UAlbany / brian-rose
- Drew Camron / Unidata / dcamron
- Julia Kent / NCAR / jukent
- Julia Clyne / NCAR / clyne
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Pythia Quarterly Meeting move to July 15, 11am MT
- BR will give a Pythia talk at Pangeo showcase next Wednesday 6/23
Agenda:
- EarthCube annual meeting report out - John
- About 50 people at the poster session
- Steady stream of people coming to poster, ranging from people who knew about the project and those who didn’t
- Questions about how to contribute
- Questions about programming languages AND spoken languages
- Interest from faculty in how to use this material for classes
- New Python Tools in the Atmospheric and Oceanographic Sciences
- Python in Operations and Research to Operations
- Teaching, Training, Outreach, and Building Communities around Python
- Visualization and Data Discovery Using Python
- Working with Large Datasets Using Python
- Ready for PR?
- Get the bugs worked out and get it deployed!!!
- Should the Foundations Book be “integrated” with the Portal? Or should it look/feel like a different resource?
- Maybe the minimum bar is just a nice landing page
- Discussion on triaging issues - John
- Let’s discuss open issues and PR in future meetings
- Triage issues and PRs
- Standing agenda item
Action items
- KP: Will get PR for new landing page out
2021-06-14: EC Final Prep.
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Did we break the external links filter?
Agenda:
- Need to verify info in #97 is consistent with reality
- #45 - portal landing page
- #71 - contributors guide
- #35 - external data repos
- #33, #36, #37, #38, #39 - Unidata drop in PRs
- Do we need more reviewers,someone to take ownership of PRs or issues?
- Review assignments on portal - round robin enabled?
2021-06-10: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Kevin Paul / NCAR / @kmpaul
- Max Grover / NCAR / @mgrover1
- Nicole Corbin / Unidata / @ncorbin-id
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- 8 PRs merged in the last 7 days, including several during yesterday’s hackathon
- NSF annual report due by June 30 (I think). Unfortunate timing with EC annual meeting. John will lead, but we’ll need help.
- Our poster will be presented at EC annual meeting, June 16, 3pm to 4:30pm, EDT
- Quarterly meeting calendaring fiasco fixed. Next meeting June 24. May combine with standing bi-weekly meeting. Thoughts?
Agenda:
- June 16: our first “public” release, or a WIP? How do we prioritize? Use of GitHub “labels” and “milestones”
- First public release, Wednesday June 16. What should be in it?
- Should be good for next week (milestone)
- Notebook will be reviewed and merged (milestone)
- Need material on numpy.datetime64, cftime, links to timeseries processing in pandas and xarray?
- Yes we will develop more advanced material (but not by next week)
- Standard language added to page inviting contributions for work in progress?
- Look at this blog post: https://ncar.github.io/xdev/posts/time/
- Should be flagged for review by end of today (milestone)
- Will also need more advanced material
- Ready for review now, should be good for next week (milestone)
- Anderson has additional notebooks that fit the template, just used in workshop
- Can get submitted as new PR
- Max will reach out today
- Pandas and xarray notebooks current use data files
- Minimal instructions in “How to Contribute”
- Verbose and clear error message?
- “Foundational material” still mostly blank:
- Getting started with Jupyter
- KP gave a tutorial on this
- Getting started with GitHub
- KP gave a tutorial on this
- Can we identify any existing material that can be ported in?
- This is both a demo and an instructional document, but the document hasn’t been written in a way to share (e.g., a notebook)
- This more of a demo, less an instructional document
- And I can provide slides...
- Thoughts on contributors guide(s) (time allowing)
- External links may be our biggest source of contributions from the outside. How do we make that as easy as possible (and broadcast it’s availability)?
- Contributors' information is still scattered and incomplete. Suggest having repo’s CONTRIBUTING.md guide address mechanics of contributing (clone, build, view), and jupyter ‘chapters’ address more about what a contributor might do (e.g. add an external link, write their own notebook, etc.)
- WIP: a PR template for adding new links to external links gallery
- JK will reach out to Anderson, assess where we are with this.
- Material on “How to create a Python package” (time allowing)
- Xdev training videos - how do we get those incorporated?
Action items
- Respond to review requests ASAP!
2021-06-03: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- NSF Annual Report is “due NOW”. John will circulate a draft, but not NOW :-)
- Our EC Annual Meeting poster has been uploaded. Thanks all!
- Thanks to everyone who joined in the successful first Pythia Hackathon last week:
- Next Hackathon Wednesday June 9, 11 am - 3 pm Eastern / 9 am - 1 pm Mountain
- Cover Art for Foundations book:
- Automating Round-Robin Reviewers:
- This has been enabled on the Foundations repository as a trial.
- If it works well, I can enable it on all other repositories.
- See discussion here. There is some ambiguity about where this is enabled and how it works.
- Brian: we should revisit this topic after we have some experience with it
- I have an example website for the landing page. Ran into problems with some features of bootstrap not working properly with Sphinx (or the other sphinx extensions), and in order to get it in a working state, I had to drop Sphinx. So it is currently just pure HTML/CSS/JS. I don’t think that is what we want to deploy in the long run, as it’s very hard for others to contribute without breaking the site.
- Sorry I didn’t have time to deploy something for you to take a look at for this meeting.
- What is the status of the data repo?
Agenda:
- Review open infrastructure PRs and issues
Action items:
- JC will attempt to identify (or open as needed) issues for components we hope to have in place for EarthCube annual meeting (june 16). One new item that came up today is a mechanism to facilitate external contributions to external links gallery. Possibly could be done via a GitHub issue template (or google form). The former would facilitate automation.
- All: revisit ‘round robin’ PR review assignment in a couple of weeks and make sure everyone is good with it.
- KP?: document how round robin assignment works (if can’t be completely automated). This will be particularly needed for external contributions.
2021-05-27: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Nicole Corbin / Unidata / @ncorbin-id
- A. Kootz
Quick Updates:
(we will transition from zoom to Gather once the formal EWG meeting wraps up)
- 2nd Pythia hackathon Wednesday June 9, 11 am - 3 pm EDT
- Deadline for EarthCube Annual Meeting (June 15-17) is this Friday (5/28). Register here.
Agenda:
- John: Our Project Pythia poster draft is available here. It’s in powerpoint format so there is a separate document for comments here. We can also does now. The poster will be uploaded tomorrow at noon MDT.
- Introduce the foundations target audience and material overview with Nicole
- Who is the target audience
- What do we want them to find/do when they arrive at the site?
- How will we get the word out / build the audience?
- Develop action plan for the Hackathon today
- Foundations book open PRs:
- Notebook template
- Sphinx + Jupyter Book Theme:
- Some Pythia Quarterly Meetings on weekends and Fridays?
- Saturday, Sept 25, 2021?
- Saturday, June 22, 2022?
- Friday, June 25, 2021?
- Friday, March 25, 2022?
- These must be calendar glitches, We’ll get this fixed.
Action items
- Merge JK’s open PRs (BR and JK)
- Build the template from xarray notebook (DC, KT, MG)
- Deploy new portal theme, get feedback (KP)
- Apply same theme to foundations (KP)
- Contributors Guide (JC)
- Debrief meeting 4:40 pm EDT / 2:40 pm MDT in gather.town
2021-05-20: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / kmpaul
- John clyne / NCAR / clyne
- Brian Rose / UAlbany / brian-rose
- Julia Kent / NCAR / jukent
- Ryan May / Unidata / dopplershift
- Orhan Eroglu / NCAR / erogluorhan
- Max Grover / NCAR / mgrover1
- Kevin Tyle / UAlbany / ktyle
- A. Kootz / NCAR / pilotchute
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- 1st Pythia Hackathon scheduled for Thursday May 27, 11 am - 3 pm Mountain / 1 pm - 5 pm Eastern
Agenda:
- Session 1: Next Thursday at this meeting time (and then for 3 hours after)
- Session 2? Wednesday, June 9 (9am-1pm mountain; 11am-3pm eastern)
- Contingent on success of 1st hackathon session
- Gather could be a good tool for running live meeting
- I feel like this is ready to merge?
- KP: Will re-review PR
- KP: Looking at the theme now; needs some new design considerations to deal with layout (very different expectations when dealing with Jupyter Books)
- KP: Not sure when I can get to this, though.
- Can we use the theme on the portal while continuing to develop the foundation element?
- Reduce confusion by unifying/harmonizing PR review approach across all repositories
- GitHub automerge?
- BR: In favor of an Agile approach; merge things early and fix later; very little risk associated with early merging
- JC: reasonable
- JC: Hope to have draft of poster before next week’s meeting
Action items
2021-05-13: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jkent
- Max Grover / NCAR / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Kevin T’s cartopy PR was merged, but it does not appear on https://projectpythia.org/. Shouldn’t it? - jc
Agenda:
- Content-focussed discussion of open PRs
- Prerequisites and audience
- Combine questions and objectives
- Objectives mapped to numbered sections?
- Second-level heading ## are reserved for major content sections to appear in navbar
- Use ### for subheadings
- Is it working for us? Are we using it to full potential?
- Needs review from infrastructure team
- (jc) Earth Cube annual meeting poster is due May 28. Presentation is June 16 and 17. What content can we (should we) target having in place for this event? Possible considerations:
- Unidata drop-in notebooks?
- Getting started with (python, jupyter, github)?
- What else?
- 3-4 hours
- Maybe schedule 2 different days
- Gather could be a good tool for running live meeting
Action items
- Put together notebook template (DC)
- Write down any more feedback on xarray notebook (EVERYONE)
- 2nd draft of xarray notebook incorporating feedback (DC)
- Tag infrastructure team for Filtered Galleries review (BR)
- Schedule Pythia hackathon days (BR)
2021-05-06: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / kmpaul
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Kevin Tyle / UAlbany / @ktyle
- Max Grover / NCAR / @mgrover1
- Anderson Banihirwe / NCAR / @andersy005
- Brian Rose / UAlbany / @brian-rose
Quick Updates:
- Registration for annual meeting (june 16, 17) opens next week (free)
- Findable, Accessible, Interoperable, and Reusable (FAIR) website live
- EC ending. What comes next for CFP?
Agenda:
- Status of the new landing page
- Status of external links gallery
- How do we link reviewNB to individual PRs?
- Other repos to add reviewNB?
- Can add a bot to create a link in the PR to reviewNB
- (Kevin T) How best to collaborate on text-centric content?
- (jc) Earth Cube annual meeting poster is due May 28. Presentation is June 16 and 17. What infrastructure do we hope to have in place for this event?
- Gallery
- New Theme
- Contributor’s Guide
- Add description of how to use HackMD and reviewNB with PRs
- Make multiple guides consistent across all repos
- Kevin T will spearhead a lightning talk
- Need to sync up new Xdev Tutorial Seminar content with the External Links Gallery
- Might need to modify content so it can be presented on the gallery appropriately
- Max and Drew to do this for their Pandas tutorial
Action items:
- EVERYONE: Try HackMD on Julia’s new PR
- KEVINP: Investigate issues in new theme demo
- EVERYONE: Look over the Contributor’s Guide for necessary changes before EC meeting
2021-04-29: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jkent
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Drew Camron / Unidata / @dcamron
- Max Grover / NCAR / @mgrover1
- A. Kootz / NCAR / @pilotchute
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Clyne: Our poster abstract for the NSF EC annual meeting was accepted. Poster must be uploaded by May 28. More to come.
- Unidata instructional designer (Nicole Corbin) interested in contributing, currently has a time conflict
Agenda:
- Open discussion on this content: scope, organization, look and feel
- Each notebook should have some similar elements
- Use more color cues / alerts to highlight problems, significant issues, etc.
- Built in some deliberate failures to show common errors
- Use Jupyter cell tags to flag a cell as “expected failure”
- Create a template notebook that includes common elements
- Get one exemplar notebook into the form we want, then strip out the content.
- Which one will we work on?
- DateTime and Xarray both suggested
- We choose Xarray!
- Need to add some content on Zarr to data formats
- Content derives from Unidata workshop material, usually delivered live in classroom. How to adapt to solo, asynchronous delivery?
- Curate videos to accompany notebooks?
- Instructional designer may have good thoughts about this
- Pair notebook narration video with binderized content
- How to handle versioning? (video ideally always presents current notebook)
- How to use interactive content?
- Best to separate “demonstrative” content (read only) from “interactive” material (exercises to work through)
- An entirely separate “Exercises” section of the Foundations book, following the “Scientific Python Packages” section? With links to exercises given in each content notebook
- Infrastructure for reviewing content
- ReviewNB now enabled for this repository, is it helpful?
- We had a brief demonstration from KT, people find it useful and will experiment with it.
- Identify what needs doing before merging these
- (we didn’t actually get to this today)
Action items
- Exhaustive review of Xarray notebook, to become our exemplar for a template (EVERYONE)
- Reviews of all other content always welcome
2021-04-21: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Julia Kent / NCAR / @jukent
- Max Grover / NCAR / @mgrover1
- Brian Rose / UAlbany / @brian-rose
- Orhan Eroglu / NCAR / @erogluorhan
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
Agenda:
- Review open Infrastructure Issues and PRs
- #35, accessing hosted data files
- Project-pythia
- We reviewed open infrastructure issues and PRs and managed to close a couple and make some progress moving others forward.
- Write up experience with linters in contributors guide
- KEVIN: Try ReviewNB on Project Pythia repositories
- No updates available.
- What’s our PR review assignment policy for content and infrastructure? Are reviewerers being assigned appropriately?
- We don’t really have a well-defined review policy. When PRs come in we want them reviewed expediently, and by the “right” people. It’s not clear how to accomplish this and perhaps we need to have more discussion.
Action items:
2021-04-15: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- Drew Camron / Unidata / dcamron
- Kevin Tyle / UAlbany / ktyle
- Julia Kent / NCAR / jukent
- Kevin Paul / NCAR / kmpaul
- Ryan May / Unidata / dopplershift
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- Unidata Python materials slotting into datetime, matplotlib, numpy, pandas, xarray for structure and initial content (DC) -- coming soon (copied in Action Items below -- BR)
- PR#32 “Why Python” Merged!
Agenda:
- ACTION: review and give feedback on these paragraphs
- Prefered feedback: implement suggested changes on github, rather than just send suggested changes to JK
- Foundations book: pushing forward the open PRs:
- John Clyne left some review notes, need to incorporate these
- Discussion on how to collaborate on notebook review / edits
- For now, just use GitHub comments and code review to get this forward
- Plan: merge by next week.
- Interesting discussion on GitHub about the value of having the button
- Should we have this button? Consensus is that benefits outweighs costs
- ACTION: merge this PR
- Feedback needed on descriptions on each card
- Divide and conquer: look at least at ones that you are author on
- Is it an improvement to the current table? Then merge ASAP.
- Project Jupyter is creating their own educational content together, we should link to that / contribute to it
Action items:
- Review and give feedback on JK’s Why Python (EVERYONE)
- Unidata Python materials slotting into datetime, matplotlib, numpy, pandas, xarray for structure and initial content (DC)
- Review descriptions on card on links gallery https://github.com/ProjectPythia/projectpythia.github.io/pull/51 (EVERYONE)
- Merge edit button (BR)
- Merge gallery after getting some feedback
- Merge Cartopy content after responding to JC’s comments (and any others that come in before next week)
2021-04-08: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / kmpaul
- John Clyne / NCAR / clyne
- Brian Rose / UAlbany / brian-rose
- Julia Kent / NCAR / jukent
- Drew Camron / Unidata / dcamron
- Ryan May / Unidata / dopplershift
- Max Grover / NCAR / mgrover1
- Kevin Tyle / UAlbany / ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
Agenda:
- “Cartopy” and the “Getting Started with Python” content
- EarthCube meeting abstract by April 13.
- Linting issues resolved?
- Need to add experiences with linters to Contributor guide
- Gallery Development Update:
- Click “The Book” on the top navigation bar to go to the traditional Sphinx layout page (e.g., the book)
- Click the logo to go back to the “One Pager” layout
- Test this theme out on a “real” Sphinx documentation site
- Test theme out with a Jupyter Book
- Requires knowing how to change the Sphinx theme from “Jupyter Book Theme” (semi-hard-coded)
- Create a central configuration file for the navbars, so all sites can have a common navbar on top (i.e., uniform look and feel across all subdomains)
- Simplifying mechanics of reviewing jupyter books
Action items:
- Write up experience with linters in contributors guide
- KEVIN: Try ReviewNB on Project Pythia repositories
2021-04-01: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- Kevin Tyle / UAlbany / ktyle
- Kevin Paul / NCAR / kmpaul
- John Clyne / NCAR / clyne
- Ryan May / Unidata / dopplershift
- Drew Camron / Unidata / dcamron
- Julia Kent / NCAR / jukent
- Austin Kootz / NCAR
Quick Updates:
Agenda:
- Progress on the Foundations content:
- Getting started with Python (JK)
- Discussion point: getting useful feedback on new content
- Want to have real users / students looking at the material
- But don’t have this hold up the Pull Requests
- Take steps to invite people to review and make comments
- Invite comments through GitHub issues / GitHub discussion, and couple that to tutorials on using GitHub? (but don’t get lost in recursion)
- Probably premature to propose a full session or workshop. We can submit an abstract (or several!) to an educational session later this year.
Action items
- EVERYONE: look closely at Cartopy content and Getting Started with Python content, bring your feedback on content, look and feel, organization, etc.
- When we solve the GitHub collaboration and linting issues, we need to add to our contributors guides.
- Add contributor links / invitations to Foundations book to invite comment (BR)
- EVERYONE: feedback to JC on EarthCube meeting abstract
2021-03-25: Quarterly Meeting
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Max Grover / NCAR / @mgrover1
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- NSF EC Annual Meeting poster abstract (due April 15). Please provide feedback by April 13
- BR recruited a new grad student on Pythia grant, start date summer or fall 2021
Agenda:
- Skip today’s IWG meeting or shift to next week?
- Working group overview (10 minutes) - Brian and Kevin
- Education Working Group (Brian)
- Identified three “pillars” of Pythia content to be delivered through the portal: learning material (foundations); galleries of domain-specific content; curated links to external resources
- We’ve directed most effort at new “Foundations” Jupyterbook: https://foundations.projectpythia.org
- Vision for the Foundations book:
- self-consistent set of linked tutorials on core skills for geoscientists getting started with Python and open-source
- Common reference point for notebooks in the gallery
- Well-maintained go-to learning reference for the community
- We have a basic outline online, see link above
- Built with JupyterBook, very simple to change TOC, add content in .ipynb, .md, and/or .rst formats
- Infrastructure in place for automated builds of PRs
- First actual content in an open PR (Julia Kent, intro to Python material)
- Draft of a Pythia infographic: foundations, pillars/pathways, destination/outcome (Julia Kent lead) https://github.com/ProjectPythia/projectpythia.github.io/discussions/55
- Plan to use versions of this graphic as nav features in the book
- Get draft content into the book!
- Get the book linked to the main portal site
- Infrastructure Working Group (Kevin)
- Custom Sphinx Preview Action (for previews in PRs)
- Default repository setup files (automatically included in new repos)
- Enable Google Analytics on projectpythia.org sites
- Custom Sphinx Gallery Extension
- Enable searches of gallery and return galleries
- Current Notable Deficiencies:
- Lots of upfront infrastructure work on site deployment and customization, but nobody is a frontend developer
- Need javascript experience for custom extensions
- KP is the only one with this knowledge, and lacks time (thus the slow pace of some things)
- Review of upcoming milestones (5 minutes) - John
- Portal infrastructure deployment, May 2021
- Searchable
- organized by category and experience level
- allow externally-managed platform deployments (e.g., Binder and BinderHub) to link their content directly into the portal
- allow collection of metrics AND display of metrics
- provide communication channels with community
- support user contributions
- allow launching content on dynamic, customizable, interactive platforms deployed in the Cloud, where users can modify, tweak, and explore content
- Classroom activities, recurring
- Course content developed by UAlbany instructors will be maintained and publicly available via the Pythia Portal.
- Students will use the Pythia Platform for coursework.
- Students will also receive specific training on contributing back to the Pythia Portal
- Workshops/hackathons, recurring, starting in August
- semi- annual workshops/hackathons at NCAR, and AMS or AGU.
- How are we doing?
- How can we do better?
- Have a hackathon for content? Possibly after Albany semester ends, middle of May.
- Need to identify missing pieces of infrastructure.
- Benefit from having org level project board.
- Kevin T. has cartopy material to drop in
2021-03-18: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- John Clyne / NCAR / clyne
- Julia Kent / NCAR / jukent
- Ryan May / Unidata / dopplershift
- Max Grover / NCAR / mgrover1
- Drew Camron / Unidata / dcamron
Quick Updates:
Agenda:
- Central idea: people discover this material from different paths / directions.
- What’s the intent? Follow one of the pillars to the top of the graphic?
- Harmonizing this with the book content?
- Reuse the graphic (foundations, pillars, goal, direction) for different sections and scopes in the book
- Existing graphic maps best to “getting started with Python” section
- JK is separately working on adding some basic Python tutorial content:
https://60538558fab0d80d60ebb21c--pythia-foundations.netlify.app/foundations/basic-python.html
- Call For Notebooks, Posters and Working sessions - due on April 15, 2021
- Posters on EarthCube tools, science using those tools, educational outcomes using tools, integration of different tools, etc.
- Working sessions aka hackathons to “discuss particular questions, provide feedback on new technologies, or evaluate new frameworks for data policies, procedures, or workflows.”
- Notebooks: “One track will be focused on submissions where the goal is describing scientific outputs. In this case, the novelty is based on the analysis being done and the scientific conclusions drawn. The other is on demonstrating new software and tools. Here the emphasis is on new software and showing how it can be used. “
- $2000 funding opportunity for US grad students to submit new notebooks
- How should Project Pythia be represented at this meeting?
- A poster advertising our Foundations content. By June we will hopefully have something worth pointing users toward
- Use the poster deadline to spur along our development.
- Make use of infographic
- Aligns well with milestones in our EarthCube proposal: “release 1” by May 2021.
Action items
- Get some more content into the book:
- KT will put some Cartopy material
- JK is working on basic Python tutorial -- BR and others will work with her to push this forward
- BR will push forward on the new subject headings in Foundations. Get this merged ASAP.
- JC will put together a draft of EarthCube poster abstract
2021-03-11: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Brian Rose / UAlbany / @brian-rose
- Max Grover / NCAR / @mgrover1
- Julia Kent / NCAR / @jukent
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR has successfully recruited a new graduate student onto the project / grant. Introductions coming soon hopefully.
- Other?
Agenda
- Create repo labels for EWG or IWG topics (or both)?
- Automated labeling is useful, creating templates for “education” vs “infrastructure”
- Discussed at 2/11 meeting, but decided to “wait on this”. Why?
- External contributors won’t have permissions to assign labels automatically unless use template.
- ACTION: open issue to generate templates for education and content issues - jc.
- Review open PRs and Issues
- 2021 EarthCube Annual Meeting Scheduled for June 15-17 (virtual).
- Call For Notebooks, Posters and Working sessions - due on April 15, 2021
- Details here
- Discussion on possible submission ideas (topic, format (notebook, poster, session), who?)
- Discuss now or table for next meeting after everyone has time to review CFP?
- ACTION: Everyone please review CFP and will discuss next week at the EWG meeting.
Action items
- Give zoom co-host permission to all project leads - Julia Kent
- Everyone please review EC Annual meeting CFP ( here ) and will discuss next week at the EWG - EVERYONE
- Open issue for creating an issue template - jc
2021-03-04: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Max Grover / NCAR / @mgrover1
Quick Updates:
- Call For Notebooks, Posters and Working sessions - due on April 15, 2021
- Details here
- Working on the sphinx-pythia-theme to work with both the landing page (portal) and the book-style contents, but still a way to go
Agenda:
- Not much recent action on content. At the last EWG meeting 2/25/2021 we identified these priorities (copy and paste from those meeting notes):
- ACTION: reorganize list of topics to be more “user friendly” with more sub-headings
- Starting point: some simple examples of actual Python code, finishing with offering the reader some choices:
- Run python in Jupyter / on Binder
- Install Python on a laptop
- Run Python code in a terminal
- ACTION: Put together this “Python quickstart”
- Open discussion on “user friendly” headings
- Overview
- Basic Python Syntax
- Computing with Jupyter
- Markdown
- Using the conda package manager
- Version Control with git and GitHub
- Suggestions from potential users (via JK):
- Lots of enthusiasm for the project, having all this information gathered together in one place
- In addition to guides to different packages, some instructions or infographics that map out how different packages are interrelated
- E.g. https://pyaos.github.io/stack/
- Could have a good infographic in the Overview, and repeat it (with highlights) in each subsection
- ACTION: sketch out an infographic
- Separate subcommittee meeting to work on this
- JK will arrange a meeting on this
- Goal: two infographics:
- Whole toolkit
- Python stack
- The two graphics will be visually related (stack as a subset of whole toolkit)
- Volunteers: JK, DC, KT, MG
- BR suggestion for more user-friendly hierarchical list:
- Preamble / overview: Why are these tools important?
- Getting started with Python
- Quickstart: what is Python?
- How to run Python code
- Install Python on a laptop with the conda package manager
- Run Python in the cloud
- Getting started with Jupyter
- What is a Jupyter notebook
- Running notebooks locally vs remotely
- Formatted text in the notebook with Markdown
- Collaboration and version control
- What is git
- What is GitHub
- How to open a GitHub issue
- How to open a GitHub pull request
- Discussion on the 3rd major heading:
- Focus on the community, rather than specific tool
- Title: Getting started with the Python Community
- Don’t want to put any barriers in front of people
- Python / Jupyter are cool technologies, but they only scratch the surface of why you should adopt Python.
- ACTION: BR will rough out a Pull Request that reorganizes topics along these lines. We can discuss further through the PR
- ACTION: JK will put in PR with some Python quickstart material (repurposed from another source). JK and BR will iterate on this.
- ACTION: JC will put in PR to add the CC-BY license to foundations site.
- Any other business
2021-02-25: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / Kevin Tyle
- Brian Rose / UAlbany / @brian-rose
- Ryan May / Unidata / @dopplershift
- Orhan Eroglu / NCAR / @erogluorhan
- Julia Kent / NCAR / @jukent
- Anderson Banihirwe / NCAR / @andersy005
- John Clyne /NCAR / @clyne
Quick Updates:
Agenda:
- i.e., it is meant for only 1 input file (index.rst)
- i.e., it is meant for just the “landing page”
- i.e., it’s not really meant for a JupyterBook
- What new structure should the landing page have (index.rst):
- Pillar 1: Ad/link to Jupyter Foundations Book
- with dynamic preview
- Call to Action button!
- ACTION: A good blurb/ad for the book
- Pillar 2: Gallery of examples (subdivided into
- Dynamic view of content gallery
- WIP: Separating content from infrastructure
- Pillar 3: external links
- Julia demo
- ACTION: [Julia] Finish up the gallery; help with sphinx extension is appreciated
- The site structure will have:
- Currently, points to sections in the page (index.rst)
- More on Project Pythia
- About the Team
- How to contribute (& code of conduct)
- Open Meetings Info
- Link to discussion page
- Generalize the theme to work with JupyterBooks and other typical Sphinx documentation?
- “Foundations” Material Infrastructure
- Autobuilding (still needs to be done)
- No formal guide is needed at this point?
- Can wait on this
- BR: Let’s not forget the platform!
- KT: (just to have this documented so I don’t forget it) … what are best practices for an efficient Binder presence? I.e. when does a Python environment become “too big”?
2021-02-18: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / brian-rose
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Kevin Tyle / UAlbany / @ktyle
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR will meet with Julien Emile-Geay (USC) tomorrow (2/19/2021) 11am PST/2pm EST to discuss possible collaboration between Pythia and “PaleoCube” (new EC proposal), others welcome to join the call.
Agenda:
- Following discussion at the last IWG meeting, BR has put together a new JupyterBook to house our “foundations” material:
- Provide suggested paths through the material for users with various backgrounds
- Content should contain roadmaps, what do you need to know in order to use this section?
- First link should be obvious “how to get started with Python”
- Move Markdown into the Jupyter section?
- Just be clear in presentation / organization of foundational material
- Avoid scaring people off with too many links with cryptic names that we insist they need to know about it off the bat
- ACTION: reorganize list of topics to be more “user friendly” with more sub-headings
- Starting point: some simple examples of actual Python code, finishing with offering the reader some choices:
- Run python in Jupyter / on Binder
- Install Python on a laptop
- Run Python code in a terminal
- ACTION: Put together this “Python quickstart”
- Name and location / URL of this Foundations content
- ACTION: open discussion thread
- Licensing the site, or the teaching material, or both?
- ACTION: BR put in the Apache License, same as main pythia repo
- ACTION: JC check with UCAR legal team about details
- What are the infrastructure needs to help this move forward?
- ACTION: tag KP on this infrastructure request
- Contributors Guide, focusing on how to author content for the JupyterBook? (would that help the team in the short term?)
- Repository ownership, reviews and merge procedures
- Top priorities for getting content into the book? Next steps?
- Any other business
2021-02-11: Infrastructure Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John clyne / NCAR / @clyne
- Brian Rose / UAlbany / brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Anderson Banihirwe / NCAR / @andersy005
- Julia Kent / NCAR / @jukent
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- KP: Started work on a new Sphinx Theme for the portal. Very early stages.
- John and Kevin meeting with NCAR graphics designer next week to get ideas for landing page.
Agenda:
- What are immediate priorities for IWG?
- Are there things that need to be done in the short time to support the EWG?
- Need to have guidance on how to contribute foundational tutorial content
- BR has a first draft “in the works”; an outline of topics that we will cover with (maybe) one bare-bones example of tutorial content
- Use Jupyter Book site (separate repo) to display the content; use main portal page as a landing page to “advertise” content in the Jupyter Book
- Use a gallery repo for examples (later)
- Does it make sense to keep meeting bi-weekly?
- Create repo labels for EWG or IWG topics (or both)?
- Automated labeling is useful, creating templates for “education” vs “infrastructure”
- Wait on this
2021-02-04: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplersh
- Kevn Paul / NCAR / @kmpaul
- Drew Camron / Unidata / @dcamron
- John Clyne / NCAR / @clyne
- Max Grover / Illinois/NCAR / @mgrover1
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- BR has made an offer to a promising prospective PhD student funded on the Pythia EarthCube grant. Fingers crossed!
Agenda:
- https://github.com/ProjectPythia/projectpythia.github.io/issues/42
- KPaul has raised this, since we have one single meeting time but two parallel sets of meeting notes, this may cause confusion for new participants
- Going forward we’re going to merge notes unless there’s new objections
- Action items? (Looks like KPaul has already merged previous notes, need to update links in public calendar, and merge today’s notes.)
- KP will merge notes after this meeting and update Calendar and website
- Interest in standardizing tutorial material
- Including basic shell / command line
- Categorizing content based on data type (e.g. raster vs. vector)
- Basic concepts of cloud computing
- Put placeholders / roadmap on the public portal?
- ACTION: put a draft PR together (BR)
- Getting started on building out the foundations
- Open discussion
- Organizing principles of the foundational material:
- “Beating google”
- Cohesiveness and order like a textbook
- Consistent style
- Use EWG meeting to collaborate on design / scope
- Two or three design guidelines
- What do we expect from the user? How much time required per notebook?
- Discussion on style guidelines:
- Short digestible units that are well organized into larger pieces?
- What will each unit look like?
- Here’s what this lesson will cover, here’s a list of skills you need in order to understand this lesson, here are links to those skills
- ACTION: open a new discussion thread about this (BR)
2020-01-28: Infrastructure Working Group
Meeting link: https://ucar-edu.zoom.us/j/91375487587 (See calendar event for password.)
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- John clyne / NCAR/ @clyne
- Brian Rose / NCAR / @brian-rose
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Let Julia know if you want more uploaded to our YouTube channel
- Xdev tutorials up there too
- We have our first Dash repo data set
Agenda:
- Now own projectpythia.org!
- Handling PRs...
- Do we need to make any tweaks to process?
- More specific CODEOWNERS?
- How is the Contributor’s Guide looking? What’s missing?
- NO ACTION
- GitHub Discussions are enabled
- Link to discussions in the “Talk to us!” page on the portal
- NO ACTION
- Package examples as educational tools?
- e.g., a cookie-cutter template for a basic Python package
- Need to put effort into doing it properly; cookie-cutter repos can be confusing for other people; needs a thorough set of tutorials on the components (CI/CD, etc.)
- Points us to maybe include education content on “advanced GitHub” (e.g., GitHub actions), CI services like ReadTheDocs
- ACTION: Add content about CI, how to create a good python package
- Structuring “external resources” in a way that makes them more useful to the community. Currently we simply have a giant list with little organization. Can we make use of user reviews, tagging content, other?
- Perhaps jukent can say something about this…
- User ratings, tagging system (keywords), content recommendations?
- Categorizing links is a necessity first
- Add column in “Lessons and Courses” section describing the contents of the link
- Add description column in the Packages section
- We should be reaching out to other groups and getting involvement early
- Get real content onto this site first
- Need tutorial material for foundational skills
- We need to both reach out to gather content and develop better organization/presentation of the content simultaneously
- We can sort the content in different ways first (i.e., have links organized by X, Y, etc.)
- We need to separate infrastructure from content,
- We currently have a page of external links in a markdown page that is structured in an “ad hoc” way
- Should move link into into a YAML file and build the markdown page from the YAML file content
- Add the markdown build steps in conf.py
- ACTION: Divide infrastructure from content in links page(s)
- Look and feel of landing page
- Maybe we can develop a new Sphinx theme that is aesthetically pleasing, but that is in the eye of the beholder
- Submenus in the nav bar can be toggled on, other content that we don’t want can be easily removed
- Can easily restructure the site to show the gallery first
- ACTION: Create issue for discussion how to change the portal look and feel
- My concern is that to the “untrained eye” we look like we have 1 weekly meeting (same time each week, same Zoom link) but 2 different notes documents. I’m not sure people will know which notes document to open (to see the agenda, etc.)
2021-01-21: Education Working Group
Meeting link:
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Julia Kent / NCAR / @jukent
- Rich Signell / USGS / @rsignell-usgs
- Clyne / NCAR/ @clyne
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Please sign in on this page above (and add any quick updates)
- We now have the domain projectpythia.org for our portal
- The portal now has a nicely organized page of external links: https://projectpythia.org/pages/links.html
- We have a new YouTube channel for Project Pythia content
- John Clyne’s AMS presentation now on YouTube and linked from our About page https://projectpythia.org/pages/about.html
- Kevin Tyle attended yesterday’s Pangeo check-in call and gave a brief update about Pythia. Several attendees expressed interest in attending one or both of the Pythia infrastructure and education working groups.
Agenda:
- How do people find what they want? Titles alone may not be enough.
- We have no control over external content, can change rapidly, better not to try to overly manage.
- Going forward, we will host and maintain our own content.
- Can we have a system of keywords or tags to make external content more easily findable?
- Search capability will need some building on infrastructure side.
- Moving from collecting links to content creation / curation
- Let’s put effort into our native content rather than more organizing of external links
- Discussion on the role of cloud computing in Project Pythia
- Desire: content that links together existing resources on particular topic (e.g. accessing ERA5 data)
- User-contributed curricula? E.g. which resources did I use to learn xxx
- Can we identify good starting points / gaps in existing content?
- Does it make sense to start with foundational skills (e.g. git, github, Jupyter), with goal of broadening participation?
- Much content exists but scattered
- What is our list of foundational content, that we will cover comprehensively?
- Cover these skills with multiple formats:
- Videos
- Self-paced tutorials
- Documentation
- Foundational material should NOT be external links, but specific resources hosted and maintained by Pythia
- We already have solid foundational material owned by people in this group.
- Avoid duplication… but the more of this content we can wrap into Pythia, the better
- Brainstorm on foundational skills: what’s on the list?
- Plain Python in a terminal, already have content from KPaul’s team and others
- Jupyter notebooks vs Jupyter lab…
- Natural connection to Jupyter Meets the Earth, KPaul will reach out
- Lab is probably the future, but plain notebook may be simpler starting point
- Arguments either way about which is a better starting point
- Lab makes it easy to spin up a terminal for basics
- Preferred method for setting up environment for using these tools
- But are we going to support this at all?
- Valuable to the community, but completely outside of the notebooks
- A lot of value, a lot of headache, an odd fit.
- Best practices for creating conda environments.
- Also outside of notebooks
- Software Carpentries material is good and up to date
- Essential for entraining new users to contribute back
- Damien Irving’s Carpentries course material will be basis for upcoming AMS short course
- Good material in the Pangeo tutorials
- Teach tool stack from bottom up, vs. show how to use high-level tools first?
- BrianR will open a new thread on discussion page to continue the discussion about foundational tools
2020-01-14: Infrastructure Working Group
Meeting link: https://ucar-edu.zoom.us/j/91375487587 (See calendar event for password.)
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / John Clyne
- Brian Rose / UAlbany / @brian-rose
- Kevin Tyle / UAlbany / @ktyle
- Orhan Eroglu / NCAR / @erogluorhan
- Drew Camron / Unidata / @dcamron
- Ryan May / Unidata / @dopplershift
- Julia Kent / NCAR / @jukent
- Anderson Banihirwe / NCAR / @andersy005
Quick Updates:
- Q: What about getting a domain? projectpythia.org
- Pangeo moving to seminar model for weekly meetings; should get on list
- SciPy 2021 will be fully virtual, again (keep an eye on when to submit)
- Advertise via lightning talk?
Agenda:
- Person submits PR
- Submitter starts PR in draft mode (indicating it is work in progress)
- When switched to non-draft, Code Owners are automatically triggered for review
- Code Owners can add more reviewers if necessary
- When 2+ reviewers approve, and there are no standing suggestions, code owner can merge
- Thursdays @ 1PM Eastern (11AM Mountain)
- Alternating with Education Working Group
- Will use recurring Zoom meeting from now on
- Do we need an open discussion platform/repository?
- Option 1: Create a “meta” repository just for discussion via GitHub issues
- Option 2: Create a Pythia Discourse site
- Option 3: GitHub Discussions added to portal repo
- Go with this option (Ryan did it!)
- KevinP will put in a PR to link to portal
- i.e. how to present content, make it searchable, etc.
- NOT what topics/titles/tags/examples to actually use!
2021-01-07: Education Working Group
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Drew Camron / Unidata / @dcamron
- Kevin Tyle / UAlbany / @ktyle
- Ryan May / Unidata / @dopplershift
- Julia Kent / UCAR / @jukent
Quick Updates:
- Please sign in on this page above.
- Regular bi-weekly WG meeting time is set:
- 11 am MT / 1 pm ET beginning 1/21/2021
- These meetings alternate with the IWG (same time, alternate weeks)
- Portal site now decked out with logo, Code of Conduct, and bare-bones Contributor’s Guide: https://projectpythia.github.io/
- February's NSF CFP meeting will feature introductory outreach by projects to potential users - John
Agenda:
- Finalize future meeting format (e.g. zoom vs google) and update all links
- KTyle (just completed a one-semester course):
- Interactive visualization -- lots of new tool development, students are interested.
- Existing Python ecosystem focussed on static viz, but older tools were more interactive
- Don’t ignore foundations (Pandas, xarray, etc), but we need to be on the leading edge! This will attract community.
- Ryan May: don’t want to be years out of date, but also don’t want to teach irrelevant skills. Hard to pick winners!
- Our pitch to NSF: we provide support for tools that look they have some longevity, help users make sense of the ecosystem
- BUT community needs have to be at the center.
- We should aim for a community-sustained set of educational material. We help put it together and link pieces. We keep our core curriculum updated to reflect the currently most popular tools.
- Key sustainability issue: core devs should be helpfully encouraging to community members who have issues and suggestions -- encourage participation!
- Getting a bigger ball rolling:
- Go from links in github issue to links on webpage
- But start somewhere with organizational principle
- Put links up in multiple tabs, organizing same content in different ways?
- These pages would be fragile, need to note to users that content and urls may change
- Can tag content for future searchable, dynamic galleries
- AMS is next week! Best to get something up immediately. Not many chances for big visibility to community.
- KPaul will find someone in IWG to get this started.
- Set up a Github discussion. Separate meta-repo for discussion and project management.
2020-12-21: Education Working Group
Attendees: [Name / Institution / GitHub Username]
- Brian Rose / UAlbany / @brian-rose
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Kevin Paul / NCAR / @kmpaul
Quick Updates:
- Logo: will be dropped in the drive soon.
Agenda:
- Meeting schedule / coordination with IWG
- State of the portal and content
- Categorize content by format?
- Categorize thematically? With mixture of different content types?
- Or both? Would need a more dynamic website.
- Begin with organizing by format, and move to thematic organization as content matures and specific audiences are identified?
- Some dynamic stuff is possible on static server with javascript
- But good to get something out quickly that is useful.
- Get content online before we get too bogged down with decisions about organization.
- Kevin Paul’s group has content to add
- Most of us own some content. Take a look at the full list, as the starting point. Version 0.1 of our content collection.
- Start listing in the github issue. Start seeding the sourdough!
- Links to existing content:
- Upcoming educational opportunities?
- AMS student conference: Unidata running a week-long workshop. Using existing training materials.
- Unidata will be hosting an AMS short course on at least MetPy, if not python in general. That can basically happen any time in 2021, so that might be able to be designed specifically alongside pythia
- EarthCube meetings: informal list of contacts and content among university faculty. How to proactively engage these community members?
- NCAR tutorial series continuing. At some point getting to “intermediate” level in tutorials, looking for external speakers, topic experts.
- We’re probably not ready to give a guide for contributing directly to the portal
- Can start with an informal invitation to contribute links, contribute to meetings.
- Move toward a step-by-step contributor’s guide
- Ask people to make sure content is clean and legally shareable.
- Will need a Code of Conduct that we can reference in the Contributor’s Guide.
- External links vs hosted content. Maybe we start with guide for external links.
- KP will put a Code of Conduct on site, open for iteration.
- BR will put in a PR to add a basic Contributor’s guide.
- Quarterly meeting tomorrow: hold or postpone?
2020-12-16: Infrastructure Working Group
Meeting link: https://meet.google.com/xov-wzro-imx
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Kevin Tyle / UAlbany / @ktyle
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @erogluorhan
Quick Updates:
- Thank you for accepting invitations!
- Needs a logo
- Questions about the current portal
- New logos received (will distribute)
Agenda:
- Zoom?
- Open to the public? How?
- But maybe we don’t need screen sharing all the time
- What time?
- GitHub discussions?
- Who will test? Drew will demo on MetPy
- Use Pangeo discourse for now; until conflicts (if any emerge)?
- Yes. Set up a team account.
- Blogs: Medium posts (Ask about posts in Pangeo Medium account)
- Open issues in the main repo
- Contribute via PRs
- TODO: Contributor’s Guide [Orhan]
- Curated content via markdown files, etc.
- Create an issue to discuss [Kevin]
- Example gallery via notebooks
- Want to organize content topically (not by format)
- So we need to figure out what kind of content we will need to organize and make sure the portal can present it!
- Need to be able to present embedded video, written text, notebook examples
- Can we present custom “solutions” to users?
- Can we do this with a static website?
- Need to get more information from Education/Content WG
- Gallery examples launchable via link to external binder hub (e.g., pangeo binder hub or mybinder.org)
- Uses free infrastructure...so slow
- Speed up binder start times
- Optimize existing Binder (e.g., custom Docker images, etc.)
- KT: I wonder if we can roll our own Binder using XSEDE/Jetstream2 once that gets spun up?
- KP: Yes!
- Pay for better service on cloud (how much?)
- Might require Pythia Portal login to keep track of identities
- But enable free/anonymous usage (i.e., non-parallel notebooks)
- Provide easy install options…
- Pythia “endorsed” conda meta-package?
- Docker images
- Versioning of “stack”? And CI testing using gallery notebooks?
2020-12-10: Education Working Group
Attendees: [Name / Institution / GitHub Username]
- John Clyne / NCAR / @clyne
- Kevin Paul / NCAR / @kmpaul
- Kevin Tyle / UAlbany / @ktyle
- Brian Rose / UAlbany / @brian-rose
- Drew Camron / Unidata / @dcamron
Quick Updates:
- AMS presentation. Friday, January 15, 8:10 - 8:15am (TZ?)
- Logo
- NSF EC Council of Funded projects meeting
Agenda:
- More than curating a list of links; need to present something cohesive (or useful)
- Should also develop a style of doing things
- Tying content to classes, and want students to be involved in developing content, too!
- Need to start by deciding what goes on the portal
- Should this be a “content” WG or an “education” WG
- How do we tie portal to University classes in a way that engages students, and has them contribute back?
- Jump starting portal development with content
- Move prototype portal to main github repository
- Collect and discuss content through github issues
- Should we advocate using JupyterLab rather than legacy notebook interface?
- Yes
- Lots of development of user-facing technology in JupyterLab extensions. Get users started with it from the beginning
- The distinction does not apply to static examples, only to deployed content e.g. through Binder.
- Choose JupyterLab for the deployment
- Need to develop a governance structure for decision-making about content. That will be key for drawing in a broader community
- Address questions such as what is “good” content
- But also need to get started
- Need to get a contributors guide going so the outside world (as well as ourselves) can contribute
- GitHub Organization? Maybe make this outside NCAR
- Once portal is up we will start by opening issues for links to content we’d like to see. Kevin Paul will get Portal set up.
- Separately start a discussion about organizing the portal.
2020-12-09: Infrastructure Working Group
Meeting link: https://ucar-edu.zoom.us/j/99099923041 (Password: y.nEg+11)
Attendees: [Name / Institution / GitHub Username]
- Kevin Paul / NCAR / @kmpaul
- John Clyne / NCAR / @clyne
- Julia Kent / NCAR / @jukent
- Drew Camron / Unidata / @dcamron
- Orhan Eroglu / NCAR / @
- Brian Rose / U Albany / @brian-rose
- Kevin Tyle / U Albany / @ktyle
- Emilio Mayorga / U Washington / @
- Aaron Kaplan / Columbia U / @aaron-kaplan
Quick Updates:
Agenda:
- Note that this time conflicts with the weekly Pangeo meeting time. We may have to change it.
- What is Project Pythia? [10 minutes]
- Pythia Portal
- Jekyll-based (i.e., ruby)
- Has a nice UCAR/NCAR modern theme...but do we want that?
- Deployable on GitHub Pages (static)
- Missing Gallery features and search features
- sphinx_panels
- myst_nb
- gallery_generator
- Content can be written at Jupyter Notebooks and/or markdown (no restructuredText needed)
- Has a Notebook Gallery with simple search capability (search entire notebook page) built in
- Searches content of all notebooks
- No search or display by tags, yet
- No gallery display of search results
- Deployable on GitHub Pages (static)
- Adding content to the page requires site rebuild
- Site rebuild can be automated with CI, but still requires a PR...not user friendly!
- Content Types and Handling:
- Example: Jupyter Notebooks
- Can be “local” (i.e., in the portal repo) or external (i.e., in another repo)
- Can be detected based on URL extension (ipynb) and then rendered to HTML
- Local or external
- Can be detected based on URL extension (py) and then rendered to HTML (or markdown)
- Tutorial/Documentation: External Sites
- External URLs (jupyter books, etc.)
- Cannot be detected automatically; requires manual curation (i.e., a little more work in the PR)
- Communication Forums: External Sites
- Should we have our own communication forum?
- I think we should leverage Pangeo forums and not split the community
- Should work like external sites
- Should all content types be displayed in the same way? i.e., via a Gallery? Or should only examples be displayed in the Gallery?
- Personally, I think all links to external sites should be manually curated with explanations for each...including the pros/cons of each item.
- Contributors Guide is needed
- Link to sphinx source is needed (I actually can’t find it right now - BR)
Infrastructure Working Group (IWG)
Oct 15, 2020
Agenda
- Kevin Paul
- Do as much planning/discussion in GitHub as possible
- Kevin: hold IWG meetings in open. Publish links and invite folks to attend.
- Announce in Pangeo Discourse
- Infrastructure components we need to address:
- Portal (immediate priority). Review of requirements (below)
- Jupyter Notebooks, Tutorials, etc.
- Kevin: What are going to use to host the web site? Can’t be static?
- Kevin: What are the first things we are going to put on the portal?
- Platform
- “Vanilla Binder” -- a la Pangeo Gallery
- Kevin: may need a dynamically rendered site. But we should push GitHub site as much as possible until we hit a wall.
- Ryan: search services may be possible within GitHub world. Concur that we start with GitHub until it breaks.
- Julia: examples of other portals supporting dynamics content? https://scikit-learn.org/stable/
- Kevin: need to flesh out requirements.
- Kevin, Ryan: move ahead with GitHub/Sphinx until it becomes a problem.
- Kevin till take point on fleshing out the existing web site, and presenting the draft to the group.
- Schedule regular IWG meetings
- Post announcement on Pangeo Discourse
- Develop agenda for first IWG public meeting
- Portal requirements (user stories, proposed requirements, etc.)
Summary of Portal requirements from proposal:
We will develop and deploy a searchable online portal of educational materials
• keyword searchable
• organized by category and experience level
• allow externally-managed platform deployments (e.g., Binder and BinderHub) to link their content directly into the portal
• allow collection of metrics AND display of metrics
• provide communication channels with community
• support user contributions [jc: both hosted content and external links]
• allow launching content on dynamic, customizable, interactive platforms deployed in the Cloud, where users can modify, tweak, and explore content
• the runtime environments for each interactive contribution to the portal can be provided and maintained by the contributors themselves, rather than requiring that the portal hosts maintain the various runtime environments needed by each external contribution.
Action items
- Schedule IWG meetings.
- Post announcements for IWG meeting (where: Pangeo Discourse)
- Flesh out the initial web site and present the draft to IWG.
- Assess alternatives to Github for web site
- Reach out to UCAR for a Pythia logo.
- Need to do a technical assessment for Portal: