1 of 25

Open Source Program Offices in Research Universities

CHPC National Conference 2022

”Democratisation of Cyber-Infrastructure for Sustainable Development”

December 2, 2022

Carlos Maltzahn, UC Santa Cruz

2 of 25

Carlos Maltzahn

Adjunct Professor, Step 5, �Computer Science & Engineering, UC Santa Cruz

Founder & Director, Open Source Program Office UC Santa Cruz (ospo.ucsc.edu)

Founder & Director, Center for Research in Open Source Software (cross.ucsc.edu)

Co-Founder & Director, UCSC Systems Research Laboratory (SRL)

1999-2004: Performance Engineer, Netapp

Advising 5 Ph.D. students

Graduated 9 Ph.D. students, 12 M.S. students

people.ucsc.edu/carlosm

Current Research:

  • Programmable Storage Systems (programmability.us)
  • Arrow-native Storage (bit.ly/skyhookdm)
  • Big Data Storage & Processing
  • Scalable Data Management
  • Distributed Systems Performance Management
  • Practical Reproducibility� (getpopper.io, shrinkwrap)

Past Research:

  • Team processes in repositories [IJICIS’92]
  • Network Intermediaries [SIGMETRICS’97]
  • Automatic Behavioral Modeling of HDDs [MSST’14]
  • Data Management Games [GamifIR’14]

2

3 of 25

Outline

The Value of �Open Source

What is an OSPO?

OSPO UC Santa Cruz

Amplifying Research Impact

Simplifying Reproducibility

Case study: Skyhook

4 of 25

Industry values organizations who know how to engage with open source

5 of 25

$100+ billion valuation of open source companies

  • Commercial Open Source Index (COSSI) tracks 50 companies
    • Would not exist without open source
    • Revenue of at least $100M/year
  • Total of $14 billion in VC funding raised
  • Total of 76,000+ employees
  • Total est. valuation: $237 billion
  • Total exit value so far: $90 billion

bit.ly/coss-index

6 of 25

Linux Foundation

  • $10 million new funding into OpenSSF
  • White House meets with OpenSSF representatives recently for 7 hours

7 of 25

84 companies and counting �critically depend on open source ecosystems�$13.4 trillion total market cap�

Source: Todogroup.org

8 of 25

Open Source Program Office

  • Track and manage relationships with open source ecosystem
  • OSPO umbrella organization: Todogroup.org

Source: Todogroup.org

9 of 25

But what about universities?

10 of 25

UC’s Mission

We teach

We do research

We provide public service

“UC disseminates research results and translates scientific discoveries into practical knowledge and technological innovations that benefit California and the nation.”

11 of 25

Six� universities to develop playbooks for OSPOs

12 of 25

ospo.ucsc.edu

13 of 25

STUDENT OPEN SOURCE TRAINING

VALUE OF OPEN SOURCE

BROAD INDUSTRY ENGAGEMENT

Common gaps in universities

14 of 25

Amplify research impact �via open source

Open Source Ecosystem

Research results

15 of 25

Source: NSF Pathways to Enable Open Source Ecosystems (POSE) Webinar

Skyhook

16 of 25

Simplify reproducibility in computational research

Reproducible research results

Research results

Repeto�U Chicago, UC Santa Cruz, NYU

17 of 25

Reproducible research results

Research results

18 of 25

Case Study: �Skyhook

Storage Object

Host

Storage Server

Read/Write

Query

  • Merged Ceph storage plugin with Apache Arrow
  • Published in CCGrid 2022

Efficient and composable scientific data management in storage and network layers

19 of 25

Case Study: �Skyhook

Storage Object

Host

Storage Server

Read/Write

Query

  • Working with Argonne
  • Leveraging Thallium
  • Use case: particle data

20 of 25

Case Study: �Skyhook

Storage Object

Host

Read/Write

Query

Computational Storage Device

  • Working with Seagate
  • Leveraging Kinetic
  • Use case: Human Cell Atlas

21 of 25

Case Study: �Skyhook

SmartNIC

Storage Object

Host

Read/Write

Query

  • Working with Sandia
  • Leveraging FAODEL
  • Use case: particle data

SmartNIC

SmartNIC

22 of 25

Case Study: �Skyhook

Source: NSF Pathways to Enable Open Source Ecosystems (POSE) Webinar

23 of 25

Case Study: �Skyhook

Graphic from Heath Arensen’s keynote at CROSS Symposium ’20

24 of 25

Case Study: �Skyhook

  • Scoping Skyhook community (NSF funded)
  • Raising funding for Research Software Engineering staff
    • Skyhook maintainer
    • Reproducibility artifact builder
  • Collaboration with Apache Arrow community
  • Industry funding via open source gift letters

25 of 25

Thank You!

ospo.ucsc.edu

acm-rep.github.io

Carlos Maltzahn carlosm@ucsc.edu

people.ucsc.edu/carlosm