1 of 32

Ready Your Laptops

Daily Reflections

We appreciate

your feedback!

Log into GitHub

Password and Two-factor auth

Check into #quantbio_2025

React “green sticky” and reply to thread

Start

Tools

Terminal, RStudio, VS Code

WiFi: carnegie / ciwwireless

2 of 32

Quantitative Biology Bootcamp/Lab I

2 September 2025

3 of 32

30,000 ft Overview

  • You’re in the right room
  • Spend next four days together
    • Gain comfort with Python, Unix, git, R
  • Prepare for fall Quantitative Biology Lab I
    • Reproducible pipelines/workflows, statistical analysis, computational algorithms
    • Research Project – Work on your own idea for the entire semester
  • Geared for biologists with zero computational experience
    • If you have lots of experience, we have some nuggets to explore, but please also help out your colleagues
  • We’re here to help
    • Can only help if you ask

4 of 32

Introductions

Rajiv

Mike

Fred

Lance

Sneha

Sadhana

5 of 32

Design by Rebekka Paisner

James Taylor (1979-2020)

6 of 32

What? Short Overview

Quantitative and computational methods are increasingly essential to all sub-disciplines of modern biological research. The goal of this intensive week-long “boot camp” is to empower students with the fundamental skills to apply these methods, as well as connect them to resources for further developing their knowledge and abilities.

The class starts at 9 am with formal instruction ending at 5 pm daily. The course demonstrates the importance of version control, documentation, testing, and other methods for enhancing reproducibility, reliability, and useability of software. This is achieved through live coding sessions and use of active learning exercises, where for the majority of the class, students perform data analysis to address biological questions and reinforce core bioinformatic concepts.

7 of 32

Why? Modern biology is data intensive

Large scale data acquisition has become easy,

e.g. high-throughput sequencing and imaging

Experiments are increasingly complex

Making sense of results often requires mining and making connections across multiple databases

Nearly all high-profile research involves some

quantitative methods

8 of 32

The methods section of a paper should provide sufficient detail to reproduce an experiment.

9 of 32

The methods section of a paper should provide sufficient detail to reproduce an experiment.

For computational research, you can go a step further: provide the raw data and the code.

10 of 32

The methods section of a paper should provide sufficient detail to reproduce an experiment.

For computational research, you can go a step further: provide the raw data and the code.

How has this worked in practice?

11 of 32

12 of 32

Spectrum of Reproducibility

13 of 32

Our Goals

Become comfortable using and writing software

to work with large-scale biological data.

Be able to reuse and reproduce analyses.

Understand the importance of tools and

practices like version control and apply them.

Get some initial exposure to key data analysis

problems and be prepared to approach a wide variety

of data analysis problems throughout the coming year.

14 of 32

Learning Objectives

  • Develop comfort working within a UNIX environment and at the command line to run programs
  • Develop confidence using Python and its extended ecosystem of tools for bioinformatic data analysis
  • Develop ability to use the R statistical programming language to conduct exploratory data analysis
  • Develop appreciation for and practice concise and transparent presentation of data
  • Develop good habits for ensuring reproducible research

15 of 32

Three Parts to Our Training Strategy

Bootcamp

Core Labs

Project Work

16 of 32

Quantitative Biology Bootcamp

Tuesday

Python Basics using Notebooks

Wednesday

Python FileIO + dicts, transitioning to Unix + Scripts

Thursday

git and practice

with Mini-Project #1 and #2

Friday

R for data analysis, plotting, and Bioconductor

17 of 32

Quantitative Biology Core Labs

Computational Algorithms

Genomic Workflows

Statistical Analyses

18 of 32

Quantitative Biology Project Work

Independent 🐣

Develop your own project e.g. reproduce a figure, compare two methods

Feedback 🏋️‍♀️

Open science on GitHub, Instructors and TAs,

Peer Review

Groups 👨‍👩‍👧‍👦

Self-assemble

into teams of 2

Deliverables ✅

Proposal,

Check-in #1 + #2,

Final Presentation

19 of 32

Example Projects from Quantitative Biology 2024

20 of 32

For Those with Prior Experience

  • Bootcamp assignments may not take the entire time
  • Get started on your semester long Project Work
    • Identify partner and create new repo with title and short description
    • Get feedback from Instructors and TAs
    • Draft proposal w/ background, inspirational image, dataset, method

Prior Experience of CMDB 2025 Cohort (n=24)

21 of 32

Bootcamp Logistics

22 of 32

23 of 32

Four-Day Agenda

Tue

Wed

Thu

Fri

Python: data types, control structures, etc.

Python: file i/o, dictionaries; Unix, Python scripts

Review; Git; Mini-project #1 and #2

R: tabular data analysis, plotting, Bioconductor

24 of 32

Daily Agenda

25 of 32

Attendance

Students should not participate in any other meetings, courses, or lab work throughout the week.

Please be on time for course activities, and budget extra time in anticipation of technical delays.

Please let us know about any emergencies, family responsibilities, illness, etc. that may prevent attendance, and we will work to accommodate reasonable requests.

Do not come to class if you are feeling sick. There will be no penalty for such absences, and we will work with you to make arrangements for catch-up work if you are too sick to participate.

26 of 32

For Biologists with No Python/Unix Experience

27 of 32

Guidelines and Expectations

  • No question is too simple
  • No question has been asked too many times
  • Raise your hand to ask a question to the instructors
  • Use the pink sticky to call over a TA for help
  • Instructors will occasionally check in during live coding:
    • Use a green sticky note if you’re following along and ready to move on
    • Use a yellow sticky note if you need more time
    • Use a pink sticky note if you need assistance
  • You can also send Slack messages in #quantbio_2025

28 of 32

Technology

29 of 32

Computational Tools

Terminal

VS Code

RStudio

30 of 32

Communication Tools

Slack

31 of 32

Documentation and help

32 of 32

Ready Your Laptops

Daily Reflections

We appreciate

your feedback!

Log into GitHub

Password and Two-factor auth

Check into #quantbio_2025

React “green sticky” and reply to thread

Start

Tools

Terminal, RStudio, VS Code

WiFi: carnegie / ciwwireless