1 of 12

Welcome to Data Science Discovery!

Karle Flanagan

Wade Fagen-Ulmschneider

2 of 12

Introductions

No good party starts without introductions...

3 of 12

Introductions

Karle Flanagan (kflan)

Instructor of Statistics

College of Liberal Arts and Sciences

Wade Fagen-Ulmschneider (waf)

Teaching Asst. Prof. of Computer Science

College of Engineering

4 of 12

Introductions

5 of 12

Syllabus

Course Website: http://courses.las.illinois.edu/spring2019/stat107/�...or: Google “STAT 107 uiuc”�...or: http://go.illinois.edu/stat107

6 of 12

Introductions, Part II

Open up this Google Sheet on your phone/computer: https://docs.google.com/spreadsheets/d/15gPquAhVuQBb2csckZUo5FrZHQN69X0p8IgQ4DlYsCg/edit?usp=sharing

...link is also on the STAT 107 website.

7 of 12

What to expect this semester?

CRN,Course Subject,Course Number,Course Title,Course Section,Sched Type

41758,AAS,100,Intro Asian American Studies,AD1,DIS

47100,AAS,100,Intro Asian American Studies,AD2,DIS

47102,AAS,100,Intro Asian American Studies,AD3,DIS

51248,AAS,100,Intro Asian American Studies,AD4,DIS

51249,AAS,100,Intro Asian American Studies,AD5,DIS

8 of 12

What to expect this semester?

9 of 12

Data Science Tool: Jupyter

Interactive programming tool, optimized for programming data science-type questions.

Main benefit “jupyter notebooks” that allow us to program and document our process in the same interactive file:

10 of 12

Data Science Tool: Pandas

Python Data Analysis Library:

At its core, pandas provide Excel-like access to datasets. Can quickly become much more power than Excel, particularly for large datasets.

11 of 12

Data Science Knowledge: Statistics

How can we know if A is better than B?

How can we compare A with B?

How do we do this under uncertainty?

12 of 12

Experimental Design