Conclusion
What you learned in this class, and what you will learn in the future. A farewell.
Data 100/Data 200, Spring 2022 @ UC Berkeley
Josh Hug and Lisa Yan
1
LECTURE 26
Announcements
Final Exam Logistics�Friday, May 13th: 7:00 PM - 10:00 PM
Final Exam Review Sessions:
Tuesday (May 3rd): Pre-MT2 content� Thursday (May 5th): Post-MT2 content
3:30 PM - 5:00 PM�In Li Ka Shing 245 + on Lecture Zoom
Online Final Exam Accommodation Form (due Mon, May 2 11:59 PM)
Course evaluations extra credit (Ed post)
2
What did you learn in Data C100/C200?
Lecture 26, Data 100 Spring 2022
Logistics
What did you learn Data C100/C200?
What’s Next?
Join course staff!
3
What were we supposed to teach you?
4
Prepare
Enable
Empower
Prepare students for advanced Berkeley courses in data management, machine learning, and statistics, by providing the necessary foundation and context.
Enable students to start careers as data scientists by providing experience working with real-world data, tools, and techniques.
Empower students to apply computational and inferential thinking to address real-world problems.
The Data Science Lifecycle
5
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
The Data Science Lifecycle
6
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
The Data Science Lifecycle
7
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
The Data Science Lifecycle
8
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
Feature Engineering
The Data Science Lifecycle
9
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
Modeling
The Data Science Lifecycle
10
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Reports, Decisions, and Solutions
?
Prediction and
Inference
Modeling
What’s Next?
Lecture 26, Data 100 Spring 2022
Logistics
What did you learn Data C100/C200?
What’s Next?
Join course staff!
11
What Didn’t We Focus on in Data 100?
12
Classes to Take Next
Data Management:
Statistics:
13
Classes to Take Next
Machine Learning, Optimization, and Artificial Intelligence:
14
Real-world Applications
15
Question & Problem
Formulation
Data
Acquisition
Exploratory Data Analysis
Prediction and
Inference
Reports, Decisions, and Solutions
?
Real-world Applications
A great way to strengthen your knowledge of the ideas you learned in this class (and to build a portfolio to help you find jobs!) is to use your new skills to analyze real-world data.
Places to look for data and applications of data science:
16
Real-world Applications
Even though you’re new to the discipline, you are also among the most skilled people in the world at data science. Use your power wisely!
Let us know what you do next!
17
Spider-Man (2002), or 1st century BC [Wikipedia]
Join course staff!
Lecture 26, Data 100 Spring 2022
Logistics
What did you learn Data C100/C200?
What’s next?
Join course staff!
18
Data C100/C200 Spring 2022 Staff
This class would not have been possible without our GSIs, tutors, and academic interns.
Kunal Agarwal, Anirudhan Badrinath, Parth Baokar, Bella Crouch, Jay Feng, Francis Geng, Kanu Grover, Kelly Han, Neha Haq, Samantha Hing, Aaron Huang, Priyanka Kargupta, Andrew Lenz, Michelle Li, Wallace Lim, Yulei Lin, Dominic Liu, Lucy Liu, Vasanth Madhavan, Mrunali Manjrekar, Minh Phan, James Susilo, Arda Ulug, Zachary Wu, Xinqi Yu, Ayela Chughtai, Eric Hao, Alina Herri, Jenny Jiang, Neal Kothari, Emily Le, Shiangyi Lin, Rachel McCarty, Conan Minihan, Ishaan Mishra, Pragnay Nevatia, Yiming Ni, Siddhant Satapathy, Yike Wang, Nancy Xu, William Xu, Jacob Yim, Natalie Chan, Tingyue Cui, Ziyi Ding, Nei Fang, Floyd Fang, Mary Guo, Brandon Hong, Daniel Huang, Arya Krishnan, Arjun Kshirsagar, Tanish Kumar, Ariel Kuo, Angela Lin, Wesley Little, Ruchi Maheshwari, Kaiona Martinson, Staten Maughan, Saurabh Narain, Jeanice Santosa, Amber Shao, Heather Sizlo, Jenea Spinks, Verona Teo, Albert Tran, Lili Wang, Andrea Wang, Winnie Xiao, Jennifer Yang, Andrew Zhang, Michael Zhu
19
Helping out with Data 100
Teaching is a great way to deeply understand course material.
We need you!
https://data.berkeley.edu/joining-data-course-staff
Academic Intern applications to be released closer to the start of the Fall semester.
20
Thank you!
Data 100/Data 200, Spring 2022 @ UC Berkeley
Josh Hug and Lisa Yan
21
Bonus links - demos at the end
22