INFOMDSS - Course Schedule
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
View only
 
 
ABCDEFGHI
1
infomdss2018
DateMonday C3Tuesday C4 WorkshopsTuesday C5 LecturesThursday C7 lecturesWeekly assignmentsWeekly readingsDevTest Lab
2
week 1 (37)2018-09-03N/AN/ARegular lecture (MS):
Course introduction
1. Book review:
- Submit Top 3 Books
1. Davenport & Patil (2012)
2. Stair & Reynolds (2012) - CH 1,3
3. Pritzker, P., and May, W. (2015) - CH 2, App. A
4. Chapman et al. (2000) - CH 1,2
5. Spruit & Lytras (2018)
N/A
3
week 22018-09-10Required attendance:
- Tutorial 1A: Azure VMs
- Tutorial 1B: Linux Bash
Regular lecture (MS):
Big Data Engineering with Hadoop
Regular lecture (MS):
Hadoop MapReduce and HDFS
1. Complete Tutorial 1:
- Bash in Ubuntu
2. Start Tutorial 2:
- Wordcount in Hadoop
- White (2015) - CH 1,2
- Dean & Ghemawat (2008)
- Get & Read your selected book to review
LabA: Ubuntu 18
1 core, 2 MB
4
week 32018-09-17Hands-on Tutorial 2:
Wordcount in Hadoop
Required:
Guest lecture (DV):
UMCU/Neonatology on Big Data for Small Babies
Regular lecture (MS):
Spark Architecture & Transformations
1. Complete Tutorial 2:
- Wordcount in Hadoop
2. Start Tutorial 3:
- Neonatology Part I
- White (2015) - CH 3,19
- Complete reading your selected book to review
LabB: Ubuntu 18 + Hadoop artifact
2 cores, 4 MB
5
week 42018-09-24Python
tutorial
& Q/A
1. Complete Tutorial 3:
- Neonatology Part I
2. Start Tutorial 3:
- Neonatology Part II
Guest lecture (SM): ORTEC on Big Data Knowledge DiscoveryWrap-up lecture (MS):
- Wide Transformations
- Walkthrough of Tutorial 3 in Hadoop & Spark
- Midterm Q/A
1. Submit Book 2-pager
2. Complete Tutorial 3:
- Neonatology Parts I,II
- Complete readings above
- Review tutorials to understand at the command level
- Review _all_ lecture slides
LabB: Ubuntu 18 + Hadoop artifact
2 cores, 4 MB
6
week 52018-10-01MIDTERM EXAMNO LABStudent pich session:
- The TOP-20 Data Science & Society books
Regular lecture (MB):
Methods & Statistics I
1. Complete Tutorial 4
- Neonatology in Spark
2. Start Tutorial 5
- Statistics in Jupyter
- Lazer et al. (2014, March 28)
- Broniatowski et al. (2014, July 28)

- Chambers & Zaharia (2018) - CH 1,2
LabA: Ubuntu 18 DSVM
2 cores, 4 MB
7
week 6 (42)2018-10-081. Complete Tutorial 5
- Statistics in Jupyter
Regular lecture (MB):
- Methods & Statistics II
Required:
Guest lecture (COM):
UMCU/Epidemiology on Menopause and
Cardiometabolic Disease Risk
1. Continue Tutorial 5
- Statistics in Jupyter
2. Start Tutorial 6
- Epidemiology Analytics in Jupyter Part I
- Chambers & Zaharia (2018) - CH 3LabA: Ubuntu 18 DSVM
2 cores, 4 MB
8
week 72018-10-151. Explain assignment Big Spatial Data
2. Complete Tutorial 6 Part I
- Epidemiology Analytics in Jupyter Part I
Regular lecture (MB+MS):
- Methods & Statistics Wrap-up
- SQL in Spark
Guest lecture (TB,JWvE):
ESRI NL on Big Data in Geographical Information Systems
1. Complete Tutorial 6 Part I
- Epidemiology Data Preprocessing
2. Start Tutorial 6 Part II
- Epidemiology Analytics
- Chambers & Zaharia (2018) - CH 10LabA: Ubuntu 18 DSVM
2 cores, 4 MB
9
week 82018-10-221. Complete Tutorial 6 Part II
- Epidemiology Analytics
Guest lecture (WO):
- CoreLifeAnalytics on Machine Learning in Cell Screening
Guest lecture (MM):
UMCU/Julius on Big Data Ethics in Research, Privacy and Data Protection
1. Complete Tutorial 6 Part II
- Epidemiology Data Analytics
- Complete ALL readings above
- Review ALL tutorials to understand at the command level
- Review ALL lecture slides
LabB: HDInsight VMs on Azure DataBricks cluster
10
week 92018-10-291. Complete Tutorial 6 Part II
- Epidemiology Data Analytics
2. Work on Tutorial 6 Part III
- Epidemiology Advanced Analytics
Guest lecture (FS,VM): UMCU/Psychiatry on Big Data in PsychiatryFINAL LECTURE (MS):
- "Towards Tomorrow: Trends in Data Science & Society"
- Endterm Q/A
1. Complete Tutorial 6 Part III
- Epidemiology Advanced Analytics
- Complete ALL readings above
- Review ALL tutorials to understand at the command level
- Review ALL lecture slides
LabB: HDInsight VMs on Azure DataBricks cluster
11
week 102018-11-05ENDTERM
EXAM
12
week 12010-01-032ND CHANCE EXAM
Loading...
Main menu