Get Tickets: http://www.dataengconf.com/tickets
Sponsorships: http://www.dataengconf.com/contact
Code of Conduct: http://www.dataengconf.com/code-of-conduct
Note: SCHEDULE IS SUBJECT TO CHANGE BY ORGANIZER

DAY 1 (11/3) - SCHEDULE
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #1: Data Science: Past, Present, Future
Speaker: Hilary Mason - Founder, Fast Forward Labs
TRACK 1 - Data Engineering | TRACK 2 - Data Science | TRACK 3 - Office Hours
10:00 - 10:40AM Apache Kudu: Fast Analytics on Fast Data
Speaker: Will Berkeley - Apache Kudu PMC & Solutions Architect, Cloudera
Bias, Variance, and Adaptive Products
Speaker: George Davis - Founder & CEO, Frame.ai
Rm 1: Hilary Mason | -
10:45 - 11:25AM Kafka Streams: Stream Processing Made Easy
Speaker: Guozhang Wang - Kafka Committer & Software Engineer, Confluent
Forecasting with Chaotic Models and Sparse Data: Lessons from Numerical Weather Prediction
Speaker: David Kelly - Assistant Professor, NYU
Rm 1: Will Berkeley | Rm 2: George Davis
11:30 - 12:10PM Building a Cloud-Native SQL Database
Speaker: Alex Robinson - Member of Technical Staff, Cockroach Labs
Anomaly Detection Algorithms for Real-World Systems
Speaker: Manojit Nandi - Lead Data Scientist, STEALTHBits
Rm 1: Guozhang Wang | Rm 2: Jeremy Nilmeier
12:15 - 1:15PM Lunch | - | -
1:15 - 1:55PM Unified Pipeline Architecture: The Evolution of Data Processing at Spotify
Speaker: Erin Palmer - Applied Data Scientist, Spotify
Statistical and Computational Aspects of News Article Clustering
Speaker: Jeiran Jahani - Research Data Scientist, Chartbeat
Rm 1: Alex Robinson | Rm 2: Manojit Nandi
2:00 - 2:40PM Peloton: The Self-Driving Database Management System
Speaker: Andy Pavlo - Assistant Professor of Databaseology, Carnegie Mellon University
Relevance in Twitter's Home Timeline
Speaker: Parag Agrawal - Principal Software Engineer, Twitter
Rm 1: Erin Palmer | Rm 2: Jeiran Jahani
2:45 - 3:15PM Coffee Break | - | -
3:15 - 3:55PM Processing Geographic Data at Internet Scale
Speaker: Peter Lenz - Senior Geospatial Analyst, Dstillery
A/B Testing is Hard: Lessons from Ad Exchanges
Speaker: Sergei Vassilvitskii - Research Scientist, Google
Rm 1: Andy Pavlo | Rm 2: Parag Agrawal
4:00 - 4:55PM Career Panel - Leveling Up in Your Career as a Data Engineer/Scientist
Panelists: Nick Chamandy, Lyft; Jonathan Lenaghan, PlaceIQ; Josh Schwartz, Chartbeat
Moderator: Jared Polivka, Galvanize
VC Panel - The Present Future of Data-Oriented Startups
Panelists: Matt Hartman, Betaworks; David Beyer, Amplify Partners; Evan Nisselson, LDV Capital; Tim Devane, Next View Ventures
Moderator: Pete Soderling
Rm 1: Peter Lenz | Rm 2: Sergei Vassilvitskii
5:00 - 7:00PM CONF PARTY - Lightning Talks, Drinks & Food + Mingle with Smart People

DAY 2 (11/4) - SCHEDULE
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Day 2 - Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #2: Computational Social Science: Exciting Progress and Future Challenges
Speaker: Duncan Watts - Principal Researcher, Microsoft Research
TRACK 1 - Data Engineering | TRACK 2 - Data Science | TRACK 3 - Office Hours
10:00 - 10:40AM Genomic Data Analysis with Spark
Speaker: Ryan Williams - Software Engineer, Mt. Sinai's Hammer Lab
Reinforcement Learning for Data Scientists
Speaker: Brian Farris - Data Scientist, Capital One
Rm 1: Duncan Watts | -
10:45 - 11:25AM Elastic Big Data Platform @ Datadog
Speaker: Doug Daniels - Director Engineering, Datadog
Experimental and Observational Causal Inference in Massive Data
Speaker: Sinan Aral - David Austin Professor of Management, MIT
Rm 1: Ryan Williams | Rm 2: Brian Farris
11:30 - 12:10PM Python Data Wrangling: Preparing for the Future
Speaker: Wes McKinney - Senior Vice President, Two Sigma
SystemML & Spark: A Framework for Scalable Data Science Algorithm Development
Speaker: Jeremy Nilmeier - Chief Data Scientist, IBM Spark Technology Center
Rm 1: Doug Daniels | Rm 2: Sinan Aral
12:15 - 1:15PM Lunch | - | -
1:15 - 1:55PM The Trials and Tribulations of Scaling Data Science and Engineering
Speaker: Ashley Miller - Software Engineering Manager, BuzzFeed
Causal Inference in Data Science
Speaker: Amit Sharma - Postdoctoral Researcher, Microsoft Research
Rm 1: Wes McKinney | Rm 2: David Kelly
2:00 - 2:40PM Lessons Learned Optimizing NoSQL for Apache Spark
Speaker: John Musser - VP Engineering, Basho
To Get the Value, Ditch the Hype
Speaker: Nick Ursa - Data Scientist, The New York Times
Rm 1: Ashley Miller | Rm 2: Amit Sharma
2:45 - 3:15PM Coffee Break | - | -
3:15 - 3:55PM The Future of Column-Oriented Data Processing with Arrow & Parquet
Speaker: Julien Le Dem - Principal Architect at Dremio, Apache Parquet Co-founder & PMC Chair
Stop Obsessing About Data Infrastructure
Speaker: Yair Weinberger - Founder & CTO, Alooma
Rm 1: John Musser | Rm 2: Nick Ursa
4:00 - 4:45PM Keynote #3: How to Change a City with Data Science
Speaker: Ben Wellington - Quantitative Researcher, Two Sigma
Rm 1: Julien Le Dem | Rm 2: Yair Weinberger
5:00PM Conference Day 2 End