Lessons from reluctant data engineering
Yanir Seroussi
Disclaimer #1: I am not a data engineer
Data engineering: More valuable than data science?
Why am I here today?
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first committed climate and biodiversity moves
2022
My first data science job
2012
Disclaimer #2: Lessons may be obvious or better learnt the hard way
Snippet 1/5
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first data science job
2012
My first committed climate and biodiversity moves
2022
2012
My first data science job
“Big Launch” prep with shiny new tech
Lesson: Shiny tech ain’t always shiny
We should forget about small efficiencies, say about 97% of the time:
premature optimization�is the root of all evil.
Yet we should not pass up our opportunities in that critical 3%.
– Donald Knuth, 1974
Snippet 2/5
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first data science job
2012
My first committed climate and biodiversity moves
2022
2013
My first head of data science job
Real launch leads to real scaling problems
Lesson: Shiny tech can be transformative; but principles beat tools
As to methods, there may be a million and then some, but principles are few.
The person who grasps principles can successfully select their own methods.
The person who tries methods, ignoring principles, is sure to have trouble.
– Harrington Emerson, 1911
Snippet 3/5
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first data science job
2012
My first committed climate and biodiversity moves
2022
2015
My first enterprise consulting stint
Enterprise-scale blockers; human-scale workarounds
Lesson: Solve problems; don’t be the problem
Focus on the user and all else will follow.
– Google, ~2004
Snippet 4/5
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first data science job
2012
My first committed climate and biodiversity moves
2022
2017
My first remote data science job
Automattic: Remote company; normal data problems
Lesson: Go deep; trust but verify
Given enough eyeballs, all bugs are shallow.
– Eric S. Raymond, 1999
Snippet 5/5
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first committed climate and biodiversity moves
2022
My first data science job
2012
2022
My first committed climate and biodiversity moves
Recent highlights
Lesson: Tech & titles are tools; focus on what matters
You are not obliged to complete the work, but neither are you free to desist from it.
– Rabbi Tarfon, ~100
Why am I here today?
My first data engineering conference talk
2023
My first remote data science job
2017
My first enterprise consulting stint
2015
My first head of data science job
2013
My first committed climate and biodiversity moves
2022
My first data science job
2012
Takeaway:
Data problems have human roots – and human solutions
Recap: Data problems have human roots & solutions
Questions?