Starts 12 July 2021
London Office of Technology and Innovation
LOTI and ONS Data Science Bootcamp
@LOTI_LDN
www.loti.london
#LOTI
2
Introduction
The Office of National Statistics (ONS) Data Science Campus is at the heart of leading-edge data science capacity building with public sector bodies in the UK and abroad. They equip analysts with the latest tools and techniques, giving them the capability to perform effectively in their roles.
The London Office of Technology and Innovation (LOTI) in collaboration with the ONS Data Science Campus and the Greater London Authority (GLA) have designed a cross-borough Data Science Bootcamp.
The Bootcamp will be available across multiple public sector organisations, working in collaboration in learning clusters. It aims to develop the capability and skills of officers and provide practical demonstrations of the value of data science within London’s local government.
The Bootcamp will be offered to members of LOTI’s Data Science Network made up of Data Scientists and Analysts learning to code.
Overview
3
Details about the Data Science Bootcamp
Format of a typical day
Programme Availability
We have 15 places available! Places will be distributed fairly across the boroughs, subject to the skills and capabilities of those who apply. We welcome sign-ups from more than one participant per borough, but cannot guarantee everyone a place.
Timetable
Beginner - Intermediate
Monday 12 July – Monday 6 September (This is a 9 week period as the programme won’t take place on Monday 30 August)
Advanced
Monday 13 September – Monday 4 October
Train the Trainer
As there are limited places on the Data Science Bootcamp, we are hoping to develop a process by which learning resources can become learning pack templates for programme participants to share with colleagues in their boroughs.
Overview
Data matching is a foundational skill set in data science, enabling data to be brought together for descriptive, diagnostic and predictive analytics.
Taking part in LOTI and ONS’s Data Science Bootcamp will give you the ability to use data matching techniques to tackle some of these common challenges:
4
Data Matching
Data Matching
Overview
Some of the skills and techniques you’ll acquire as a result of the programme:
Basic:
Intermediate:
Advanced:
5
Draft Course Overview - Beginner / Intermediate (1/2)
Pre-learning
Required: Into to Data Linkage and either Intro to Python or Intro to R - all available on the ONS Learning Hub.�Optional/recommended: foundations of SQL, RAP pathway, PySpark�
Week 1: Consolidate
Week 2: Exact Matching
Week 3: Further Preprocessing for Data Linkage
Draft Course Overview - Beginner / Intermediate (2/2)
Week 4: Mini Project 1
Week 5: Rule Based + Score Matching (deterministic)
Week 6: Probabilistic Matching
on analysis
Week 7: Further methods in data linkage
Week 8: Mini Project 2
Week 3: Unsupervised Machine Learning Matching
Week 4: Applied Project
8
Draft Course Overview - Advanced
Pre-learning
Required: Introduction to Python/R, Data Linkage in Python/R, Machine Learning in Python / R��Participants will ideally have their own business problem they can explore with these techniques to supplement the open data provided.
Week 1: Consolidation
Week 2: Supervised Machine Learning Matching
Pilot Cohort
9
Name | Organisation |
Chiadi Lionel | Camden |
Luke Ballance | Camden |
Malgorzata Lachowska | Greater London Authority |
Yiran Wei | Greater London Authority |
Toby Meller | Kingston upon Thames |
Huu Do | Hackney |
Lindsey Coulson | Hackney |
Lee Latchford | Havering |
Anna Trichkine | Hounslow |
Ejaz Hussain | Hounslow |
Ian Hanson | Kensington and Chelsea |
Sean Pedrick-Case | Lambeth |
Karen Kemsley | Lewisham |
Emmanuel Steadman | Tower Hamlets |
David Saxton | Tower Hamlets |
FAQs
How many people can sign up from one borough?
We accept multiple applications from the same borough.
How many places are available on the course?
15 places in total, but there are limited spaces on the Advanced stream.
How will ONS & LOTI assess the level of prior knowledge?
Prospective participants will be asked to self-assess their skill levels using our Expression of Interest form [CLOSED].
Must applicants complete the relevant ONS courses on Python / R?
No, equivalent knowledge is acceptable.
How is the course carried out?
Open to flexibility and is adaptable. Participant-led, so please do bring your own datasets. Introduction to content, demos, and then time for participants to put in practice what they’ve learned, with time aside for 1-2-1 tuition.
What day in the week?
Monday each week
10
FAQs
11
How do participants access the pre-training materials?
Once the final participants have been decided, then successful applicants will need to approve a privacy statement to access the full raft of content on the ONS Learning Hub. Relevant courses will be highlighted.
What is the recommended language?
R is more user-friendly and Python is more useful for development and deploying machine learning, so it’s a matter of preference.
What software and packages need to be installed ahead of the course?
Pre-learning courses will prompt participants to install what will be necessary for the course.
The full list of additional packages and libraries will be circulated to successful applicants as part of the pre-learning starter pack.
Are you expecting people to complete Intro to Python & R as a pre-requisite?
Pre-programme questions will be shared with participants beforehand to gauge what language to focus on. No restrictions on preferred language.
FAQs
12
Is there a benchmark/ standard that is embedded in the prospectus for participants to self assess?
Participants will be asked to complete the recommended ONS courses prior to starting the programme. This will bring them up to the required level.
Utilising libraries to access Python & will it be available after the course?
Some of the content will be taken from pre-existing libraries, but some yet to be determined to allow for flexibility. Majority of packages will have been installed as part of pre-learning content & time will be allowed for participants to download during the course.
Can we use personal laptops?
Yes, but it depends on the data that is being used. If bringing your organisation’s data, then participants would be expected to have liaised with their organisation’s Information Governance teams beforehand to ensure that will be possible.
How many places are there on the Advanced course?
15 places in total, but there are limited spaces on the Advanced stream.
FAQs
13
Can participants switch between languages?
We would recommend that participants stick to one but won’t restrict you from doing so. The theory is the same but participants would be doubling workload. Easier to learn in one language and once comfortable, can switch between them by applying syntax.
How do participants contact their mentors?
Mentors can be contacted by email in-between. Potential for creating a channel on the Government Data Science Slack channel.
Does data matching cover address base as that’s what is used in Council?
It depends on the datasets, so please do bring the datasets that would be of most use and we will remain flexible.
Is access to the Learning Hub time-limited?
After completing the programme, participants will have access to the learning hub for an indefinite period.
London Office of Technology and Innovation
ONS Data Science Campus
For further information
Jay Saggar - Programme Manager (Data, Smart City & Cyber Security), LOTI
Onyeka Onyekwelu - Strategic Engagement Manager, LOTI
Sophie Nelson - Programme Officer, Data Science Campus