Understanding your code with Machine Learning
The Data Platform for your Software Development Life Cycle
Intro
Hugo, Data Scientist @ source{d}
Alex, Engineer @ source{d}
Agenda
Goals of the workshop
What to expect from the remaining 2 hours:
*Start copying the data!
Machine Learning on Code
Data Science project
Main activities during a data science project
Workshop tasks
Developer and project similarity
Both can be done in an unsupervised way,
by building a vector representation for each individual developer and project.
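To make the idea concrete, here is a minimal sketch of unsupervised similarity, assuming each developer is represented by a bag of the identifiers they wrote and compared with cosine similarity. The toy data, the helper names (`to_vector`, `cosine_similarity`, `most_similar`) and the choice of bag-of-identifiers vectors are illustrative assumptions, not the workshop's actual pipeline. The same approach would apply to projects.

```python
import math
from collections import Counter

# Hypothetical toy data: identifiers extracted from each developer's code.
developer_identifiers = {
    "alice": ["tensor", "model", "train", "loss", "model"],
    "bob": ["model", "loss", "optimizer", "train"],
    "carol": ["http", "request", "router", "handler"],
}


def to_vector(tokens):
    """Represent a developer as a sparse count vector over identifiers."""
    return Counter(tokens)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# No labels are needed: the vectors come directly from the code itself.
vectors = {name: to_vector(tokens) for name, tokens in developer_identifiers.items()}


def most_similar(name):
    """Return the (other developer, similarity) pair closest to `name`."""
    scores = [
        (other, cosine_similarity(vectors[name], vectors[other]))
        for other in vectors
        if other != name
    ]
    return max(scores, key=lambda pair: pair[1])


print(most_similar("alice"))
```

In this toy data, alice and bob share machine learning vocabulary, so they end up closer to each other than to carol, whose identifiers do not overlap with theirs at all.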
Workshop tasks
Function name suggestion
How to predict the name of a function, based on its body text?
We are going to use a simple Machine Translation baseline* model
* See github.com/src-d/awesome-machine-learning-on-source-code/ for state-of-the-art (SOTA) models
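Framing the task as translation means each function becomes a pair of token sequences: the body is the "source sentence" and the name is the "target sentence". A minimal sketch of that data preparation step follows, assuming identifiers are split into lowercase subtokens on snake_case and camelCase boundaries. The helper names (`split_identifier`, `make_pair`) and the example functions are hypothetical, and the translation model itself is not shown.

```python
import re

# Matches acronyms ("HTTP"), capitalized or lowercase words ("Response", "read"), and digit runs.
SUBTOKEN_PATTERN = re.compile(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+")
IDENTIFIER_PATTERN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def split_identifier(identifier):
    """Split a snake_case or camelCase identifier into lowercase subtokens."""
    return [part.lower() for part in SUBTOKEN_PATTERN.findall(identifier)]


def make_pair(name, body):
    """Build one (source, target) training pair for a translation model.

    source: subtokens of every identifier and keyword in the body, in order
    target: subtokens of the function name
    """
    source = []
    for identifier in IDENTIFIER_PATTERN.findall(body):
        source.extend(split_identifier(identifier))
    target = split_identifier(name)
    return source, target


# Hypothetical examples of (function name, body text).
functions = [
    ("count_words", "return len(text.split())"),
    ("readFileLines", "with open(path) as f:\n    return f.readlines()"),
]

for name, body in functions:
    source, target = make_pair(name, body)
    print(" ".join(source), "->", " ".join(target))
```

Splitting names into subtokens keeps the target vocabulary small, so a model can compose names it has never seen as a whole from pieces it has seen.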
Hands On: preparations
Before we begin: necessary steps to prepare the local environment
Let’s get coding!