Data science workflow tools
The tools below are categorized by their applicability to the six core processes of the data science workflow, as defined in this blog post:
http://fouryears.eu/2018/11/29/the-data-science-workflow/
This list is not an all-encompassing survey; it is a sample of tools and services known to the author.
Workflow stages (columns, in order): Data management, Compute graph, Experiment management, Deployment, Monitoring, Workbench organization. Each x marks one stage the tool applies to; the marks are listed without their column positions.

| Title | Link | Pricing / License | GitHub stars (end of 2018) | Stage marks |
|---|---|---|---|---|
| Any database server or file format | | N/A | N/A | x |
| Git LFS | https://git-lfs.github.com/ | MIT | 6488 | x |
| Git Annex | https://git-annex.branchable.com/ | AGPL | N/A | x |
| Quilt | https://quiltdata.com/ | Freemium | 684 | x |
| Darty | https://github.com/zalando-incubator/darty | MIT | 10 | x |
| GNU Make | https://www.gnu.org/software/make/ | GPL | N/A | x |
| Luigi | https://github.com/spotify/luigi | Apache | 10529 | x |
| Airflow | https://github.com/apache/incubator-airflow | Apache | 10086 | x |
| DVC | https://dvc.org/ | Apache | 1523 | xx |
| Pachyderm | http://www.pachyderm.io/ | Apache | 3267 | xx? |
| DatMo Open Source | https://github.com/datmo/datmo | Apache | 179 | ?? |
| ModelDB | https://github.com/mitdbg/modeldb | MIT | 573 | x |
| CometML | https://www.comet.ml/ | Freemium | N/A | x |
| Sacred | https://github.com/IDSIA/sacred | MIT | 1404 | x |
| Hyperdash | https://hyperdash.io/ | Free | 110 | x |
| Sumatra | https://github.com/open-research/sumatra | BSD | 81 | x |
| FGLab | https://kaixhin.github.io/FGLab/ | MIT | 170 | x |
| TensorFX | https://github.com/TensorLab/tensorfx | Apache | 178 | x |
| TensorBoard | https://github.com/tensorflow/tensorboard | Apache | 2840 | x |
| Weights and Biases | https://www.wandb.com/ | Freemium | N/A | x |
| Modelchimp | https://www.modelchimp.com/ | BSD/Paid | 19 | x |
| Hyperflow | https://hyperflow.in/ | Apache | 0 | ?x |
| Neptune | https://neptune.ml/ | Freemium | N/A | xx |
| Tensorflow Serving | https://www.tensorflow.org/serving/ | Apache | 2843 | x |
| SKompiler | https://github.com/konstantint/SKompiler | MIT | 24 | x |
| JPMML/jpmml-sklearn | https://github.com/jpmml | AGPL | 258 | x |
| AWS Lambda | https://aws.amazon.com/lambda/ | Paid | N/A | x |
| Acumos | https://www.acumos.org/ | Paid | N/A | x |
| MLFlow | https://www.mlflow.org/ | Apache | 2674 | xxx |
| YHat | http://docs.yhat.com/ | Paid | N/A | xx? |
| Grafana | https://grafana.com/ | Apache | 25175 | x |
| Metabase | https://www.metabase.com/ | AGPL | 12314 | x |
| Tableau | https://www.tableau.com/ | Paid | N/A | x |
| MyDBR | https://mydbr.com/ | Freemium | N/A | x |
| DatMo Pro | https://datmo.com/product | Paid | N/A | xx |
| Cookiecutter-DS | http://drivendata.github.io/cookiecutter-data-science/ | MIT | 1649 | x |
| Hops.io | https://www.hops.io/ | AGPL | 5 | x |
| Lore | https://github.com/instacart/lore | MIT | 1247 | xxxx |
| FloydHub | https://www.floydhub.com/ | Freemium | 109 | xxxx |
| AzureML | https://studio.azureml.net/ | Paid | N/A | xxxx |
| Dataiku | https://www.dataiku.com/dss/ | Paid | N/A | xxxxx |