Data science workflow tools
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

Comment only
 
 
ABCDEFGHIJKLMNOPQRSTUVWXYZAAAB
1
The tools below are categorized according to their applicability within the 6 core processes of the data science workflow, as defined in this blog post:
2
http://fouryears.eu/2018/11/29/the-data-science-workflow/
3
The list does not claim to be an all-encompassing survey. This is just a sample of tools and services known to the author
4
Workflow stages
5
TitleLink
Pricing /
License
Github
stars (end of 2018)
Data
management
Compute
graph
Experiment
management
DeploymentMonitoring
Workbench
organization
6
Any database server or file format
N/AN/Ax
7
Git LFS
https://git-lfs.github.com/
MIT6488x
8
Git Annex
https://git-annex.branchable.com/
AGPLN/Ax
9
Quilt
https://quiltdata.com/
Freemium684x
10
Darty
https://github.com/zalando-incubator/darty
MIT10x
11
GNU Make
https://www.gnu.org/software/make/
GPLN/Ax
12
Luigi
https://github.com/spotify/luigi
Apache10529x
13
Airflow
https://github.com/apache/incubator-airflow
Apache10086x
14
PyDoit
http://pydoit.org
MIT541x
15
DVChttps://dvc.org/Apache1523xx
16
Pachyderm
http://www.pachyderm.io/
Apache3267xx?
17
DatMo Open Source
https://github.com/datmo/datmo
Apache179??
18
ModelDB
https://github.com/mitdbg/modeldb
MIT573x
19
CometML
https://www.comet.ml/
FreemiumN/Ax
20
Sacred
https://github.com/IDSIA/sacred
MIT1404x
21
Hyperdash
https://hyperdash.io/
Free110x
22
Sumatra
https://github.com/open-research/sumatra
BSD81x
23
FGLab
https://kaixhin.github.io/FGLab/
MIT170x
24
TensorFX
https://github.com/TensorLab/tensorfx
Apache178x
25
TensorBoard
https://github.com/tensorflow/tensorboard
Apache2840x
26
Weights and Biases
https://www.wandb.com/
FreemiumN/Ax
27
Modelchimp
https://www.modelchimp.com/
BSD/Paid19x
28
Hyperflow
https://hyperflow.in/
Apache0?x
29
Neptune
https://neptune.ml/
FreemiumN/Axx
30
Tensorflow Serving
https://www.tensorflow.org/serving/
Apache2843x
31
SKompiler
https://github.com/konstantint/SKompiler
MIT24x
32
JPMML/jpmml-sklearn
https://github.com/jpmml
AGPL258x
33
AWS Lambda
https://aws.amazon.com/lambda/
PaidN/Ax
34
Acumos
https://www.acumos.org/
PaidN/Ax
35
MLFlow
https://www.mlflow.org/
Apache2674xxx
36
YHat
http://docs.yhat.com/
PaidN/Axx?
37
Grafana
https://grafana.com/
Apache25175x
38
Metabase
https://www.metabase.com/
AGPL12314x
39
Tableau
https://www.tableau.com/
PaidN/Ax
40
MyDBR
https://mydbr.com/
FreemiumN/Ax
41
DatMo Pro
https://datmo.com/product
PaidN/Axx
42
Cookiecutter-DS
http://drivendata.github.io/cookiecutter-data-science/
MIT1649x
43
Hops.io
https://www.hops.io/
AGPL5x
44
Lore
https://github.com/instacart/lore
MIT1247xxxx
45
FloydHub
https://www.floydhub.com/
Freemium109xxxx
46
AzureML
https://studio.azureml.net/
PaidN/Axxxx
47
Dataiku
https://www.dataiku.com/dss/
PaidN/Axxxxx
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
Loading...