ABCD
1
List of public data sets, useful for visualization
2
- Contributions: everyone has edit access to improve existing entries and to add your favorites at the bottom (right click on last row > Insert 1 below) 🙇
3
- Tip to dig up new sets: to avoid just getting reports and sanitized data, add following keywords to your Google search: "data set", api, feed, download, json, csv, pdf
4
5
DescriptionFeature highlightsLinkNotes
6
your computer’s system clocktime, calendar, timezoneN/Athink of "time as form"
7
GeoNamescountry, city, population, lat/lnghttp://www.geonames.org/mainly using REST webservices, check readme of Gazetteer Data for format on extract files
8
Earthquakeslat/lng, seismic station and waveformhttp://earthquake.usgs.gov/data/webservices including real-time feeds
9
USDA Aerial PhotographyN/A
https://www.fsa.usda.gov/programs-and-services/aerial-photography/index
U.S. only, 10M+ images since 1955, not digitized and have to order
10
NYC Taxi Trip Recordszone boundary, pick-up/drop-off lat/lng, tip
http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
since 2009
11
OpenFlightsairport lat/lng, airline, routehttp://openflights.org/data.htmlglobal coverage with 7k airports
12
Nuclear power stations worldwidename, lat/lng, reactors
https://fusiontables.google.com/DataSource?dsrcid=579353#rows:id=1
simple set, would be great to aggregate with more data
13
NOAAstation measurements, raw satellite radiancehttps://www.ncdc.noaa.gov/data-accessworld largest weather and climate db, focus on U.S., complicated access
14
Dark Sky APIconditionhttps://darksky.net/dev/requires API key, takes lat/lng as parameter, forecasts, historical
15
New York Times (NYT) APIsheadline, abstract, new article with imagehttp://developer.nytimes.com/requires key per API, API exploration tool, "hack the news"
16
Data for No Ceilings (Fathom)indicator per year per countryhttps://github.com/fathominfo/noceilings-datagender inequality, 900 indicators since 1990 globally
17
Hubway Ridesstation lat/lng, status, rebalancing, rider demohttp://hubwaydatachallenge.org/Hubway data visualization challenge, half a million rides
18
U.S. Census Bureau CitySDKcensus, economic, housing, commutehttps://uscensusbureau.github.io/citysdk/U.S. only
19
eurostatpopulation, immigration, tradehttp://ec.europa.eu/eurostatEuropean statistics
20
Sunlight Foundation APIscongress, party, financehttp://sunlightfoundation.com/api/Government and Politics
21
Gridded Population of the World (GPW), v4population count
http://beta.sedac.ciesin.columbia.edu/data/collection/gpw-v4
2000-2020, 1km grid size
22
TIGER/Line Shapefilesgeographic
https://www.census.gov/geo/maps-data/data/tiger-line.html
most comprehensive, detailed
23
City Datapopulation count, crime, weatherhttp://www.city-data.com/U.S. only
24
NYC 3-D Building Modeleach building with major roofs as CityGML
http://www1.nyc.gov/site/doitt/initiatives/3d-building.page
LoD 1-2, from 2014
25
swissBUILDINGS3D 1.0footprints + heights 2.5D without roofs
https://shop.swisstopo.admin.ch/en/products/landscape/build3D
not continued
26
swissBUILDINGS3D 2.0CityGML (and others)
https://shop.swisstopo.admin.ch/en/products/landscape/build3D2
LoD 2, 1125 of 2294 communes 100% anticipated by mid-2018, big cities missing
27
Stadt Bern Geodaten und Pläne3D city model, historic, cycle, energy, aerial
http://www.bern.ch/themen/planen-und-bauen/geodaten-und-plane
LoD 1-2, aerial images ('99, '04, '08, '12, '16), have to order most, downloads at geobern.ch
28
3D-Stadtmodell Zürich3D city model as CityGML
https://www.stadt-zuerich.ch/ted/de/index/geoz/geodaten_u_plaene/3d_stadtmodell.html
LoD 0-2, beyind paywall
29
3D-Stadtmodell des Kantons Basel-Stadt3D city model, historic
http://www.gva.bs.ch/produkte_dienstleistungen/3d-stadtmodelle.cfm
not clear how to download
30
Geneva, Switzerland Buildings3D buildings
https://www.arcgis.com/home/item.html?id=033ecf268ae34489ac9aa1e88cd70860
ArcGIS
31
MIT JSON APIsClassrooms, courses, peoplehttp://ist.mit.edu/web-api?category=15Available to MIT affiliates
32
Global Cigarette ConsumptionAnnual per country cigarettes per capita
http://www.tobaccoatlas.org/topic/cigarette-use-globally/
Switzerland=1633 vs. USA=1083
33
Uber MovementTravel times (timestamped origin/destination)https://movement.uber.com/cities~30 global cities
34
35
Other collections
36
Awesome Public DatasetsN/A
https://github.com/caesar0301/awesome-public-datasets
GitHub repo with 100+ contributors
37
Data Packaged Core DatasetsN/Ahttps://github.com/datasets/Important, commonly-used datasets in high quality, easy-to-use & open form as data packages
38
opendata.swissN/Ahttps://opendata.swissportal for Swiss open government data with 2k sets
39
OpenDataMonitorN/A
http://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex
An overview of available open data resources in Europe
40
Data.govN/Ahttps://www.data.gov/U.S. Government’s open data
41
Data USAN/Ahttps://datausa.ioMakes public U.S. governmetnt data more accessible through visualizations etc.
42
AWS Public DatasetsN/Ahttps://aws.amazon.com/datasets/includes Million Song Dataset, Marvel Universe Social Graph
43
Google Public Data ExplorerN/Ahttps://www.google.com/publicdata/directorysearch box and many WEF, WHO, WTO, World Bank sort of sets
44
Subreddit DatasetsN/Ahttps://www.reddit.com/r/datasets/continuous new submissions and requests of sets
45
Datasets for Machine LearningN/A
https://docs.google.com/spreadsheets/d/1AQvZ7-Kg0lSZtG1wlgbIsrm90HaTZrJGQMz-uKRRlFw/edit
includes Amazon Reviews (100M+), famous ImageNet, Stanford Drone Dataset (70GB)
46
OpenMLN/Ahttp://www.openml.org/search?type=data20k sets
47
Quora threadN/A
https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
dig deep in the thread to find non-generic pointers
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100