A | B | C | D | |
---|---|---|---|---|
1 | List of public data sets, useful for visualization | |||
2 | - Contributions: everyone has edit access to improve existing entries and to add your favorites at the bottom (right click on last row > Insert 1 below) 🙇 | |||
3 | - Tip to dig up new sets: to avoid just getting reports and sanitized data, add the following keywords to your Google search: "data set", api, feed, download, json, csv, pdf | |||
4 | ||||
5 | Description | Feature highlights | Link | Notes |
6 | your computer’s system clock | time, calendar, timezone | N/A | think of "time as form" |
7 | GeoNames | country, city, population, lat/lng | http://www.geonames.org/ | mainly using REST webservices, check readme of Gazetteer Data for format on extract files |
8 | Earthquakes | lat/lng, seismic station and waveform | http://earthquake.usgs.gov/data/ | webservices including real-time feeds |
9 | USDA Aerial Photography | N/A | https://www.fsa.usda.gov/programs-and-services/aerial-photography/index | U.S. only, 10M+ images since 1955, not digitized and have to order |
10 | NYC Taxi Trip Records | zone boundary, pick-up/drop-off lat/lng, tip | http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml | since 2009 |
11 | OpenFlights | airport lat/lng, airline, route | http://openflights.org/data.html | global coverage with 7k airports |
12 | Nuclear power stations worldwide | name, lat/lng, reactors | https://fusiontables.google.com/DataSource?dsrcid=579353#rows:id=1 | simple set, would be great to aggregate with more data |
13 | NOAA | station measurements, raw satellite radiance | https://www.ncdc.noaa.gov/data-access | world's largest weather and climate db, focus on U.S., complicated access |
14 | Dark Sky API | condition | https://darksky.net/dev/ | requires API key, takes lat/lng as parameter, forecasts, historical |
15 | New York Times (NYT) APIs | headline, abstract, new article with image | http://developer.nytimes.com/ | requires key per API, API exploration tool, "hack the news" |
16 | Data for No Ceilings (Fathom) | indicator per year per country | https://github.com/fathominfo/noceilings-data | gender inequality, 900 indicators since 1990 globally |
17 | Hubway Rides | station lat/lng, status, rebalancing, rider demo | http://hubwaydatachallenge.org/ | Hubway data visualization challenge, half a million rides |
18 | U.S. Census Bureau CitySDK | census, economic, housing, commute | https://uscensusbureau.github.io/citysdk/ | U.S. only |
19 | eurostat | population, immigration, trade | http://ec.europa.eu/eurostat | European statistics |
20 | Sunlight Foundation APIs | congress, party, finance | http://sunlightfoundation.com/api/ | Government and Politics |
21 | Gridded Population of the World (GPW), v4 | population count | http://beta.sedac.ciesin.columbia.edu/data/collection/gpw-v4 | 2000-2020, 1km grid size |
22 | TIGER/Line Shapefiles | geographic | https://www.census.gov/geo/maps-data/data/tiger-line.html | most comprehensive, detailed |
23 | City Data | population count, crime, weather | http://www.city-data.com/ | U.S. only |
24 | NYC 3-D Building Model | each building with major roofs as CityGML | http://www1.nyc.gov/site/doitt/initiatives/3d-building.page | LoD 1-2, from 2014 |
25 | swissBUILDINGS3D 1.0 | footprints + heights 2.5D without roofs | https://shop.swisstopo.admin.ch/en/products/landscape/build3D | not continued |
26 | swissBUILDINGS3D 2.0 | CityGML (and others) | https://shop.swisstopo.admin.ch/en/products/landscape/build3D2 | LoD 2, 1125 of 2294 communes covered, 100% anticipated by mid-2018, big cities missing |
27 | Stadt Bern Geodaten und Pläne | 3D city model, historic, cycle, energy, aerial | http://www.bern.ch/themen/planen-und-bauen/geodaten-und-plane | LoD 1-2, aerial images ('99, '04, '08, '12, '16), have to order most, downloads at geobern.ch |
28 | 3D-Stadtmodell Zürich | 3D city model as CityGML | https://www.stadt-zuerich.ch/ted/de/index/geoz/geodaten_u_plaene/3d_stadtmodell.html | LoD 0-2, behind paywall |
29 | 3D-Stadtmodell des Kantons Basel-Stadt | 3D city model, historic | http://www.gva.bs.ch/produkte_dienstleistungen/3d-stadtmodelle.cfm | not clear how to download |
30 | Geneva, Switzerland Buildings | 3D buildings | https://www.arcgis.com/home/item.html?id=033ecf268ae34489ac9aa1e88cd70860 | ArcGIS |
31 | MIT JSON APIs | Classrooms, courses, people | http://ist.mit.edu/web-api?category=15 | Available to MIT affiliates |
32 | Global Cigarette Consumption | Annual per country cigarettes per capita | http://www.tobaccoatlas.org/topic/cigarette-use-globally/ | Switzerland=1633 vs. USA=1083 |
33 | Uber Movement | Travel times (timestamped origin/destination) | https://movement.uber.com/cities | ~30 global cities |
34 | ||||
35 | Other collections | |||
36 | Awesome Public Datasets | N/A | https://github.com/caesar0301/awesome-public-datasets | GitHub repo with 100+ contributors |
37 | Data Packaged Core Datasets | N/A | https://github.com/datasets/ | Important, commonly-used datasets in high quality, easy-to-use & open form as data packages |
38 | opendata.swiss | N/A | https://opendata.swiss | portal for Swiss open government data with 2k sets |
39 | OpenDataMonitor | N/A | http://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex | An overview of available open data resources in Europe |
40 | Data.gov | N/A | https://www.data.gov/ | U.S. Government’s open data |
41 | Data USA | N/A | https://datausa.io | Makes public U.S. government data more accessible through visualizations etc. |
42 | AWS Public Datasets | N/A | https://aws.amazon.com/datasets/ | includes Million Song Dataset, Marvel Universe Social Graph |
43 | Google Public Data Explorer | N/A | https://www.google.com/publicdata/directory | search box and many WEF, WHO, WTO, World Bank sort of sets |
44 | Subreddit Datasets | N/A | https://www.reddit.com/r/datasets/ | continuous new submissions and requests of sets |
45 | Datasets for Machine Learning | N/A | https://docs.google.com/spreadsheets/d/1AQvZ7-Kg0lSZtG1wlgbIsrm90HaTZrJGQMz-uKRRlFw/edit | includes Amazon Reviews (100M+), famous ImageNet, Stanford Drone Dataset (70GB) |
46 | OpenML | N/A | http://www.openml.org/search?type=data | 20k sets |
47 | Quora thread | N/A | https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public | dig deep in the thread to find non-generic pointers |
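
Many entries above expose lat/lng through a web feed (for example the Earthquakes row with its real-time feeds). As a rough sketch of how such a feed could be turned into plottable points, the Python below reads a GeoJSON FeatureCollection and pulls out latitude, longitude, magnitude, and place. The feed URL and the `mag` / `place` property names are assumptions that are not stated in the sheet, so check the provider's documentation before relying on them; `extract_points`, `fetch_points`, and `SAMPLE` are hypothetical names used only for illustration.

```python
import json
from urllib.request import urlopen

# Assumed feed URL (not listed in the sheet); verify it on the USGS site.
FEED_URL = "https://earthquake.usgs.gov/earthquakes/feed/v1.0/summary/all_day.geojson"


def extract_points(feature_collection):
    """Return (lat, lng, magnitude, place) tuples from a GeoJSON FeatureCollection."""
    points = []
    for feature in feature_collection.get("features", []):
        geometry = feature.get("geometry") or {}
        coords = geometry.get("coordinates") or []
        # Skip anything that is not a point with at least lng and lat.
        if geometry.get("type") != "Point" or len(coords) < 2:
            continue
        lng, lat = coords[0], coords[1]  # GeoJSON stores longitude first
        props = feature.get("properties") or {}
        # "mag" and "place" are assumed property names; missing ones become None.
        points.append((lat, lng, props.get("mag"), props.get("place")))
    return points


def fetch_points(url=FEED_URL, timeout=10):
    """Download a feed and extract its points (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return extract_points(json.load(response))


# Made-up sample in the assumed layout, so the parser can be tried offline.
SAMPLE = {
    "type": "FeatureCollection",
    "features": [
        {
            "type": "Feature",
            "properties": {"mag": 2.1, "place": "hypothetical place"},
            "geometry": {"type": "Point", "coordinates": [7.45, 46.95, 10.0]},
        },
        {"type": "Feature", "properties": {"mag": 1.0}, "geometry": None},
    ],
}

if __name__ == "__main__":
    for lat, lng, mag, place in extract_points(SAMPLE):
        print(lat, lng, mag, place)
```

Keeping the parsing separate from the download makes it easy to test against a saved file first, and the same `extract_points` function should work for any other source in the list that publishes point data as GeoJSON.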