School of Data - suggested resources
| Title | Description | Link | Type | Contacted (Y/N) | Permission granted to reuse | Submitted by |
| | | | | Have we contacted the author | (under CC-By / CC-Zero or equivalent!) | |
| Statistical literacy | A port (not quite finished) of a course on statistical literacy; materials are currently on a Moodle site | https://p2pu.org/en/groups/statistical-literacy/ | Course | y | Yes, materials here https://github.com/phewson/statlit are GNU documentation licence but could be CC. | |
| Python for Informatics | http://open.umich.edu/education/si/resources/python-opentextbook/winter2010 | Could have data to give | open textbook | N | CC: BY Charles Severance | |
| Networks: Theory and Analysis | http://open.umich.edu/education/si/si508/fall2008 | http://www.stat.ucla.edu/~cocteau/ | graduate level course | N | CC: BY Lada Adamic | |
| Ann Arbor Datadive 2012 | http://open.umich.edu/education/lsa/resources/datadive/winter2012/materials | http://opendatacookbook.net/wiki/ | service weekend with lectures | Y | CC: BY; CC: BY NC SA (Various authors) | |
| NYU-Collect | Open-Source App for Data Collection on mobile devices | http://code.google.com/a/eclipselabs.org/p/nyu-collect/ | app/code | n | (open-source) | |
| Data Services Studio (NYU) | Part of the library system, has many resources for students and lists of resources | http://nyu.libguides.com/dataservicestudio | Organization(s) | n | | |
| Data Driven Detroit Data Sets | A list of datasets the D3 folks use, could probably be uploaded into the SoD datasets database also | http://datadrivendetroit.org/data-mapping/ | Organization(s) | N | Mixed city/national data and compiled data | |
| Data Driven Detroit Toolbox | Welcome to the D3 Toolbox, where you can find the tools that Data Driven Detroit has created to support communities with access to data they can use to take action in their neighborhoods. | http://datadrivendetroit.org/data-mapping/toolbox/ | Organization(s) | N | unknown | |
| Univ Michigan Clark Library Census Guide | Research guide organizing census data and providing guidelines for visualization/mapping tools | http://guides.lib.umich.edu/content.php?pid=117561&sid=1038165 | Organization(s) | N | CC: BY | |
| Dryad | repository of data underlying scientific publications; "dryad lab" teaching modules in development | http://datadryad.org/ | Data | Y | CC-Zero | |
| Data and CC tools wiki | Contains FAQ about how CC tools may be applied to data, links to wiki pages with case studies of organizations using CC0 and CC licenses for data | http://wiki.creativecommons.org/Data | Tutorial | | CC BY | |
| Inter-University Consortium for Political and Social Research (ICPSR) | Contains large amounts of political and social research data. Some data sets are restricted, but most are publicly available for download. | http://www.icpsr.umich.edu/icpsrweb/ICPSR/access/index.jsp | Organization/Data | N | No CC licenses, but the whole purpose of the site is to allow public access and re-use. | |
| Intermediate Data Analysis for Human Rights | A course on the basics of working with data, aimed at human rights activists | http://www.columbia.edu/itc/sipa/U8165/ | Course | N | Copyright, but with permissions to reproduce parts. Here's the wording: "Limited copying permission: Human rights organizations and individual workers are granted permission to photocopy portions or all of this document to further their work provided acknowledgement of the source is given. The authors would appreciate hearing about such applications." | |
| Freebase | An entity graph of people, places and things | http://www.freebase.com/ | Data | | CC-BY | |
| Bastards' Book of Ruby | Guide to Ruby + scraping/parsing etc aimed at journalists | http://ruby.bastardsbook.com/ | Tutorial | | All rights restricted | |
| Drawing by Numbers | by Tactical Tech. Tool reviews, tips, ideas and articles about working with data and visualisation, for campaigners and advocates | http://drawingbynumbers.org | Tutorial | Y | CC-BY-SA 3.0 | |
| House of cards Lidar data | Radiohead House of Cards music video data set | http://code.google.com/p/radiohead/downloads/list | Data | | Creative Commons Attribution-Noncommercial-Share Alike 3.0 License | |
| UC Irvine Machine Learning Repository | Massive numbers of data sets for practicing machine learning skills | http://archive.ics.uci.edu/ml/ | Data | | | |
| R Tutorial on Twitter text mining | slide deck designed for first-timers working with R; tutorial code is available on GitHub | http://jeffreybreen.wordpress.com/2011/07/04/twitter-text-mining-r-slides/ | Tutorial | | | |
| ScraperWiki | platform for writing scraper code that's scheduled to run automatically, also for finding scraped data | https://scraperwiki.com/about/ | Tool | | | |
| Mark Hansen's Data Lectures from NYU ITP | great stats lectures that don't follow the traditional stats narrative | http://www.stat.ucla.edu/~cocteau/nyu/lectures/ | Tutorial | | | |
| Programmable Web | hub for finding APIs | http://www.programmableweb.com/ | Data | | | |
| Data Analysis in Python with Pandas | Video tutorial - 3 hrs - excellent | http://www.youtube.com/watch?v=w26x-z-BdWQ&feature=related | Tutorial | | | |
| Average monthly temperatures across the world | interesting data set | http://datamarket.com/data/set/1loo/average-monthly-temperatures-across-the-world-1701-2011#!display=line | Data | | | |
| United Nations resolutions | Multilingual, paragraph-aligned resolutions of the General Assembly of the United Nations | http://www.uncorpora.org | Data | | Any research use | |
| Premasagar Rose | Presentation: Anatomy of a data visualization | http://anatomydataviz.dharmafly.com/#/slides/1 | Presentation, example, almost tutorial | I know him (carolina) | Yes, we are free to use and adapt at will | |
| Scraping by Examples | Code examples on how to scrape data from web sites | http://www.slideshare.net/alegomes/scraping-by-examples | Presentation, example, almost tutorial | I'm the author | CC | |
| Statistics Open For All | SOFA - Open source stats - user friendly | http://www.sofastatistics.com/home.php | Tool | | | |
| Cooper-Hewitt Collection | Collection from the National Design Museum, including digitized textiles, prints, graphic design | https://github.com/cooperhewitt/collection | Data | No | CC0 waiver | |
| Common Crawl | An open crawl of the web accessible to everyone as a public dataset on Amazon. | http://www.commoncrawl.org | Data | I'm involved in the project (davelester) | | |
| Common Crawl Quick Start - Build from Github | Guide to running an example MapReduce job with Common Crawl data. | https://commoncrawl.atlassian.net/wiki/display/CRWL/Quick+Start+-+Build+from+Github | Tutorial | "" | | |
| Pentaho - Data Integration (Kettle) | Pentaho Kettle enables IT and developers to access and integrate data from any source, and deliver it to your business applications, all from within an intuitive and easy to use graphical tool. | http://sourceforge.net/projects/pentaho/ | Tool | | Free, Open Source | John Paz, Tech Writer, Pentaho Corp (and School of Data Contributor) johnapaz@gmail.com |
| Research Data MANTRA | Research Data MANTRA is a course designed for PhD students and others who are planning a research project using digital data | http://datalib.edina.ac.uk/mantra/ | Courses | Yes | CC-By | |
| 7 ways to get data out of PDFs | Resource on how to get data out of PDFs from the Help me investigate team. | http://helpmeinvestigate.posterous.com/7-ways-to-get-data-out-of-pdfs | Tutorial | No | - | Firat Gelbal |
| PDF Solutions | Resource on how to get data out of PDFs | http://www.investintech.com/prod_a2e_pro.htm | Tool | No | - | Firat Gelbal |