smalldatajournalism.com
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
View only
 
 
ABCDEFGHIJKLMNOPQRSTUVWX
1
titleincdate_addedsource_urltypecategoryauthordatekeywordsdescriptionprioritycost?
2
The Data Journalism Handbooky9/4/2013http://datajournalismhandbook.org/BookReportingThe European Journalism Center and the Open Knowledge Foundation2012An international collaboration of the best of data journalism worldwide.
3
When Maps Shouldn’t Be Mapsy9/4/2013http://www.ericson.net/content/2011/10/when-maps-shouldnt-be-maps/ArticleVisualizationMatthew Ericson10/14/2011mappingComputers have made mapping almost too easy, to the point where we overlook clearer, more efficient ways to display data that isn't particularly geospatially interesting.
4
A fundamental way newspaper sites need to change9/4/2013http://www.holovaty.com/writing/fundamental-change/ArticleCurationAdrian Holovaty9/6/2006A hallmark essay on how data can be used in the very core of journalsitic operations, by the journalist who created the popular Django framework. Too bad his advice hasn't been much heeded.
5
Serious Fun With Numbersy9/4/2013http://www.cjr.org/reports/serious_fun_with_numbers.php?page=allArticleReportingJanet Paskin with Daniel Gilbert11/9/2010excelDaniel Gilbert had spreadsheets with thousands of rows of data. "There was a story there, he was certain. But control-f would not find it." After he took a class on how to use Microsoft Access, he could finally ask and answer the questions he knew were in the data. Gilbert's meticulously detailed series on landowners being cheated by oil and gas companies ended up winning a Pulitzer.
6
My Favorite (Excel) Thingsy9/4/2013http://extra.twincities.com/car/mj/ExcelClassHandout.pdfArticleCollectionMaryJo Webster2010excelA short readable guide to one of the most vital and fundamental data journalism tools.
7
Design Principles for News Apps & Graphics9/4/2013http://www.propublica.org/nerds/item/design-principles-for-news-apps-graphicsArticleVisualizationLena Groeger5/30/2013design, news-apps, mappingA well-lillustrated list of design concepts to bring to visualizations and news interactives.
8
Visual Literacy in an Age of Datay9/4/2013http://source.mozillaopennews.org/en-US/learning/visual-literacy-age-data/ArticleVisualizationShazna Nessa6/13/2013design, news-appsAre modern web interactives becoming too complicate for users to interpret? Probably.10
9
PowerPoint does Rocket Science - and Better Techniques for Technical Reports9/4/2013http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001yB&topic_id=1ArticleVisualizationEdward Tuftepowerpoint, news-appsHow ugly formatting, including improperly bolded text, can obscure the important information in technical presentations.10
10
Pie Chartsy9/4/2013http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=00018SArticleVisualizationEdward Tufte discussion boardchartsA great discussion between Edward Tufte and his readers on why pie charts aren't ideal.10
11
Does High Public Debt Consistently Stifle Economic Growth? A Critique of Reinhart and Rogoff9/4/2013http://www.peri.umass.edu/236/hash/31e2ff374b6377b2ddec04deaa6388b1/publication/566/PaperCurationThomas Herndon, Michael Ash, and Robert Pollin4/15/2013excelAn influential economic analysis by two Harvard economists, based on a spreadsheet snafu? This paper and the articles that followed are a bag of laughs.
12
Common Static Visualization Types9/4/2013http://guides.library.duke.edu/vis_typesGuideVisualizationAngela Zoss2010chartsChart types, from simple to exotic, in tabular form.
13
Using Google Refine to Clean Messy Data9/4/2013http://www.propublica.org/nerds/item/using-google-refine-for-data-cleaningGuideCurationDaniel Nguyen12/30/2010data-cleaningGoogle Refine is now Open Refine and its interface and feature sets have improved quite a bit since I wrote this essay. But it's still a good overview of how a single digital tool can make a data-intensive investigation possible.
14
Using Small Multiples to get to ‘Aha!9/4/2013http://37signals.com/svn/posts/266-using-small-multiples-to-get-to-ahaArticleVisualizationMatt Linderman2/13/2007chartsApplying Tufte's principle of small multiples to online video thumbnails.6
15
Beautiful spider loses its way9/4/2013http://junkcharts.typepad.com/junk_charts/2013/08/beautiful-spider-loses-its-way-.htmlArticleVisualizationKaiser FungCharts can be too beautiful for their own good.10
16
If correlation doesn’t imply causation, then what does?9/4/2013http://www.michaelnielsen.org/ddi/if-correlation-doesnt-imply-causation-then-what-does/ArticleAnalysisMichael Nielsen1/23/2012Predictably, the answer is very complicated and involves calculus.
17
Know Your Stats9/4/2013http://source.mozillaopennews.org/en-US/learning/know-your-stats/ArticleAnalysisDave Stanton7/25/2013A short list of statistical snafus as applied to data journalism.8
18
The Perils of Polling Twitter9/4/2013http://source.mozillaopennews.org/en-US/learning/perils-polling-twitter/ArticleAnalysisJacob Harris7/18/2013A nice explanation of selection bias, in the context of Twitter and other social media.
19
How the Data Sausage Gets Made9/4/2013http://source.mozillaopennews.org/en-US/learning/how-sausage-gets-made/ArticleCurationJacob HarrisThe horror of scraping government data.10
20
Freeing the Plum Book9/4/2013http://source.mozillaopennews.org/en-US/learning/freeing-plum-book/ArticleCurationDerek WillisA brilliant example of a (legal) hack to get public data.9
21
Sane Data Updates are Harder Than You Think9/4/2013http://source.mozillaopennews.org/en-US/learning/sane-data-updates-are-harder-you-think/ArticleCurationAdrian Holovaty3/11/2013A three-part series to just how complicated just doing data updates can be.10
22
Culture Hacking with a Staff Database9/4/2013http://codeascraft.com/2013/05/31/culture-hacking-with-a-staff-database/ArticleCurationIan Malpass Etsy found interesting and practical ways to mash up the (non-HR-private) data of its staff.10
23
For Example9/4/2013http://bost.ocks.org/mike/example/ArticleVisualizationMike BostockAn awe-inspiring visual essay from the creator of D3, its purpose "to share my love of examples with you."10
24
The different sketch styles of the designers at 37signals9/4/2013http://37signals.com/svn/posts/1880-the-different-sketch-styles-of-the-designers-at-37signalsArticleVisualizationJason Fried 8/27/2009Drawn sketches can hint at what's visually compelling to the designer.
25
Why we skip Photoshop9/4/2013http://37signals.com/svn/posts/1061-why-we-skip-photoshopArticleVisualizationJason Fried This is not really about data, but about the details of the creation process, and the importance of streamlining your workflow.
26
Common MySQL Queries9/4/2013http://www.artfulsoftware.com/infotree/queries.phpGuideCurationPeter Brawley and Arthur FullerIf you're new to SQL, or even if you're skilled at it, this is one of the best collections of virtually every half-way complicated query you can dream of.10
27
Machine Learning in R: Clustering9/4/2013http://horicky.blogspot.com/2012/04/machine-learning-in-r-clustering.htmlGuideAnalysisRicky Ho
28
Getting your heart rate using R and Ruby9/4/2013http://blog.airbrake.io/guest-post/exploring-everything/GuideCurationSau Sheong ChangA clever example of how to use code and available technology to track variables in the "meatspace"
29
Story hunting in birth, death data9/4/2013http://www.anthonydebarros.com/2010/09/07/data-analysis-births-deaths/ArticleReportingAnthony DeBarros
30
Word clouds considered harmful9/4/2013http://www.niemanlab.org/2011/10/word-clouds-considered-harmful/ArticleVisualizationJacob HarrisAnother popular web visualization that is more junk than data.10
31
A Gentle Introduction to SQL using SQLite9/4/2013https://github.com/tthibo/SQL-Tutorial/blob/master/tutorial_files/part1.textileGuideCurationTroy Thibodeaux
32
I had no idea how to make custom maps, so I learnt by doing. You should too.9/4/2013http://blogging.alastair.is/i-had-no-idea-how-to-make-custom-maps-so-i-learnt-by-doing-you-should-too/GuideVisualizationAlastair CooteWhen you want something done beautifully, sometimes you have to do it yourself.10
33
Console productivity hack: Discover the frequent; then make it the easy9/4/2013http://matt.might.net/articles/console-hacks-exploiting-frequency/GuideCurationMatt Might
34
Finding Stories in the Structure of Data9/4/2013http://source.mozillaopennews.org/en-US/learning/finding-stories-structure-data/ArticleReportingMatt WaiteHow the structure of data can create new kinds of storytelling.
35
Data Mining: Finding Similar Items and Users9/4/2013https://www.bionicspirit.com/blog/2012/01/16/cosine-similarity-euclidean-distance.htmlGuideAnalysisAlexandru Nedelcu
36
Data Mining the Web: $100 Worth of Priceless9/4/2013http://blog.luckyoyster.com/post/33592990831/data-mining-the-web-100-worth-of-pricelessArticleCurationA high-level description of the engineering used to mine and index 3.4 billion web pages in about 14 hours.
37
A Programmer's Guide to Data Mining9/4/2013http://guidetodatamining.com/BookCurationRon ZacharskiA guide to practical data mining, collective intelligence, and building recommendation systems
38
Think Stats9/4/2013http://www.greenteapress.com/thinkstats/BookAnalysisAllen B. DowneyAn introduction to Probability and Statistics for Python programmers.
39
Think Bayes9/4/2013http://www.greenteapress.com/thinkbayes/thinkbayes.pdfBookAnalysisAllen B. DowneyAn exploration of Bayesian statistics, using the Python programming language
40
Predictive Analytics: Data Preparation9/4/2013http://horicky.blogspot.com.au/2012/05/predictive-analytics-data-preparation.htmlGuideCurationRicky Ho
41
Big Data Tutorials9/4/2013http://nealcaren.web.unc.edu/big-data/GuideCurationNeal Caren
42
Bastards Book of Regular Expressions9/4/2013http://regex.bastardsbook.com/BookCurationDaniel NguyenIf there's one data-related skill I wish I had learned years ago, it would be regular expressions.10
43
Data Analysis for Politics and Policy9/4/2013http://www.edwardtufte.com/tufte/ebooksBookAnalysisEdward Tufte
44
Up and Down the Ladder of Abstraction9/4/2013http://worrydream.com/LadderOfAbstraction/ArticleVisualizationBret Victor9
45
Scientific Communication as Sequential Art9/4/2013http://worrydream.com/#!/ScientificCommunicationAsSequentialArtArticleVisualizationBret Victor
46
How to Design an Infographic: How Many Households9/4/2013http://worrydream.com/#!/HowManyHouseholdsArticleVisualizationBret VictorA critique of a well-liked New York Times interactive.8
47
Attempts to Map Music9/4/2013http://dabrownstein.wordpress.com/2013/08/03/attempts-at-mapping-music/ArticleVisualizationDaniel Brownstein 8/3/2013
48
Mapping NYC stop and frisks: some cartographic observations9/4/2013http://spatialityblog.com/2012/07/27/nyc-stop-frisk-cartographic-observations/ArticleVisualizationSteven Romalewski7/27/2012A critique of map design and thresholds, in the contest of the NYPD's stop-and-frisk data9
49
How 2 Blade staffers overcame obstacles to cover Toledo’s gangs9/4/2013http://www.toledoblade.com/Police-Fire/2013/04/28/How-2-Blade-staffers-overcame-obstacles-to-cover-Toledo-s-gangs.htmlArticleCurationTaylor Dungjen10
50
Design Tip: Never Use Black9/4/2013http://ianstormtaylor.com/design-tip-never-use-black/ArticleVisualizationIan Storm Taylor8
51
The Greatest Paper Map of the United States You’ll Ever See9/4/2013http://www.slate.com/articles/arts/culturebox/2012/01/the_best_american_wall_map_david_imus_the_essential_geography_of_the_united_states_of_america_.htmlArticleVisualizationSeth StevensonA case for where hand-editing has advantages over algorithmic rendering.
52
Visualizing Slavery9/4/2013http://opinionator.blogs.nytimes.com/2010/12/09/visualizing-slavery/ArticleVisualizationSusan Schulten
53
People Power Prevails!9/4/2013http://source.mozillaopennews.org/en-US/learning/people-power-prevails/ArticleCurationJohn Keefe8
54
Cicada Tracker9/4/2013http://project.wnyc.org/cicadas/GuideCurationWNYC and RadiolabEasily one of the most esoteric and brilliant crowdsourcing data projects yet conceived.10
55
GIS Analyses of Dr. Snow's Map 9/4/2013http://www.udel.edu/johnmack/frec480/cholera/cholera2.htmlGuideVisualizationJohn Mackenzie
56
Mapping Chicago’s shooting victims9/4/2013http://blog.apps.chicagotribune.com/2013/07/15/mapping-chicagos-shooting-victims/GuideVisualizationAndy Boyle
57
Proto-analysis of Boston Globe Traffic on Facebook9/4/2013http://sonya2song.blogspot.com/2013/07/proto-analysis-of-boston-globe-traffic.htmlArticleAnalysisSonya Song
58
Under the Hood of the Open Gender Tracker9/4/2013http://source.mozillaopennews.org/en-US/articles/under-hood-open-gender-tracker/ArticleCurationIrene Ros and Nathan Matias
59
Creating a Metric for News Apps9/4/2013http://brianabelson.com/open-news/2013/03/18/A-Metric-For-News-Apps.htmlArticleAnalysisBrian AbelsonA great explainer of how to implement real metrics to the field of news applications.10
60
Matching Methods for Causal Inference9/4/2013http://biostat.jhsph.edu/~estuart/Stuart10.StatSci.pdfGuideAnalysisElizabeth A. Stuart
61
Style and Substance: Analyzing a Beach Ball Chart9/4/2013http://blog.apps.chicagotribune.com/2012/02/07/style-and-substance-analyzing-a-beach-ball-chart/ArticleVisualizationJoe GermuskaThe beach ball chart is about as dumb as it sounds.10
62
Better web cartography with dot density maps and new tools9/4/2013http://blog.apps.chicagotribune.com/2011/08/12/better-web-cartography-with-dot-density-maps-and-new-tools/GuideVisualizationChristopher Groskopf
63
How Not to Lie with Statistics: Avoiding Common Mistakes in Quantitative Political Science9/4/2013http://dash.harvard.edu/bitstream/handle/1/4455012/not%20lie%20stat.pdf?sequence=1GuideAnalysisGary King
64
How we identified America's 50 worst charities9/4/2013http://www.tampabay.com/news/business/how-we-identified-americas-50-worst-charities/2124085ArticleReportingKendall Taggart and Kris Hundley6/6/2013How exactly do you define "worst"? Here's a high-level description of the factors reporters used to rank the performance of charity, and the amount of data they had to sift and analyze to do so.8
65
Example story memos and assessments for class on Real-Time Data; Using APIs9/4/2013http://dwillis.github.io/data-reporting/apr25.htmlGuideReportingDerek Willis
66
Analyzing the Analyzers: An Introspective Survey of Data Scientists and Their Work9/4/2013http://oreilly.com/data/stratareports/analyzing-the-analyzers.csp?intcmp=il-strata-ebooks-analyzing-the-analyzers-strata-right-railBookAnalysisHarlan Harris, Sean Murphy, Marck Vaisman
67
Drawing Conclusions from Data9/4/2013http://source.mozillaopennews.org/en-US/learning/statistically-sound-data-journalism/ArticleAnalysisJonathan Stray
68
Making the WNYC Census Map9/4/2013http://johnkeefe.net/47474697GuideVisualizationJohn Keefe
69
Months after West blast, a report highlights what we don’t know about chemical safety
9/4/2013http://www.cjr.org/united_states_project/laurel_dallas_morning_news_investigation_chemical_safety_data.php?page=allArticleReportingRichard Parker
70
Principles of Information Display for Visualization Practitioners
9/4/2013http://www2.cs.uregina.ca/~rbm/cs100/notes/spreadsheets/tufte_paper.htmlArticleVisualizationAl Globus
71
Small Multiples with Details on Demand9/4/2013http://vallandingham.me/small_multiples_with_details.htmlGuideVisualizationJim Vallandingham
72
The Rhetoric of Data9/4/2013http://towcenter.org/blog/the-rhetoric-of-data/ArticleReportingNicholas Diakopoulos
73
Mapping Millions of Dots9/4/2013http://www.mapbox.com/blog/mapping-millions-of-dots/ArticleVisualizationEric FischerMillions and millions of beautiful dots, without crashing your computer
74
Visualizing Data Uncertainty: An Experiment with D3.js9/4/2013http://blog.velir.com/index.php/2013/07/11/visualizing-data-uncertainty-an-experiment-with-d3-js/ArticleVisualizationAlex Krusz
75
Hexbins!9/4/2013http://indiemaps.com/blog/2011/10/hexbins/ArticleVisualizationZachary Forest Johnson10/18/20119
76
Election 2012: Electoral combinations9/4/2013http://blog.apps.npr.org/2012/11/13/election-2012-generating-the-combinations.htmlGuideVisualizationChristopher Groskopf
77
The NYT's Visual Election Outcome Explorer9/4/2013http://source.mozillaopennews.org/en-US/articles/nyts-512-paths-white-house/ArticleVisualizationMike BostockAn behind-the-scenes explanation of how one of the best web visualizations ever was built.10
78
Journalism in the Age of Data9/4/2013http://datajournalism.stanford.edu/ArticleReportingGeoff McGhee9/28/2010A video report on data visualization as a storytelling medium
79
Counting the Jay-Z subway crowd
9/12/2013http://johnkeefe.net/counting-the-jay-z-subway-crowdArticleReportingJohn Keefe2012-09Using the MTA turnstile data to see what kind of crowds Jay-Z drew at Jay-Z's inaugural concert with Barclays Center.7
80
Mapping Dollars in a District9/12/2013http://johnkeefe.net/dollars-in-a-districtGuideReportingJohn Keefe2011-09WNYC's political team wanted to know: how much money was being raised by candidates for a state legislative district from within the district itself? John Keefe walks through his use of Fusion Tables and additional database techniques for the analysis and visualization.
9
81
Free, Live Election data: Now's your chance to play9/12/2013http://johnkeefe.net/96855599GuideVisualizationJohn Keefe2012How to use Google-provided election data to build a Fusion Table-powered map.9
82
Making the NYC Evacuation Map
9/12/2013http://johnkeefe.net/making-the-nyc-evacuation-mapArticleReportingJohn KeefeOne day John Keefe was just browsing the official NYC data site and tucked away some code that involved the city's flood zones. When Hurricane Irene stuck, NYC.gov was flooded (with web traffic) and WNYC became an invaluable resource.10
83
Mapping Campaign Contributions on the Fly
9/12/2013http://johnkeefe.net/campaign-finance-mapping-on-the-flyGuideVisualizationJohn KeefeWith Fusion Tables and a little Ruby magic, John Keefe describes how he mapped contributions to presumptive NYC mayoral candidates by zip code by lunchtime.8
84
We Have Met the Enemy and He Is PowerPoint9/12/2013http://www.nytimes.com/2010/04/27/world/27powerpoint.htmlArticleVisualizationElisabeth Bumiller“When we understand that slide, we’ll have won the war.”10
85
The Data Driven Congressional Reporter9/12/2013http://thescoop.org/archives/2012/12/26/the-data-driven-congressional-reporter/ArticleReportingDerek WillisWhy aren't more Congressional reporters taking advantage of the copious generated by our legislators?8
86
On Legislative Data Transparency9/12/2013http://thescoop.org/archives/2012/02/04/on-legislative-data-transparency/ArticleCurationDerek Willis2/4/2012This says everything you need to know about the complexity and annoyance of real-life data: "For example, in the Senate it is possible for the Majority Leader and Minority Leader to alter the rules of math when it comes to how many senators constitute a three-fifths majority."10
87
The NYPD's crash data has a booboo. Here's a band-aid.
9/12/2013http://blog.accursedware.com/nypd-crash-data-band-aid/ArticleCurationJohn Krauss2/16/2012Citizen data journalism at its best, from John Krauss: "I already knew that they were finally releasing -- after the Council forced them to -- crash data as idiotically obfuscated PDFs, but reading that they justified this out of concern for "the integrity of the data," was so galling that it goaded me into action. I would make the data accessible as friendly, parseable CSVs."10
88
White House Safety Datapalooza data journalism talk9/14/2013http://blog.chryswu.com/2012/09/14/the-transcript-of-my-white-house-datapalooza-data-journalism-talk/ArticleReportingChrys Wu
89
The NYT's Amanda Cox on Winning the Internet9/14/2013http://source.mozillaopennews.org/en-US/articles/nyts-amanda-cox-wins-internet/ArticleVisualizationErin KissaneA visualization of what makes great visualizations at the New York Times8
90
The One-Query Story9/14/2013https://docs.google.com/presentation/d/1QB2pfbo2sOupG8qCjdwonCw4J-hl8gsdzrwQI8QAcdQ/edit?pli=1#slide=id.gaf5af151_00GuideReportingKate Martin, Grant Smith, Megan LutherA brilliant walkthrough of benign data sets and the code that could potentially launch investigative projects.10
91
Disputes over crime maps highlight challenge of outsourcing public data9/14/2013http://www.poynter.org/latest-news/making-sense-of-news/213443/disputes-over-crime-maps-highlight-challenge-of-outsourcing-public-data/ArticleReportingAdam Hochberg
92
Oscar Predictions, Election Style9/14/2013http://fivethirtyeight.blogs.nytimes.com/2013/02/22/oscar-predictions-election-style/ArticleAnalysisNate SilverThe master of presidential predictions explains why he's not so great at predicting Oscar winners.10
93
The Gettysburg Powerpoint Presentation9/14/2013http://www.norvig.com/Gettysburg/index.htmArticleVisualizationPeter NorvigHistory's most memorable 3D-column chart.
94
Nightingale's Rose9/14/2013http://dd.dynamicdiagrams.com/2008/01/nightingales-rose/ArticleVisualizationHenry WoodburyA modern critique of Florence Nightingale's, "Diagram of the Causes of Mortality"
95
chartsnthings9/14/2013http://chartsnthings.tumblr.com/BlogVisualizationKevin QuealyA (personal) blog of data sketches from the New York Times Graphics Department. Maintained by @KevinQ.
96
How the Sun Sentinel reported its Pulitzer Prize winning coverage of off-duty cops9/14/2013http://www.ire.org/blog/ire-news/2013/04/15/how-sun-sentinel-reported-its-pulitzer-prize-winni/ArticleReportingSally Kestin and John MainesThis 2013 Pulitzer Public Service winning story epitomizes the best of watchdog journalism, as its expose of a deadly public injustice sparked swift reform. But it deserves an award for its ingenious do-it-yourself way of collecting the kind of data that, in theory, was non-existent.10
97
How Gannett Wisconsin Media gathered salary data from cities, counties and state agencies9/15/2013http://ire.org/blog/ire-news/2013/04/17/behind-story-how-gannett-wisconsin-media-gathered-/ArticleReportingSarah Harkins
98
Simulation Study of a Laundry Room9/15/2013http://iaindunning.com/2013/simulation-study-of-a-laundry-room.htmlGuideAnalysisIain DunningSimulations don't yet have much place in journalism, but I like this analysis of laundry room behavior because it acknowledges the limitation of analysis and the impact of assumptions, as well as intuitively walking through the design of the simulation (with Julia code snippets). Also, it tackles a question near to all of us.
99
Evil Analytics: How What You Measure Becomes Your Master9/15/2013http://www.forbes.com/sites/ryanholiday/2012/05/29/evil-analytics-how-what-you-measure-becomes-your-master/ArticleAnalysisRyan Holiday
100
If You Liked This, You’re Sure to Love That9/16/2013http://www.nytimes.com/2008/11/23/magazine/23Netflix-t.html?_r=1&pagewanted=allArticleAnalysisClive Thompson"Merely knowing who people are, paradoxically, isn’t very predictive of their movie tastes."10
Loading...
Main menu