Day 2
Data Science Summer Institute 2024
What is it? Why is it important?
Data Visualization
London, early 1850s
Cholera was rampant throughout London and no one knew why.
Illustration from Punch (1852).
Miasmas and Miasmatists
Bad smells given off by waste and rotting matter were believed to be the main source of disease.
Suggested remedies:
Staunch believers:
John Snow, 1813-1858
Data Visualization
John Snow, 1813-1858
Other famous data visualizations
Other famous data visualizations
Charles Minard: Napoleon's Russian campaign (1869) (Video)
Other famous data visualizations
Florence Nightingale: "Diagram of the causes of mortality…" (1858)
Other famous data visualizations
W.E.B Dubois: "The Exhibit of American Negroes" (1900)
What makes for an effective data visualization?
Text Analysis
Classroom Activity
I. YOU don't know about me without you have read a book by the name of The Adventures of Tom Sawyer; but that ain't no matter. That book was made by Mr. Mark Twain, and he told the truth, mainly. There was things which he stretched, but mainly he told the truth. That is nothing. I never seen anybody but lied one time or another, without it was Aunt Polly, or the widow, or maybe Mary. Aunt Polly--Tom's Aunt Polly, she is--and Mary, and the Widow Douglas is all told about in that book, which is mostly a true book, with some stretchers, as I said before. Now the way that the book winds up is this: Tom and me found the money that the robbers hid in the cave, and it made us rich. We got six thousand dollars apiece--all gold. It was an awful sight of money when it was piled up. Well, Judge Thatcher he took it and put it out at interest, and it fetched us a dollar a day apiece all the year round--more than a body could tell what to do with. The Widow Douglas she took me for her son, and allowed she would sivilize me; but it was rough living in the house all the time, considering how dismal regular and decent the widow was in all her ways; and so when I couldn't stand it no longer I lit out...
I. YOU don't know about me without you have read a book ...
II. WE went tiptoeing along a path amongst the trees bac ...
III. WELL, I got a good going-over in the morning from o ...
IV. WELL, three or four months run along, and it was wel ...
V. I had shut the door to. Then I turned around and ther ...
VI. WELL, pretty soon the old man was up and around agai ...
VII. "GIT up! What you 'bout?" I opened my eyes and look ...
VIII. THE sun was up so high when I waked that I judged ...
IX. I wanted to go and look at a place right about the m ...
X. AFTER breakfast I wanted to talk about the dead man a ...
XI. "COME in," says the woman, and I did. She says: "Tak ...
XII. IT must a been close on to one o'clock when we got ...
XIII. WELL, I catched my breath and most fainted. Shut u ...
XIV. BY and by, when we got up, we turned over the truck ...
XV. WE judged that three nights more would fetch us to C ...
XVI. WE slept most all day, and started out at night, a ...
XVII. IN about a minute somebody spoke out of a window w ...
XVIII. COL. Grangerford was a gentleman, you see. He was ...
XIX. TWO or three days and nights went by; I reckon I mi ...
XX. THEY asked us considerable many questions; wanted to ...
XXI. IT was after sun-up now, but we went right on and d ...
XXII. THEY swarmed up towards Sherburn's house, a-whoopi ...
XXIII. WELL, all day him and the king was hard at it, ri ...
XXIV. NEXT day, towards night, we laid up under a little ...
XXV. THE news was all over town in two minutes, and you ...
XXVI. WELL, when they was all gone the king he asks Mary ...
So many!
Tools for Data Visualization
CoDAP (https://codap.concord.org)
CODAP is free open source software for data analysis built for use in schools. With CODAP, you can explore, visualize, and learn from data in any content area. Their mission is to make data literacy accessible for all students.
Dataset: Chronic Kidney Disease (159 observations)
Jupyter Notebooks (https://jupyter.org/)
Dataset: World Data Statistics (41,415 observations)
Google Sheets (https://sheets.google.com)
Dataset: World Data Statistics (41,415 observations)
Places to find data