1 of 41

DATA

Katie Akateh

June 6th, 2025

LITERACY

2 of 41

Goals

  • Understand what it means to be “data literate” and why it’s important.
  • Define Data and describe types of data.
  • Know the steps of working with data in research.
  • Learn best practices related to data visualization.

3 of 41

Take the Quiz -

How Data Literate are you?

4 of 41

Data literacy & why it’s important

DATA LITERACY = THE ABILITY TO READ, EXPLORE, UNDERSTAND, AND COMMUNICATE DATA TO MAKE DECISIONS AND SOLVE PROBLEMS

"The world’s most valuable resource is no longer oil, but data."

--The Economist, 2017

Data literacy is a continuum

5 of 41

Data literacy -

Technical and Non Technical Skills

Non-Technical Data Skills

Technical Data Skills

  • Analysis
  • Visualization
  • Data Management
  • Mathematics/ Statistics
  • Programming Languages
  • Critical Thinking
  • Curiosity
  • Subject-Area Knowledge
  • Communication
  • Problem-Solving Skills

6 of 41

What is data?

DATA* = FACTUAL INFORMATION THAT IS SYSTEMATICALLY RECORDED AND ANALYZED TO ANSWER A QUESTION

*Definition may vary by discipline

7 of 41

Data comes in many different forms!

Harvard College Alcohol Survey 2001

Darwin’s finches from the Galapagos Islands (beak adaptation to specific types of foods present on different islands inspired Darwin’s theory of evolution by natural selection)

Rosalind Franklin’s x-ray diffraction image of crystalized DNA (evidence of a double helix structure)

8 of 41

Types of data

  • Quantitative data deals with quantities, i.e., information that can be counted, measured, or otherwise expressed using numbers
    • Summarized and analyzed using traditional statistics or related methods
  • Mixed data combines quantitative and qualitative data
    • Analyzed using mixed (quantitative and qualitative) methods
  • Qualitative data deals with qualities or characteristics, i.e., information that is descriptive nature, and cannot be easily expressed in numbers. Such as Country of Origin, gender, name, hair color.
    • Sources of qualitative data: text documents, interview transcripts, images, audio and video recordings, other
    • Requires qualitative summary and analysis (not statistics)

9 of 41

Activity - What Type of Data

10 of 41

Activity

Tell us about your project.

  1. What types of data are you planning to use?

  1. Where do you plan to search for them? (Who may have already collected the data, how, and why?)

11 of 41

Steps of working with data

Research Data Lifecycle. Adapted from UK Data Service Model 2017.| Source Queensland University of Technology, Advanced Information Research Skills

12 of 41

Plan: Formulate a question or hypothesis

  • What do we (people) already know about the topic? (literature review)
  • What do you want to know?
  • What do you expect to find (e.g., pattern of results, differences between groups or conditions, cause and effect), and why?

13 of 41

Collect & Capture Data

Two primary approaches:

  1. Collect new data
  2. Find an existing dataset 🡺 OPEN DATA

14 of 41

Strategies to find data

15 of 41

3 strategies to find open data

There are many open-access, publicly available datasets online that you can use!

  1. Find an established data repository, and search for a dataset by topic or other attributes.
  2. Find a published research article and locate the original dataset used.
  3. Think about who has an interest in collecting this information.

16 of 41

Activity

Tell us about your project.

  • What types of data are you planning to use?

  • Where do you plan to search for them? (Who may have already collected the data, how, and why?)

17 of 41

3. Process your Data

Even if you didn’t collect the data, understanding the data is critical to interpreting the results!

  • How was the data collected? Who collected it? When and where? With what measures or instruments? Using what study design? Primary research question of the study? Source of funding?
  • Which variables will you look at to answer your question or test your hypothesis? (The Codebook is your friend.)

18 of 41

Data Wrangling (Cleaning) Tips

Tips: Document all the changes you make to your data files, no matter how small, so you (or someone else) can repeat/ replicate your processing steps, your analyses, and ultimately your results

  • Save your working data file with a new name; keep the original secure
  • Consider the tool you will use for data analyses or visualizations – and structure your data for that tool
  • Check for missing data, and decide how to deal with it
  • Be careful and consistent at each step to avoid errors

19 of 41

4. Analyze

This involves applying statistical or mathematical techniques to the data to discover patterns, relationships, or trends.

The goal is to find the right analysis or the right visualization

to answer your question or test your hypothesis.

Things to consider:

  • Type of data (e.g., quantitative, qualitative, etc.)
  • Limitations of the data
  • Tool/s you intend to use (e.g., statistical software)
  • Be as simple as you can be – but no simpler

20 of 41

Break

21 of 41

5. Visualize

After the data is analyzed, the next step is to interpret the results and visualize them in a way that is easy to understand.

Data visualization helps to make complex data more understandable and provides a clear picture of the findings.

22 of 41

Activity

Share your visualization

  1. What is your visualization describing?

  1. What do you notice?
  2. What do you wonder?

23 of 41

Visual Design Principles

The visual design of your charts is about emphasis, consistency, and clarity!

  • Selecting Chart type (and some to avoid)
  • Color
  • Text
  • Accessibility

24 of 41

"Data visualization is part art and part science. The challenge is to get the art right without getting the science wrong and vice versa." -- Claus Wilke

25 of 41

Know Your Purpose

Are You?

Comparing categories?

Compare variables across categories.

Showing part to whole?

Relate the part of a variable to the total.

Explaining distribution?

Showing values in the dataset and how often they occur.

Describing relationships?

Show correlations among two or more variables.

Displaying change over time?

Emphasize changing trends. Can be short or long time periods.

Visualizing spatial data?

Relates data to geographies. Use when geographic locations are most important to audience.

Selecting a Chart: Consider your Purpose

26 of 41

27 of 41

United States Transgender Survey, 2015 - National Center for Transgender Equality

Some Chart Types are difficult to Interpret

28 of 41

Some Chart Types are difficult to Interpret

29 of 41

Chart Type: Lengths are easier to interpret

30 of 41

Color

  • Should have a purpose for the data
  • Consider your audience

sequential

diverging

qualitative

less

more

hot

cold

neutral

group 1

group 2

group 3

group 4

group 5

31 of 41

US Social Media Usage by Ann Pregler

32 of 41

Text

Text

  • Use brief, descriptive titles
  • Text should be horizontal, not vertical
  • Use callouts or icons for context
  • Use a large font size
  • Proportional, accurate axes

33 of 41

Accessibility: Color Contrast

Accessibility: Color Contrast

Color contrast: text colors should stand out against the background (at least 4.5:1 contrast ratio in a contrast checker)

US Social Media Usage by Ann Pregler

34 of 41

Accessibility: Color Combinations

Accessibility: Color

Color combinations: avoid combinations that will appear too similar to color-blind users:

  • Red and green
  • Blue and green
  • Yellow and red
  • Purple and red
  • Yellow and pink

and/or use more than just color to mark things (pattern, shade, saturation, labels).

35 of 41

The Dude Map by Jack Grieve and Diansheng Guo

36 of 41

Accessibility: Alternative Text (alt text)

Accessibility: Alt Text

All images need descriptive alt text, including visualizations.

  • Provides information about images to people using screen readers.
  • Be concise but descriptive.
  • Include text from the image.
  • Context is important!
  • What does your audience need to know?

37 of 41

Alt Text: example 1

A pie chart titled, "What are 5th Graders Reading?" that shows that 96.5% are reading fiction and 3.5% are reading non-fiction.

38 of 41

A scatter plot

A scatter plot showing that MoMA keeps its collection current.

39 of 41

Activity

  • Cat/Dog Data Visualization
    • Start asking questions of the data.
    • Construct a data visualization that
      • Answers a question.
      • Aligns with best practices.
    • Write alt text for your visual.

Let’s practice these data viz ideas!

40 of 41

6. Share and Communicate Results

  • Go back to your initial research question or hypothesis – and now answer it with data

  • Consider your audience, their needs, interests, and level of knowledge, and how they will use the results

  • The goal is to tell a clear, accurate, logical, and compelling story with your data

41 of 41

Thank you!