1 of 86

A WORLD OF POSSIBILITIES

Lassoing satellite data

for global and local

investigations

Access this slideshow at

tinyurl.com/nicar-satellite-data

2 of 86

Introduction

Carl Churchill, Wall Street Journal

Deborah Nelson, University of Maryland

Laura Kurtzberg, Florida International University

3 of 86

RASTER DATA

  • Values are stored in discrete pixels of an image, usually geotiff
  • Pixels contain color values (red, green, blue) or a research value
  • Raster data often comes from analyzing satellite data

Akhund, Sadig. (2022). Analysis of Spatial Big Data for Geographical Information Systems. 10.13140/RG.2.2.20522.70080/2.

4 of 86

EXAMPLES

5 of 86

SATELLITE

DATA

PROJECT

EXAMPLE

The Financial Times | Steve Bernard

September 2020

6 of 86

SATELLITE

DATA

PROJECT

EXAMPLE

ABC News Australia

How heat and drought turned Australia into a tinderbox

By Colin Gourlay, Tim Leslie, Matt Martino and Ben Spraggon

Story Lab

8 Feb 2020

7 of 86

RASTER

DATA

PROJECT

EXAMPLE

ProPublica

The Most Detailed Map of Cancer-Causing Industrial Air Pollution in the U.S.

by Al Shaw and Lylla Younes, November 2, 2021, Updated August 28, 2023

Additional reporting by Ava Kofman

Not All Raster Data is Satellite Data

8 of 86

Aquazônia

9 of 86

MAPPING IMPACTS ON WATER

Darker, deep blue areas represent low combined impact

Light yellow areas represent extreme impact on freshwater

10 of 86

MAPPING IMPACTS ON WATER

I calculated 9 different impacts per river microbasin using GEE

Cecília Gontijo Leal

Scientific Consultant,

Biologist &

Researcher

IIAA= 3*URB + 3*GAR + 3*HDL + 2*AGP + 2*MIN + 2*PRE + EST + HID + DEG

11 of 86

MAPPING IMPACTS ON WATER

12 of 86

📋 Data Sources

AWII:

Amazon Water Impact Index

  • Mapbiomas: land use maps of the entire country, and multi-country Amazon, using visual interpretation of Landsat satellite imagery
  • Study of forest degradation by Matricardi et al.
  • Synergize/CNPq research group calculation of precipitation change based on CHIRPS data
  • Amazon Network of Georeferenced Socio-Environmental Information (RAISG)
  • Agencia Nacional das Aguas

13 of 86

Constructing the site

14 of 86

THANK YOU!

Check out the tools, sources, and tutorials at the end of the slideshow →

15 of 86

Carl Churchill

The Wall Street Journal

Graphics Reporter

carl.churchill@wsj.com

Preparing and Downloading Satellite Image Stacks with GEE*

A world of possibilities: Lassoing satellite data for global and local investigations

NICAR 2024

*and some other stuff

16 of 86

Landsat 7 and 8 produce 1 terabyte of data every day.

17 of 86

Give me the stupid data

I want this on my computer

Where is it

This story is pubbing in 4 hours i have a liberal arts degree what am i doing

18 of 86

[And others]

19 of 86

Y

X

Z (time)

Bands

20 of 86

Sentinel-2 (actually twin satellites)

  • 5-6 days revisit time
  • 10 - 60m resolution
  • 13 bands

21 of 86

22 of 86

23 of 86

24 of 86

C a r r i z o P l a i n

25 of 86

Satellite image

26 of 86

1,100 official datasets

1,352 community-uploaded datasets

27 of 86

28 of 86

  1. Area
  2. Quality
  3. Bounds

Filters

Dataset

(already in GEE)

29 of 86

30 of 86

31 of 86

32 of 86

33 of 86

34 of 86

35 of 86

36 of 86

37 of 86

38 of 86

Vegetative health (EVI) 16-d comp

(day of year)

Vegetative health (EVI) 16-d comp

(forward from Jan 1st 2020)

EVI (Sentinel 2) = 2.5 * ((B8 – B4) / (B8 + 6 * B4 – 7.5 * B2 + 1)).

39 of 86

Inundated farmland after flooding

40 of 86

GPM

  • Satellite constellation
  • Data released every 3 hours
  • Precip estimate for every 30 minutes
  • 5 dedicated precipitation bands

Global Precipitation Measurement

41 of 86

42 of 86

43 of 86

44 of 86

45 of 86

46 of 86

47 of 86

48 of 86

49 of 86

Earth Engine Alternatives

Not everyone has time to learn coding

50 of 86

The EO Browser

51 of 86

Climate Engine

52 of 86

Giovanni

53 of 86

NASA EO

54 of 86

Tutorials!

55 of 86

Carl Churchill

The Wall Street Journal

Graphics Reporter

carl.churchill@wsj.com

The code for the hurricane GIF earlier:

https://code.earthengine.google.com/b8ff6cc38ab556fc3598f9483feb721d

(its a old photo)

Questions?

56 of 86

Deborah Nelson,

Professor of Investigative Journalism

Philip Merrill College of Journalism, University of Maryland

dnelson4@umd.edu

“Ground-truthing” satellite data

57 of 86

Satellite data can help document the what, where and how.

But you need to come back to earth

for the causes

and consequences

58 of 86

Ground-truth the data

to report on local conditions and dynamics

that can’t be seen from the satellites

or measured in the data.

FEET ON THE GROUND

Your own, or through collaboration with a local journalist or connection with trusted sources.

VIRTUAL FEET ON THE GROUND

The usual suspects: Clips, satellite imagery, street view mapping, social media, etc.

Key resource: Google Scholar. Type the location of interest along with your keywords into Google Scholar. You may be surprised at how often you get geographically relevant results.

- Mine the top for authors with scientific and geographic expertise.

- Mine the middle for geographic coordinates, location descriptions, methodology that may inform your own modeling and clues about local political/social/environmental dynamics.

- Mine the bottom -- footnotes and supplements -- for more sources, details & data

59 of 86

Most human viruses initially came from animals

-- an event known as zoonotic spillover.

A growing body of scientific evidence has linked spillover

to environmental factors….

Case study…

60 of 86

Some are natural -- like precipitation and temperature.

61 of 86

Others are tied to human activity -- tree loss, urbanization, intensive farming --

that increase our contact with wildlife and disturb habitats in ways thought

to make some animals more susceptible to developing and shedding viruses.

62 of 86

While doing some deep reading on the subject, I found a reference to a database of zoonotic outbreaks over the past 20 years.

in a footnote of a report. It is maintained by Gingko, a public benefit company, and includes location information.

63 of 86

He thought we might be able to predict where and why future outbreaks could occur by:

  1. mapping the locations of where past outbreaks began
  2. analyzing the ecological conditions surrounding those outbreaks
  3. and then identifying where else on earth similar conditions exist.

I mentioned all this to my longtime reporting partner at Reuters:

Ryan McNeill, deputy data editor in London

64 of 86

We decided to focus our analysis on viruses linked to bats because there were so many spillovers -- at least 95 since 2002.

And they represented some of the deadliest new diseases to emerge in the last half century: SARS, Ebola, Marburg and Nipah.

Although we don’t know how people were first infected by SARS-CoV-2, we do know it’s related to a family of bat viruses responsible for the 2003 SARS pandemic out of China.

65 of 86

DONT BLAME THE BATS! THE EARTH NEEDS THEM!

When a bat virus jumps to humans, it’s a warning that the ecosystem is way out of whack.

66 of 86

The deets:

Ryan recruited Reuters data journalist Allison Martell to help geolocate

95 documented spillovers 2002-2020 from that database. Researchers at the company provided us with a list of outbreaks of diseases in which bats are the reservoirs. Ryan and Allison mined studies, public health records and other documents to vet the list, expand on it and identify the locations of first known victims.

They then developed a machine learning model -- in consultation with scientists -- that analyzed 8 billion data points related to conditions around those spillovers, most derived from satellite data imagery drawn from Google Earth Engine.

The model included 56 covariates at a 25-square-kilometer resolution, including environmental, social and economic conditions such as land surface temperatures, precipitation, the estimated number of bat species present in an area and measures of deforestation and urban development, that scientific research has linked to spillover risk. (Bat species richness was calculated from IUCN Red List data.)

67 of 86

Based on the machine-learning analysis of past spillovers, we identified other places where similar conditions exist. We took those that scored in the 95th percentile and called them JUMP ZONES.

Jump zones = the highest risk areas areas on earth for spillover and potential starting points for future outbreaks, epidemics and pandemics.

68 of 86

69 of 86

WHAT WE FOUND:

9 million sq km of high-risk zones in 113 countries.

A 57% increase in people living in areas at highest risk for spillover over 2 decades -- Nearly 1.8 billion people -- 1 of 5 on the planet -- now live in these areas.

99% of highest-risk areas globally are in lower- and middle-income countries -- mostly tropical locales undergoing rapid urbanization, much of it driven by wealthy countries’ demand for raw materials.

70 of 86

Since 2020, the last year of the analysis:

  • 7 Ebola and Marburg outbreaks reported in Africa….
  • More than 20 Nipah outbreaks reported in Bangladesh and India

…all in geographic areas made up almost entirely of hotspots identified in our analysis -- including in countries with no prior known occurrences of the virus.

…all pathogens with potential to develop into an epidemic.

71 of 86

“GROUND-TRUTHING”* THE DATA

72 of 86

We used the analysis to zero-in on specific locations to better understand the changes that turned them into hotspots…. and to help to decide where to put reporters’ feet on the ground…

73 of 86

…or drones in the air…

74 of 86

…in order to report on local dynamics that couldn’t be seen from the satellites or measured in the data.

We collaborated with a dozen journalists to conduct field reporting on six continents.

75 of 86

The view from the ground in Ghana, where Marburg virus made its first known appearance in 2022.

Bat-eaten fruit and pulp underfoot in a deforeseted area where the virus infected a farmer and his baby.

76 of 86

IN INDIA

The data identified a dramatic increase in risk in the state of Kerala in the years leading up to Nipah’s first known appearance there in 2018.

77 of 86

The view from the sky on Google Earth Pro, showed rapid urbanization of once heavily wooded areas.

78 of 86

On the ground, Sreekanth and I found backyards full of fruit trees and a type of fruit bat known to carry Nipah roosting nearby in small patches of woods. They’d raid the gardens at night. Residents would pick up the discarded fruit to toss away or eat. Scientists told us the virus from an infected bat’s saliva could survive hours on a piece of fruit.

79 of 86

Throughout West Africa we found governments are granting mining concessions in ecologically risky areas without requiring companies to assess and address the dangers.

On the ground, Reuters reporters visited the site where scientists had found evidence of Ebola in bats near where a 2013 epidemic that killed 11,000 people began. Ryan used their phone GPS signal to locate the site on a major iron ore concession.

80 of 86

Biggest reporting challenge: Laos.

81 of 86

Jump zones more than doubled to 73%

-- a bigger % increase than any other country

That was accompanied by a 19% increase in tree loss.

To figure out what was driving those changes and to understand the risks they were creating, we needed to know what was happening on the ground.

The challenge: Laos is not a historically open country.

82 of 86

We mined every source we could find….

… to document how China’s demand for rubber and food crops were driving deforestation and spillover risk.

83 of 86

…allowing us to geolocate them on our risk map

A 2017 study found coronaviruses

in bats for sale at markets in Laos

and included a dataset with their locations

…and show what risk looks like on the ground…

84 of 86

85 of 86

MORE

EXAMPLES

& TUTORIALS

86 of 86

TOOLS & DATA SOURCES

🚢

🌏

EOBrowser

Tutorial

Sentinel Hub

QGIS

Tutorial