1 of 18

A Digital Methods Summer School 2016 Workshop

Anne Helmond & Fernando van der Vlist

Carolin Gerlitz & Esther Weltevrede

Tracking the Trackers

2 of 18

Agenda

  1. Introduction: Trackers
  2. Tracker tracker tool
  3. Example project: Like Economy
  4. Methodological summary
  5. Tracking exercise

3 of 18

1. Introduction: Trackers

4 of 18

“For every explicit action of a user, there are probably 100+ implicit data points from usage; whether that is a page visit, a scroll etc.” (Berry 2011: 152)

Tracking

5 of 18

  • Every time a web user requests a website, a series of tracking features are enabled: cookies, widgets, advertising trackers, analytics, beacons etc.
  • First party (from website) vs. third party tracker (e.g. Facebook, Twitter, Google).

Tracking technologies
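
The first-party vs. third-party distinction can be made concrete with a minimal sketch. It assumes that comparing the last two labels of the hostname is a good enough stand-in for the "site" (this breaks for suffixes such as .co.uk); the function names `base_domain` and `classify_request` are hypothetical and only serve the illustration.

```python
from urllib.parse import urlparse


def base_domain(url: str) -> str:
    """Naive base domain: the last two labels of the hostname.

    Assumption for illustration only; a real tool would consult the
    Public Suffix List to handle domains such as example.co.uk.
    """
    host = urlparse(url).hostname or ""
    return ".".join(host.split(".")[-2:])


def classify_request(page_url: str, request_url: str) -> str:
    """Label a request as first party or third party relative to the page."""
    if base_domain(page_url) == base_domain(request_url):
        return "first party"
    return "third party"


page = "https://www.example.com/news"
print(classify_request(page, "https://static.example.com/app.js"))
print(classify_request(page, "https://connect.facebook.net/en_US/sdk.js"))
```

Here the script served from a subdomain of the visited site counts as first party, while the Facebook script counts as third party.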

6 of 18

  • Ghostery: browser plugin that detects the ‘invisible’ web, lets users block it, and so helps prevent a ‘digital footprint’.
  • Detection via a library of tracker code snippets (regular expressions).
  • Detects around 2188 trackers.
  • Not uncontroversial: started as an NGO, then was bought by the analytics company Evidon in 2010.

Tracker blocking
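
Pattern-based detection of this kind can be sketched as follows. The three patterns below are hypothetical examples written for this illustration; they are not taken from Ghostery's actual tracker library, and the function name `detect_trackers` is likewise an assumption.

```python
import re
from typing import List

# Hypothetical pattern library for illustration; NOT Ghostery's real database.
TRACKER_PATTERNS = {
    "Facebook Connect": re.compile(r"connect\.facebook\.net"),
    "Google Analytics": re.compile(r"google-analytics\.com/(ga|analytics)\.js"),
    "Twitter Button": re.compile(r"platform\.twitter\.com/widgets\.js"),
}


def detect_trackers(html: str) -> List[str]:
    """Return the sorted names of all trackers whose pattern occurs in the page source."""
    return sorted(
        name for name, pattern in TRACKER_PATTERNS.items() if pattern.search(html)
    )


sample_html = (
    '<script src="https://connect.facebook.net/en_US/sdk.js"></script>'
    '<script src="https://www.google-analytics.com/analytics.js"></script>'
)
print(detect_trackers(sample_html))
```

Matching on the page source alone misses trackers that are loaded dynamically, so this is a simplification of what a browser plugin can observe.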

7 of 18

2. Tracker tracker tool

8 of 18

  • Tracker Tracker: tool built on top of Ghostery by the Digital Methods Initiative (2012).
  • Detects which trackers are present on lists of websites and creates a network view of the results.
  • “Repurposing analytical capacities” of privacy app: digital research methods paired with platform & software studies.

DMI Tracker Tracker
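
A network view of this kind can be thought of as a bipartite graph in which websites are linked to the trackers found on them. The sketch below is an assumption about the general shape of such data, not the Tracker Tracker tool's internal code; `build_network` and the sample detections are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple


def build_network(detections: Dict[str, List[str]]) -> Tuple[List[Tuple[str, str]], Counter]:
    """Turn {site: [trackers]} into site-tracker edges plus a degree count per node."""
    edges = [
        (site, tracker)
        for site, trackers in sorted(detections.items())
        for tracker in sorted(set(trackers))
    ]
    degree = Counter()
    for site, tracker in edges:
        degree[site] += 1      # number of trackers found on the site
        degree[tracker] += 1   # number of sites the tracker appears on
    return edges, degree


# Hypothetical detections for two made-up sites.
detections = {
    "a.example": ["Facebook Connect", "Google Analytics"],
    "b.example": ["Google Analytics"],
}
edges, degree = build_network(detections)
print(edges)
print(degree.most_common())
```

Trackers with a high degree appear on many sites in the collection, which is what makes them stand out as large nodes once the graph is visualised.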

9 of 18

Gerlitz, Carolin, and Anne Helmond. 2013. “The Like Economy: Social Buttons and the Data-Intensive Web.” New Media & Society 15 (8): 1348–65. doi:10.1177/1461444812472322.

3. Example project: Like Economy

10 of 18

  • Starting point: social media widgets place cookies (Gerlitz & Helmond 2013).
  • These cookies track both platform users and anyone else on the web.
  • All web users potentially feed data into platforms through cookies.
  • RQ: How pervasive are platform cookies on the most visited websites of the web?

Like Economy

11 of 18

  1. Create a collection of 1000 most-visited websites based on Alexa.com data.
  2. Input into the Tracker Tracker tool.
  3. Visualise results with Gephi.
  4. Colour-code based on platform.

Like Economy: Method
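
One simple way to express "how pervasive" a tracker is, given detection results for a site collection, is the share of sites on which it was found. The sketch below is an illustrative assumption, not the calculation reported in the original study; `tracker_share` and the sample data are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def tracker_share(detections: Dict[str, List[str]]) -> Dict[str, float]:
    """Fraction of sites in the collection on which each tracker was detected."""
    total = len(detections)
    if total == 0:
        return {}
    # Count each tracker at most once per site.
    counts = Counter(t for trackers in detections.values() for t in set(trackers))
    return {tracker: n / total for tracker, n in counts.items()}


# Hypothetical results for four made-up sites.
detections = {
    "a.example": ["Facebook Connect", "Google Analytics"],
    "b.example": ["Google Analytics"],
    "c.example": ["Facebook Connect"],
    "d.example": [],
}
for tracker, share in sorted(tracker_share(detections).items()):
    print(f"{tracker}: {share:.0%}")
```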

12 of 18

Facebook trackers

13 of 18

4. Methodological summary

14 of 18

Methodological summary

  1. Research question: type of tracker & sites
  2. Build a website (URL) collection, e.g. from an existing expert list
  3. Tracker Tracker tool
  4. Visualisation
  5. Analyse results + add layers

15 of 18

5. Tracking exercise

16 of 18

Tracking exercise

  1. Team up in pairs and pick a category to study: health, kids, or adult sites?
  2. Get access to the collections made with Alexa.com: http://tiny.cc/TrackURLs.
  3. Enter the list into the Tracker Tracker tool. Settings: Only look at specified pages.
  4. Save > Output > GEXF (Gephi).
    1. Alternative: Save > Output > CSV
  5. Open in Gephi, use colour settings to visually distinguish between different tracking services/types.
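
For orientation, a GEXF file is an XML description of nodes and edges that Gephi can open. The sketch below writes a minimal GEXF 1.2 document from site-tracker pairs using Python's standard library. The node attribute named "type" (with values "site" and "tracker") is an assumption chosen so that nodes can be partitioned by type in Gephi; the exact layout of the Tracker Tracker tool's own output may differ, and `to_gexf` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

GEXF_NS = "http://www.gexf.net/1.2draft"


def to_gexf(edges: List[Tuple[str, str]]) -> str:
    """Serialise (site, tracker) pairs as a minimal undirected GEXF 1.2 graph.

    Assumes site names and tracker names never collide, since both are used as node ids.
    """
    gexf = ET.Element("gexf", {"xmlns": GEXF_NS, "version": "1.2"})
    graph = ET.SubElement(gexf, "graph", {"mode": "static", "defaultedgetype": "undirected"})
    # Declare one string attribute, "type", to distinguish sites from trackers.
    attributes = ET.SubElement(graph, "attributes", {"class": "node"})
    ET.SubElement(attributes, "attribute", {"id": "0", "title": "type", "type": "string"})
    nodes = ET.SubElement(graph, "nodes")
    edge_list = ET.SubElement(graph, "edges")

    seen = set()
    for i, (site, tracker) in enumerate(edges):
        for node_id, node_type in ((site, "site"), (tracker, "tracker")):
            if node_id not in seen:
                seen.add(node_id)
                node = ET.SubElement(nodes, "node", {"id": node_id, "label": node_id})
                values = ET.SubElement(node, "attvalues")
                ET.SubElement(values, "attvalue", {"for": "0", "value": node_type})
        ET.SubElement(edge_list, "edge", {"id": str(i), "source": site, "target": tracker})
    return ET.tostring(gexf, encoding="unicode")


# Hypothetical sample data.
xml_text = to_gexf([("a.example", "Google Analytics"), ("b.example", "Google Analytics")])
print(xml_text)
```

The CSV alternative carries the same site-tracker pairs as rows, which is easier to inspect in a spreadsheet but needs an extra import step in Gephi.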

17 of 18

Tracking exercise

Gephi instructions:

  • New Project > Open Graph File > OK
  • Layout > Choose a Layout > Force Atlas 2
    • Lin Log mode: yes
    • Prevent Overlap: yes
    • Scaling: 15
  • Ranking > Nodes > Degree > Size/Weight (Red Diamond), Min size: 3 Max size: 30 (you can play with these settings).
  • Partition > refresh > type > Apply
  • Preview > Presets > Default Straight > Refresh
  • Export > SVG/PDF/PNG
  • Put your output here
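
The degree-based sizing step can be read as a rescaling of node degree into the chosen size range. The sketch below assumes a plain linear mapping between the minimum size of 3 and the maximum size of 30; it illustrates the idea and is not Gephi's own code, and `scale_sizes` is a hypothetical name.

```python
from typing import Dict


def scale_sizes(degree: Dict[str, int], min_size: float = 3.0, max_size: float = 30.0) -> Dict[str, float]:
    """Linearly map each node's degree onto the range [min_size, max_size]."""
    if not degree:
        return {}
    lo, hi = min(degree.values()), max(degree.values())
    if hi == lo:
        # All nodes have the same degree, so give them all the minimum size.
        return {node: min_size for node in degree}
    span = max_size - min_size
    return {node: min_size + (d - lo) / (hi - lo) * span for node, d in degree.items()}


# Hypothetical degrees: one widely used tracker, one mid-sized, one rare.
print(scale_sizes({"Tracker A": 19, "Tracker B": 10, "Tracker C": 1}))
```

With these sample degrees, the rarest node gets size 3, the most connected node gets size 30, and the node halfway between them gets 16.5.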

18 of 18

End! Thank you.