The Politics of Technology
Spring 2019: Using Your Data for Fun and Profit
1
Week 3
Outline
Week 1
Week 3
Week 2
2
Week 4
Topic 5: The Data Economy
3
You are the product
4
Business
Model
Tensions
Business Models Evolve...
5
Business trends: Goods → Services → Finance → Data
Mixed Models
6
“Free” = Fuel for Internet Growth
“Free” (ad-based) services bring:
Gratis vs. Libre
7
“Free” Services
8
User
Service
Advertiser
Data Broker
Data
Dollars
Who is the Customer?
And what is the product?
You → Attention → Data
Ad-Tech: The Beginning...
9
October 27, 1994: the website HotWired (now Wired.com) displayed the first online banner ad, paid for by AT&T.
Google Changed the Game
AdWords: Google sells ads based on search keywords� → then displays on search results page (2000)
AdSense: Google buys space on web pages � → then fills with ads based on page content (2003)
10
Impact: Google started collecting and analyzing a lot of data to improve its ad business.
Data Economy: the Dark Side
11
Data Economy Dystopia
12
Evolution of Capitalism (Zuboff)
13
Maximizing attention / engagement == Subliminal advertising redux?
Data Collection Motivations
14
Is personal data collection inherently bad??
Depends on how it is used!!
Who Do You Trust? Faith-based Privacy
15
Trust Graph� Enforced by contract
Data Broker
App Developer
Platform
User
Advertiser
16
Duh!
People say they care about privacy but they continue to buy devices that can spy on them
“Some 63 percent of people find connected devices to be “creepy,” and 75 percent don’t trust the way their data is shared”
“Nearly 70 percent of survey takers said they own one or more connected device, which include smart home appliances, fitness monitors, and gaming consoles.“ By Rani Molla@ranimolla May 13, 2019
Who do you trust? Privacy Policies
17
We don't sell any of your information to anyone, and we never will. We also impose strict restrictions on how our partners can use and disclose the data we provide.
Google does not sell your personal information, which includes your Gmail and Google Account information. We also do not share your personal information with advertisers, unless you have asked us to.
Google will not target ads based on sensitive information, such as race, religion, sexual orientation, health, or sensitive financial categories.
Information about our customers is an important part of our business, and we are not in the business of selling it to others. We share customer information only as described below ...
What about Data Brokers?
18
[Image by Cracked Labs]
About
Data Brokers
(2015)
www.webfx.com
19
Your
Profile
20
In 2017, data giant Acxiom provided up to 3,000 attributes on 700 million people. In 2018, the number was 10,000, on 2.5 billion consumers [Image by Cracked Labs]
Personal Data: Kinds of Tracking
21
Inferred desires and intents → Merchandising or Minority Report?
Acquisition and Access
REVIEW QUESTIONS FROM LAST WEEK:
What personal info is subject to search warrant, subpoena, or court order?
What can be collected / used by 3rd parties?
What can be collected in a public place?
What is a public space ? � (e.g. Facebook; AR, VR worlds?)
Who can know, correct, delete, control use?
First party data is the information collected directly from an audience or customers by a company or website publisher.
Second party data is someone else’s first party data, bought directly from the collector of it.
Third party data is data that <a publisher> buys from outside sources that are not the original collectors of that data.
Source: lotame.com
22
Data Sources
23
24
Identity Resolution Companies
“We maintain the largest people-based identity graph that connects more than 500 platforms, data owners, and publishers to brands and agencies. Our graph matches PII-based data—like emails, postal addresses, and phone numbers—with anonymous identifiers—like cookies and devices ID—in a privacy-conscious way. “
“We offer services that create value for consumers, including lower prices, greater access, and better interactions with brands.”
25
Identity Resolution Examples
26
Data Broker Accuracy
“How a data mining giant got me wrong” --Tom Bergin
“LONDON (Reuters) - I’m 57, with a 30-year-old wife, a fairly new hot water boiler, an old-style television, a petrol car and no kids.
Actually, none of that is true. But that is what you might believe if you purchased access to my data from the world’s largest information broker by market value.”
27
Annoying to you, REALLY annoying to advertisers!
Summary 5: The Data Economy
Takeaways
Questions
28
Topic 6: Tracking & Targeting
30
How They Do It
How Personalization is Done
31
Personalize by…
“Does the change increase conversion rates?”
Other tracking techniques
32
Two Views of Cookies
I have heard that cookies are bad for privacy - is that true?
This is a myth - cookies are a friendly internet tool primarily used by the advertising and e-commerce industry to make surfing easier and quicker. They have several roles, none of which can compromise your privacy:
Tracking Cookies are a specific type of cookie that is distributed, shared, and read across two or more unrelated Web sites for the purpose of gathering information or potentially to present customized data to you. Not all cookies are tracking cookies.
Tracking cookies are not harmful like malware, worms, or viruses, but they can be a privacy concern. As an example, if you go to a Web site that hosts online advertising from a third-party vendor, the third-party vendor can place a cookie on your computer. If another Web site also has advertisements from the third-party vendor, then that vendor knows you have visited both Web sites. Nothing malicious has occurred, but the advertising company can determine indirectly all the sites you have been to if they have cookies present on those sites.
33
DII: Browser Fingerprinting
34
Web Beacons
web bug, tracking bug, tag, web tag, page tag, tracking pixel, pixel tag, 1×1 gif, or clear gif
35
Different: in-store bluetooth beacons send radio signals to apps on your smartphone
The Facebook “Pixel”
“The Facebook pixel is code that you place on your website. It collects data that helps you track conversions from Facebook ads, optimize ads, build targeted audiences for future ads, and remarket to people who have already taken some kind of action on your website.
It works by placing and triggering cookies to track users as they interact with your website and your Facebook ads.”
36
Facebook Connect Convenience vs. Privacy
“Facebook gave GateGuru the following information about me: My name, gender, birthday, all my friends’ names, my employers, my schools, all my status updates, every Facebook group I belong to or page I like, my events, photos I’ve uploaded or been tagged in, my religious and political views, my hometown, my current city, my videos, my website URL, the content and member list of the Facebook groups I manage, and my relationship status …”
37
Ad Placement
HOW EXACTLY FACEBOOK decides who sees what is one of the great pieces of forbidden knowledge in the information age, hidden away behind nondisclosure agreements, trade secrecy law, and a general culture of opacity.
They appear to deliver certain ads, including for housing and employment, in a way that aligns with race and gender stereotypes — even when advertisers ask for the ads to be exposed a broad, inclusive audience.
39
Ad-Tech’s Eternal Triangle
40
USER is OK with ad content and web content
ADVERTISER is OK with adjacent web content
Web PUBLISHER is OK with ad content
Ad-Tech Under Fire
41
Real-time Bidding (RTB)
“Real-time Bidding (RTB) is a way of transacting media that allows an individual ad impression to be put up for bid in real-time. This is done through a programmatic on-the-spot auction, which is similar to how financial markets operate. RTB allows for Addressable Advertising; the ability to serve ads to consumers directly based on their demographic, psychographic, or behavioral attributes.”
https://www.iab.com/guidelines/real-time-bidding-rtb-project/
42
How Real-Time Bidding Works
43
Controversial Content Categories
113 /People & Society/Ethnic & Identity Groups/Lesbian, Gay, Bisexual & Transgender 144 /Beauty & Fitness/Face & Body Care/Unwanted Body & Facial Hair Removal 171 /People & Society/Ethnic & Identity Groups/Indigenous Peoples/Native Americans 195 /Health/Reproductive Health 198 /Health/Reproductive Health/Birth Control 202 /Health/Reproductive Health/Male Impotence 235 /Beauty & Fitness/Hair Care/Hair Loss 236 /Beauty & Fitness/Weight Loss 237 /Health/Nutrition/Vitamins & Supplements 238 /Health/Medical Facilities & Services/Medical Procedures/Surgery/Cosmetic Surgery 241 /Beauty & Fitness/Fitness/Bodybuilding 245 /Health/Oral & Dental Care 246 /Health/Vision Care 257 /Health/Substance Abuse 401 /People & Society/Family & Relationships/Family/Parenting/Pregnancy & Maternity 409 /News/Politics/Right-Wing Politics 410 /News/Politics/Left-Wing Politics 420 /Health/Health Conditions/Skin Conditions 421 /Health/Reproductive Health/Sexually Transmitted Diseases 429 /Health/Health Conditions/Cancer 437 /Health/Mental Health 499 /Health/Alternative & Natural Medicine
44
Ad Industry: “Nothing to see here”
NAI response:
the existence of this taxonomy and the fact that it includes sensitive categories does not imply that companies are using sensitive personal data to target advertising to various individuals. Rather, the content categories IAB has standardized for use in digital advertising are primarily used by websites, mobile apps, and other online platforms to communicate to advertisers what kind of content they host on their properties. This allows them, for example, to sell space for contextual advertisements that are not tailored to particular web browsers or devices. In practice, this would enable a website that publishes general information about the symptoms of and treatments for sensitive health conditions to tag its own content with categories in the IAB taxonomy. This in turn would allow advertisers (like, e.g. health clinics or pharmaceutical companies) to advertise on the site without targeting individual site visitors. Content tags also allow brand advertisers to make decisions about where their ads will appear for brand integrity purposes. For example, some advertisers may be particularly interested in advertising on, or particularly keen to avoid advertising on, a site that has tagged its content with certain religious or political categories.
45
Ad Re-Targeting
46
The ad industry continues its quest toward fewer cookies and more consistent user IDs
The IAB Tech Lab's DigiTrust and the Advertising ID Consortium chart different, but � intersecting, paths toward the goal of an industry-wide identity resolution.
Barry Levine on January 17, 2019
Shadow Accounts
47
1. Social plugins, like the "Share" or "Like" buttons, which you can find on outside websites, like shopping pages and news articles.
2. Facebook Login, which lets you log into outside apps using your Facebook account so you don't have to create a new password or username.
3. Facebook Analytics, which gives website owners information on how and when people interact with their site.
4. Facebook ads and measurement tools, including Facebook Pixel. The tool measures how effective ads are, by giving Facebook information on when you visited certain sites and took specific actions, like buying something.
"When you visit a site or app that uses our services, we receive information even if you're logged out or don't have a Facebook account,"
ADINT: Advertising Intelligence
48
Threat Review: How Would You Rank Them?
TECHNOLOGY
PEOPLE
49
Summary 6: Tracking & Targeting
Takeaways
Questions
50
Week 4 Preview
51
Bonus 3: More Background
52
Current Research
53
“I think it will very much be an uphill battle for Solid. When the web appeared, it entered a vacuum needing to be filled. Now Solid is entering a space dominated by massive corporations who make a living by harvesting and selling user data. They’re not going to let go of that flock of golden geese easily”.
54
Facebook Sues Analytics Firm Over Data Misuse
Facebook revealed last Friday that it has filed a lawsuit alleging South Korean analytics firm Rankwave abused its developer platform's data, and has refused to cooperate with a mandatory compliance audit and request to delete the data. TechCrunch reports:
Facebook's lawsuit centers around Rankwave offering to help businesses build a Facebook authorization step into their apps so they can pass all the user data to Rankwave, which then analyzes biographic and behavioral traits to supply user contact info and ad targeting assistance to the business. Rankwave also apparently misused data sucked in by its own consumer app for checking your social media "influencer score." That app could pull data about your Facebook activity such as location checkins, determine that you've checked into a baseball stadium, and then Rankwave could help its clients target you with ads for baseball tickets.
The use of a seemingly fun app to slurp up user data and repurpose it for other business goals is strikingly similar to how Cambridge Analytica's personality quiz app tempted millions of users to provide data about themselves and their friends. TechCrunch has attained a copy of the lawsuit that alleges that Rankwave misused Facebook data outside of the apps where it was collected, purposefully delayed responding to a cease-and-desist order, claimed it didn't violate Facebook policy, lied about not using its apps since 2018 when they were accessed in April 2019, and then refused to comply with a mandatory audit of its data practices. Facebook Platform data is not supposed to be repurposed for other business goals, only for the developer to improve their app's user experience.
55
Simple Opt Out is drawing attention to opt-out data sharing and marketing practices that many people aren't aware of (and most people don't want), then making it easier to opt out. For example:
This site makes it easier to opt out of data sharing by 50+ companies (or add a company, or see opt-out tips). Enjoy! http://simpleoptout.com/
Intro
To
RTB
56
Data Breaches
The New
Normal
57
Which data?
58
59
“Facebook already tracks people online, even when they're logged out or don't have an account, through tools like the Facebook Pixel and plugins like the Share button on pages.
The social network's Share button is on 275 million web pages. It collects data allowing advertisers to see what kind of content you're viewing.”
60
61
The evidence comprises category lists from Google and IAB, which allow advertisers to target people according to characteristics such as being an incest victim, having cancer, having a substance-abuse problem, being into a certain kind of politics or adhering to a certain religion or sect.
Content Scanning
62
The company’s data collection practices also include scanning your email to extract keyword data for use in other Google products and services and to improve its machine learning capabilities, Google spokesman Aaron Stein confirmed in an email to NBC News. “We may analyze [email] content to customize search results, better detect spam and malware,” he added
only Gmail apps "directly enhancing email functionality--such as email clients, email backup services and productivity services (e.g., CRM and mail merge services)" will be authorized to access inbox data.
Ad Placement Exclusions
63
Some don’t like the trade...
64
Voice: the next influence frontier
65
66
“Access to internal tools is highly controlled, and is only granted to a limited number of employees who require these tools to train and improve the service by processing an extremely small sample of interactions,”
67
Geoffrey A. Fowler / Washington Post:
When Alexa runs your home, Amazon tracks you in more ways than you might want. — Would you let a stranger eavesdrop in your home and keep the recordings?
68
69
Verizon, T-Mobile, Sprint, and AT&T hit with class action lawsuits for selling customer location data — The lawsuits come after a Motherboard investigation showed AT&T, Sprint, and T-Mobile sold phone location data that ended up with bounty hunters, and The New York Times covered an instance of Verizon selling data.
Joseph Cox / Motherboard: