facebook.tracking.exposed
Final Presentations
User Uniqueness�
We can use Jaccard Similarity to measure uniqueness between any two sets
For any two users A and B, we can compare how unique they are across postIds or sources by using the complement of the similarity 1 - J(A, B)
User Uniqueness
Build a similarity matrix using Jaccard
Reduce to a single value for each user
Plot the runtime complexity
Posts Uniqueness
Data Exploration
The same post can have several source.
Scenario: Kialo posts on a user timeline and the user share it with their friend. The same post will appear as nature=organic and source= user_name even thou it is a reshare of Kialo’post.
Posts Uniqueness
Data Exploration
Looking at some post from business (The New York Time), the concatenate text can be slightly different, due to AB testing, but the posts are actually the same. We want to consider them as the same content. Even if me and my friend see a different variation of the post, we can consider we consumed the same content.
Posts Uniqueness
Measuring Post Similarity
Posts Uniqueness
Measuring Post Similarity
Investigating Users’ Behavior
Users are treated similarly by Facebook as long as they have a similar behavior on the platform
At the end, we could cluster users into:
Users’ Emotional Reactions Over Time
14/03
25/03
Example: 14th of March - Real world event
Example: 14th of March - Random Event
26-27/02
26-27/02 ?
Anger Levels over time
Next Steps
Cross bubble surprise
What news are important for Facebook?
Main idea is to look at number of impressions containing certain keyword. If a word “spikes” on a certain day, then something happened and we can visualize and analyze it.
To help with visual analysis, I’ve created a simple tool.
Cross bubble surprise
What news are important for Facebook?
At the moment, this is only an awareness tool. In future, there are multiple ways how to improve visualization and also analyze actual algorithm:
Content Classification
Advertisements
Content Classification
Political Posts
Content Classification
Other
Content Classification
Sponsored Facebook Post Topics (Oct. - Dec. 2018)
Content Classification
Sponsored Facebook Post Topics (Oct. - Dec. 2018)
“Other” Ad
(e.g. Events)
Political Ad
Sales Ad
Exposure and diversity
How many people are populating your newsfeed?
Let’s see if data analysis offer alternative readings
Exposure and diversity
How many people are populating your newsfeed?
Let’s see if data analysis offer alternative readings
Exposure and diversity
How many people are populating your newsfeed?
Exposure and diversity
How many people are populating your newsfeed?
Exposure and diversity
How many people are populating your newsfeed?
Exposure and diversity
How many people are populating your newsfeed?
Exposure and diversity
How many people are populating your newsfeed?