Interactions among Echo Chambers in Social Media
Team: Jiazan Shi, Siming Wu, Yue Yang(Max)
Sponsor: Maurizio Porfiri, Rayan Succar, Alain Boldini
Project Objective
Objective
Hypothesis
Echo chamber refers to the situation where opinions get amplified and reinforced by information spread inside a closed system because people tend to believe the information that is similar or could support their own beliefs. The effect of echo chambers could be more severe when it comes to political and other sensitive topics, such as gun control, which would cause polarization and social danger.
2
Literature Review: Echo Chamber
3
Literature Review: Stance Detection
Lexicon Based Methods
ML Based Methods
DL Based Methods
Pros
Cons
4
Data Source
Vaccine:
Abortion:
Gun:
Snscrape
Web Scraping
Twint
5
How did we choose scraping period?
— Google trends
Gun
Vax
Abortion
Covid-19 Vaccines
School mass shooting
Roe v Wade overturned
6
Data Pipeline
Tweets Scraping
(Snscrape)
Users Following
(Twint)
Cleaning
&
Filtering
Subset of Tweets
Stance Detection
(ChatGPT)
Network Analysis
User Following Relationship
Active Users
Hypothesis Testing
Visualization
Stance Leaning
Sentiment
Analysis
Network of
Pro and Anti Users
Data Collection
Data Preprocessing
Data Analysis
7
Data Summary
Topic | Raw Tweets | Raw Users | Time Period | Final Tweets | Final users |
Gun | 2.9M | 1.21M | 05/17/2022 - 05/31/2022 | 222K | 14,505 |
Abortion | 1.1M | 554K | 06/17/2022 - 07/01/2022 | 178K | 11,979 |
Vaccine | 1.5M | 686K | 02/28/2021 - 03/15/2021 | 212K | 13,970 |
8
Methodology: Concept & Definition
1
Interactions
2
Content’s Leaning
3
Individual Leaning
4
Following Leaning
9
Methodology: ChatGPT
Topic | Validate Data Source | Accuracy | F1 Score |
Gun | 81.00% | 61.0% | |
Abortion | 80.14% | 79.0% | |
Vaccine | 82.25% | 56.0% |
Model:
gpt-3.5-turbo
Prompt:
“Is this a pro-gun, anti-gun, neutral, or off-topic statement? Only say it is neutral when it is completely non-opinionated: …”
Response:
“This tweet is anti-gun”
ChatGPT Validation
10
Methodology: Analysis
Network Analysis:
The project uses NetworkX for network analysis, that is a python package for analyzing the structure of complex networks. The project created directed graphs with NetworkX that represent the following relationship among users, and analyze the structure of networks, in-degree and out-degree of users.
Hypothesis Testing:
Spearman’s rank correlation coefficient assesses the statistical correlation between the ranks of two variables.
To test the hypothesis, we analyzed the correlations between:
11
Analysis: Network Structure
Abortion
Gun
Vaccine
12
Analysis: Echo Chamber
Corr = 0.478
P-value = 0.00
Abortion
Gun
Vaccine
Corr = 0.04
P-value = 0.00
Corr = 0.455
P-value = 0.00
13
Analysis: Cross-community Interaction
Corr = 0.141
P-value = 0.00
Corr = 0.217
P-value = 0.00
Corr = 0.116
P-value = 0.00
14
Analysis: Cross-stance Interaction
Corr = 0.185
P-value = 0.00
Corr = 0.083
P-value = 0.00
Pro-gun corr = 0.363
Anti-gun corr = 0.101 (limited)
Pro-abortion corr = 0.176
Anti-abortion corr = 0.185
Pro-vax corr = 0.159
Anti-vax corr = 0.116
Corr = 0.129
P-value = 0.00
15
Conclusion
16
Findings
17
Findings
Topic | Raw Tweets | Raw Users | After Filtering | Final Tweets | Final users | After Stance Classification | Pro Users | Anti Users | Neutral Users |
Gun | 2.9M | 1.21M | 222K | 14,505 | 875 | 13,093 | 14 | ||
Abortion | 1.1M | 554K | 178K | 11,979 | 2,835 | 6,510 | 141 | ||
Vaccine | 1.5M | 686K | 212K | 13,970 | 5826 | 6847 | 801 |
18
Limitation
19
Stance detection, is there ground truth?
A person's a person, no matter how small. - Dr. Suess #WAAR
Thanks for gently spoken truth, @DanaPerino ! How DO we as a nation "protect the innocent"? #TheFive #CCOT #DNC #RNC
I can tell women what to do with their bodies #ThingsYouDontSayAsAPolitician
20
Stance detection, is there ground truth?
A person's a person, no matter how small. - Dr. Suess #WAAR
Thanks for gently spoken truth, @DanaPerino ! How DO we as a nation "protect the innocent"? #TheFive #CCOT #DNC #RNC
I can tell women what to do with their bodies #ThingsYouDontSayAsAPolitician
21
Policy Implications
Promote diverse viewpoints in algorithm: For sensitive or political information, we suggest algorithm engineers to reduce the weight for similarity and include diversity when designing algorithms.
Increase the connections between communities for sensitive communities: Twitter applies SimClusters to detect communities. Once the communities are identified, we encourage them to increase the recommendations from the opposite communities.
Allow users to personalize their feed option: A previous paper showed social media platforms that don’t have feed algorithms tweakable by users have higher possibility to have echo chamber effect than platforms that have tweakable algorithms.
Surveillance for online discussion?Some suggest that the government should increase surveillance on online discussion of sensitive or political topics, what do you think?
22
Community Notes
Currently, very few tweets get notes. Can it be scaled?
23
Advanced cluster detection techniques are in hand
24
References
25