CANIS Data Visualization and Foreign Interference
CANIS - Foreign Interference
Problem
The goal is to identify and understand the dynamics of foreign interference in social media.
This involves analyzing when specific content may be considered foreign interference, identifying the platforms with the highest occurrence of such cases, and understanding the type of actors involved.
Methodology
The project was developed in three stages:
1. We employ the 5W2H methodology for a thorough understanding and practical development of the problem.
5W2H
Who?
Where?
Why?
What?
When?
How?
How Much?
CANIS - Foreign Interference
4. Named Entity Recognition (NER):
Words classification in categories such as people, countries, and cities.
Methodology
2. Information Retrieval and Analysis in Python:
1. Data gathering:
Tweets extraction with web scraping from specific accounts using the Selenium library.
2. Information Preprocessing:
Treatment of stop words, upper/lower-case, and lemmatization for a consistent structure.
3. Sentiment Analysis:
Tweets classification as positive or negative using Vader Sentiment Analysis.
5. Definition of foreign interference proxy:
Flag identifying whether the content is foreign interference.
Tweets are considered foreign interference if sentiment is negative and the US or Canada (and its cities) are mentioned.
CANIS - Foreign Interference
Methodology
3. Visualization (Web APP):
CANIS - Foreign Interference
Demo
CANIS - Foreign Interference
CANIS - Foreign Interference
Conclusions
How do we show the scale of China’s influence on social media?
CANIS - Foreign Interference
- Optimize the tweets’ extraction code for more extensive and efficient analysis.
- Analyze tweets using NLP algorithms trained on large datasets, i.e., Large Language Models (LLM).
Next Steps
- Explore data-gathering methods from social media platforms, such as APIs, web scraping libraries, etc. to examine the dynamics of each platform.
* NOTICE that the application is not online (active) the whole time. It is necessary to wait 2-3 minutes for the system to start the app the first time.
“Unlocking data analysis insights empowers us to unveil foreign interference patterns, fostering a safer and more informed world”
- Someone not yet famous