Beyond Hostility: Detecting Subtle and Overt Forms of Online Conflict with Multi-Objective Learning
Collaborators & Credits
Outline
Negative behaviours on social media
Hate Example
Toxic Behaviours
Consequences
Reginald Gonzales. Social media as a channel and its implications on cyber bullying. In DLSU Research Congress, pages 1–7, 2014.
Conflict (Different shades of Hate)
Hate vs Conflict
Brand communities
Let us think of from a platform or brand perspective
Consequences of Negative Interaction - 1
11
Consequences of Negative Interaction - 2
Platform changes
Conflict Datasets
Need for fine grained conflict dataset
| | | |
Davidson et al | Offensive language | Hate speech classes | Normal |
Fortuna et al | Abusive | Hateful | Normal |
Brand Communities
Jan Breitsohl, Holger Roschk, and Christina Feyertag. Consumer brand bullying behaviour in online communities of service firms. Service Business Development: Band 2., pages 289–312, 2018.
Broad spectrum of conflicts
Broad spectrum of negative behaviours
Rationale
Datasets
Lesser Conflicts | Extreme Conflicts | ||
Class | Datapoints | Class | Datapoints |
Teasing | 208 | Trolling | 1089 |
Criticism | 698 | Harassment | 1098 |
Sarcasm | 577 | Threats | 482 |
Dataset Characteristics
Class | Chars | Words | %stop words | No. of sentences |
Teasing | 78.5 | 14.6 | 0.31 | 1.6 |
Criticism | 232.9 | 42.7 | 0.41 | 2.9 |
Sarcasm | 58.9 | 10.7 | 0.32 | 1.1 |
Trolling | 105.4 | 18.6 | 0.32 | 1.6 |
Harassment | 130.2 | 23.9 | 0.35 | 2.2 |
Threats | 275.7 | 50.6 | 0.31 | 5.5 |
Average | 147 | 26.85 | 0.34 | 2.48 |
Related Work
Prior work - Deep learning algorithms
Muhammad Khan, Assad Abbas, Attiqa Rehman, and Raheel Nawaz. Hateclassify: A service framework for hate speech identification on social media. IEEE Internet Computing, 25(1):40–49, 2020.
Hate
Joni Salminen, Maximilian Hopf, Shammur A Chowdhury, Soon-gyo Jung, Hind Almerekhi, and Bernard J Jansen. Developing an online hate classifier for multiple social media platforms. Human-centric Computing and Information Sciences, 10:1–34, 2020.
Tommaso Caselli, Valerio Basile, Jelena Mitrovi´c, and Michael Granitzer. Hatebert: Retraining bert for abusive language detection in english. arXiv preprint arXiv:2010.12472,2020.
Datasets
Dataset |
|
|
|
Fortuna et al | Abusive | Hateful | Normal |
Davidson et al | Offensive language | Hate speech classes | Normal |
| | | |
Models used
Models used
Performance
Models | Founta et al | Davidson et al | Conflict Dataset | |||||||||
| Acc | F1 | R | P | Acc | F1 | R | P | Acc | F1 | R | P |
BERT | 0.77 | 0.76 | 0.77 | 0.78 | 0.87 | 0.86 | 0.86 | 0.86 | 0.69 | 0.61 | 0.61 | 0.65 |
Hate BERT | 0.78 | 0.73 | 0.73 | 0.73 | 0.86 | 0.86 | 0.86 | 0.87 | 0.55 | 0.52 | 0.52 | 0.52 |
Distill BERT | 0.77 | 0.69 | 0.67 | 0.70 | 0.85 | 0.86 | 0.86 | 0.86 | 0.55 | 0.53 | 0.52 | 0.54 |
GPT-2 | 0.77 | 0.72 | 0.73 | 0.72 | 0.90 | 0.73 | 0.72 | 0.73 | 0.59 | 0.51 | 0.51 | 0.55 |
Flan-T5 | 0.77 | 0.76 | 0.76 | 0.77 | 0.87 | 0.87 | 0.86 | 0.88 | 0.67 | 0.60 | 0.61 | 0.61 |
| | | | | | | | | | | | |
Dataset challenges
Decision Transformer
Reward modelling
Decision Transformer
Performance
Models | Founta et al | Davidson et al | Conflict Dataset | |||||||||
| Acc | F1 | R | P | Acc | F1 | R | P | Acc | F1 | R | P |
BERT | 0.77 | 0.76 | 0.77 | 0.78 | 0.87 | 0.86 | 0.86 | 0.86 | 0.69 | 0.61 | 0.61 | 0.65 |
Hate BERT | 0.78 | 0.73 | 0.73 | 0.73 | 0.86 | 0.86 | 0.86 | 0.87 | 0.55 | 0.52 | 0.52 | 0.52 |
Distill BERT | 0.77 | 0.69 | 0.67 | 0.70 | 0.85 | 0.86 | 0.86 | 0.86 | 0.55 | 0.53 | 0.52 | 0.54 |
GPT-2 | 0.77 | 0.72 | 0.73 | 0.72 | 0.90 | 0.73 | 0.72 | 0.73 | 0.59 | 0.51 | 0.51 | 0.55 |
Flan-T5 | 0.77 | 0.76 | 0.76 | 0.77 | 0.87 | 0.87 | 0.86 | 0.88 | 0.67 | 0.60 | 0.61 | 0.61 |
ConflictDT | 0.77 | 0.77 | 0.78 | 0.78 | 0.89 | 0.88 | 0.88 | 0.88 | 0.71 | 0.63 | 0.64 | 0.62 |
BERT and ConflictDT class performance - conflict dataset
Reward variations
Performance
Models | Accuracy | F-1 | R | P |
BERT | 0.69 | 0.61 | 0.61 | 0.65 |
HierBERT | 0.69 | 0.62 | 0.63 | 0.63 |
GPT-2 | 0.59 | 0.51 | 0.51 | 0.55 |
Flan-T5 | 0.67 | 0.60 | 0.61 | 0.61 |
Dist all classes - ConflictDT | 0.71 | 0.63 | 0.64 | 0.62 |
Dist Lessr and Gtr Classes | 0.67 | 0.59 | 0.62 | 0.75 |
Dist Harrassment | 0.68 | 0.61 | 0.63 | 0.67 |
| | | | |
ConflictDT class performance with Harassment and Lesser-Greater rewards -�the conflict dataset.�
Sequential conflictDT
The effects of reward functions
Models | Accuracy | F-1 | R | P |
BERT | 0.69 | 0.61 | 0.61 | 0.65 |
HierBERT | 0.69 | 0.62 | 0.63 | 0.63 |
GPT-2 | 0.59 | 0.51 | 0.51 | 0.55 |
Flan-T5 | 0.67 | 0.60 | 0.61 | 0.61 |
Dist all classes - ConflictDT | 0.71 | 0.63 | 0.64 | 0.62 |
Dist Lessr and Gtr Classes | 0.67 | 0.59 | 0.62 | 0.75 |
Dist Harrassment | 0.68 | 0.61 | 0.63 | 0.67 |
Sequential no reward | 0.68 | 0.60 | 0.60 | 0.74 |
Sequential with reward | 0.69 | 0.62 | 0.62 | 0.64 |
Graph showing the distance between the logits of the predicted class and the�logits of the true class changing over timesteps
Knowledge Distillation
Knowledge Distillation (KD)
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
KD - modelling
Soft Target Distillation
Feature Distillation using Probabilities
Feature Distillation using Embeddings
Models | Founta et al | Davidson et al | Conflict Dataset | |||||||||
| Acc | F1 | R | P | Acc | F1 | R | P | Acc | F1 | R | P |
BERT | 0.77 | 0.76 | 0.77 | 0.78 | 0.87 | 0.86 | 0.86 | 0.86 | 0.69 | 0.61 | 0.61 | 0.65 |
Hate BERT | 0.78 | 0.73 | 0.73 | 0.73 | 0.86 | 0.86 | 0.86 | 0.87 | 0.55 | 0.52 | 0.52 | 0.52 |
Distill BERT | 0.77 | 0.69 | 0.67 | 0.70 | 0.85 | 0.86 | 0.86 | 0.86 | 0.55 | 0.53 | 0.52 | 0.54 |
GPT-2 | 0.77 | 0.72 | 0.73 | 0.72 | 0.90 | 0.73 | 0.72 | 0.73 | 0.59 | 0.51 | 0.51 | 0.55 |
Flan-T5 | 0.77 | 0.76 | 0.76 | 0.77 | 0.87 | 0.87 | 0.86 | 0.88 | 0.67 | 0.60 | 0.61 | 0.61 |
ConflictDT | 0.77 | 0.77 | 0.78 | 0.78 | 0.89 | 0.88 | 0.88 | 0.88 | 0.71 | 0.63 | 0.64 | 0.62 |
KDemb | 0.83 | 0.83 | 0.83 | 0.84 | 0.91 | 0.90 | 0.90 | 0.90 | 0.72 | 0.67 | 0.67 | 0.67 |
KDlp | 0.78 | 0.77 | 0.77 | 0.78 | 0.86 | 0.85 | 0.85 | 0.85 | 0.71 | 0.61 | 0.63 | 0.62 |
KDlcl | 0.74 | 0.73 | 0.74 | 0.73 | 0.86 | 0.85 | 0.85 | 0.85 | 0.70 | 0.60 | 0.61 | 0.61 |
Performance-Cost Analysis
Loıc Lannelongue, Jason Grealey, and Michael Inouye. Green algorithms: quantifying the carbon footprint of computation. Advanced science, 8(12):2100707, 2021.
Performance cost
Model | Runtime (sec) | Carbon Footprint G Co2e | Energy Consumption (Wh) |
ConflictDT | 799.60 | 2.19 | 23.01 |
BERT | 761.92 | 2.06 | 21.65 |
Sequential | 2384.98 | 6.46 | 67.76 |
LLM-Prob | 8451.17 | 22.96 | 241.16 |
LLM-Hiddene | 1115.31 | 3.03 | 31.84 |
LLM-Soft | 8445.70 | 72.95 | 240.99 |
Thematic Analysis
Virginia Braun and Victoria Clarke. Using thematic analysis in psychology. Qualitative research in psychology, 3(2):77–101, 2006.
Six phase framework
Thematic Analysis
Datapoint | Model Label | Label |
"Oh, great, another selfie of you and your cat. Because the internet desperately needed that. 😴" | Sarcasm | Sarcasm |
"I think it's brilliant that you hate having to rush to pack things! I'm happy to pack at my own speed too." | Criticism | Teasing |
"Your opinions are worthless and don't deserve any respect." | Harassment | Criticism |
"You must be so proud of yourself for coming up with that gem 😂" | Trolling | Harassment |
"Wow, you must be a genius to know everything about cats. 😹" | Sarcasm | Sarcasm |
"So thats where I left my mop head! 😂" | Trolling | Teasing |
"Her outfit is so basic, it's like she raided her grandma's closet." | Trolling | Trolling |
Thematic Analysis
Datapoint | Model Label | Label | Thematic Code |
"I think it's brilliant that you hate having to rush to pack things! I'm happy to pack at my own speed too." | Criticism | Teasing | Hard to see any criticism, text is more lighthearted |
"Your opinions are worthless and don't deserve any respect." | Harassment | Criticism | Direct criticism of user |
"You must be so proud of yourself for coming up with that gem 😂" | Trolling | Harassment | Insult lead to harassment coding |
"So thats where I left my mop head! 😂" | Trolling | Teasing | Lighthearted joking, not looking to incite a response or cause hurt |
Thematic Analysis - Results
Conclusion
For any questions or queries contact either:��Joemon M Jose�joemon.jose@glasgow.ac.uk��Oliver Warke�oliver.warke.1@research.gla.ac.uk�
56
#UofGWorldChangers
@UofGlasgow