1 of 9

VaxVerdict

A ROBERTA BASED MULTILABEL CLASSIFIER

PICT CL Lab Group 1

2 of 9

Table of

Problem Statement

01

02

03

04

Approaches

Why roBERTa

Advantages

Disadvantages

05

06

Future Scope

CONTENTS

3 of 9

PROBLEM STATEMENT

The goal is to build an effective multi-label classifier to label a social media post (particularly, a tweet) according to the specific concern(s) towards vaccines as expressed by the author of the post.

Labels: Unnecessary, Mandatory, Pharma, Conspiracy, Political, Country, Rushed, Ingredients, Side-effect, Ineffective, Religious, None

4 of 9

Convolutional NN

macro f1 score: 0.55

BERT

macro f1 score: 0.6

roBERTa

macro f1 score: 0.65

Approaches

5 of 9

Why

    • Bidirectional context modeling
    • Pretrained on diverse multilingual datasets
    • No Next Sentence Prediction
    • Compatibility with CUDA architecture

ROBERTA

6 of 9

Effective Handling of Long Sequences

01

Dynamic Masking

Pretrained on Large Corpus

Large Batch Training

02

03

04

Advantages

7 of 9

Domain Specificity

01

Computational Resources

Large Memory Footprint

Lack of Incremental Learning

02

03

04

Disadvantages

8 of 9

FUTURE SCOPE

    • Hyperparameter tuning
    • Checkpoint incorporation and Dropout
    • Data Augmentation
    • Adversarial Training

9 of 9

PICT CL Lab Group 1

THANK YOU