summarization using text processing and pre-trained models
Fatemeh Rahimi
Summarization
2
Why is Summarization Useful?
3
Where can we use Summarization?
4
Summarization
5
Single Document Summarization
Summarization
Multi-Document Summarization
Types of Summarization
6
Extractive Summarization
Summarization
Abstractive Summarization
Extractive Summarization
7
Extractive Summarization (Supervised)
A dataset with highlighted sentences
8
Sentences | Highlighted |
By the Mid 19th….. | 1 |
Japan changed int ... | 0 |
Sentence 3 | 0 |
Sentence 4 | 1 |
... | ... |
... | ... |
Sentence n | 0 |
Extractive Summarization (Supervised)
We need input of numbers for classification, But we have sentences and words.
What is the solution?
9
Pretrained-models
10
Pretrained-models (Cont.)
11
Let’s use pre-trained models for Summarization
Supervised
12
Sentences within the document
Pretrained models
Embeddings
ML approaches
To find the highlighted sentences
Extracting Embeddings with pre-trained models
What is Word Embedding?
Word Embeddings in BERT:�From base model: [12x768]
13
Extractive Summarization (Supervised)
14
Extractive Summarization (UnSupervised)
(Useful When u also have a topic �or question to find summary on)��Ranking Algorithms:
15
Extractive Summarization (UnSupervised)
Clustering
16
Extractive Summarization (UnSupervised)
Graph-based
Extracting summary:
17
18
Wrap Up
We Learned:
19
Thanks for listening
Any Questions?
20
BERT
21
Evaluation for summarization
ROUGE
ROUGE-N
ex: ROUGE-1, ROUGE-2
ROUGE-L
22
DBSCAN
23