1 of 44

Transfer Learning and tools for Conversational Agents

Thomas Wolf - HuggingFace Inc.


2 of 44

Hugging Face Inc.


3 of 44

Hugging Face: Democratizing NLP

  • Core research goals:
    • For many: intelligence as making sense of data
    • For us: intelligence as creativity, interaction, adaptability
  • Started with Conversational AI (text/image/sound interaction):
    • Neural Language Generation in a Conversational AI game
    • Product used by more than 3M users, 600M+ messages exchanged
    • With a science team that:
      • Conducted research in dialog systems
      • Built open-source tools


4 of 44

Transfer Learning for Language Generation

A dialog generation task:


More complex adaptation:

5 of 44

Transfer Learning for Language Generation

The Conversational Intelligence Challenge 2

« ConvAI2 »

(NIPS 2018 competition)


Final Automatic Evaluation Leaderboard (hidden test set)

6 of 44

Hugging Face: Democratizing NLP

  • Develop & open-source tools for Transfer Learning in NLP
  • We want to accelerate, catalyse and democratize research-level work in Natural Language Understanding as well as Natural Language Generation


7 of 44

Democratizing NLP – sharing knowledge, code, data

  • Knowledge sharing
    • NAACL 2019 / EMNLP 2020 Tutorial (Transfer Learning / Neural Language Generation)
    • Workshop NeuralGen 2019 (Language Generation with Neural Networks)
    • Workshop SustaiNLP 2020 (environmentally/computationally friendly NLP)
    • EurNLP Summit (European NLP summit)
  • Code & model sharing: Open-sourcing the “right way”
    • Two extremes: 1,000-command research code ⟺ 1-command production code
      • To reach the widest community, our goal is to sit right in the middle
    • Breaking barriers
      • Researchers / Practitioners
      • PyTorch / TensorFlow
    • Speeding up and fueling research in Natural Language Processing
      • Make people stand on the shoulders of giants


8 of 44

Libraries


9 of 44

Transformers library

We’ve built an opinionated framework providing state-of-the-art general-purpose tools for Natural Language Understanding and Generation.

Features:

  • Super easy to use – fast to onboard
  • For everyone – NLP researchers, practitioners, educators
  • State-of-the-art performance – on both NLU and NLG tasks
  • Reduced compute costs/footprint – 30+ pretrained models in 100+ languages
  • Deep interoperability between TensorFlow 2.0 and PyTorch


10 of 44

Transformers library


11 of 44

Transformers library: code example
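A minimal sketch of typical usage (assuming a recent version of the library and bert-base-uncased as an example checkpoint, not necessarily the snippet shown on the slide):

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Load a pretrained tokenizer and model from the model hub
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

# Tokenize a sentence and run it through the model
inputs = tokenizer("Jim Henson was a puppeteer", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# First output: hidden states of shape (batch size, sequence length, hidden size)
last_hidden_states = outputs[0]
print(last_hidden_states.shape)
```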


💥 Check it out at 💥 https://github.com/huggingface/transformers

12 of 44

Transformers: model hub


💥 Check it out at 💥 huggingface.co

13 of 44


14 of 44

Tokenizers library

Now that neural nets have fast implementations, a bottleneck in Deep-Learning based NLP pipelines is often tokenization: converting strings ➡️ model inputs.

We have just released 🤗Tokenizers: ultra-fast & versatile tokenization

Features:

  • Encode 1 GB of text in about 20 seconds
  • BPE / byte-level BPE / WordPiece / SentencePiece...
  • Bindings in Python/JS/Rust…
  • Link: https://github.com/huggingface/tokenizers
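As an illustrative sketch (not taken from the slides; it assumes a recent version of the library and a local corpus.txt file), training a byte-pair-encoding tokenizer from scratch and encoding a string could look like this:

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

# Build and train a BPE tokenizer from scratch on a local text file
tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()
trainer = BpeTrainer(special_tokens=["[UNK]", "[CLS]", "[SEP]", "[PAD]", "[MASK]"])
tokenizer.train(files=["corpus.txt"], trainer=trainer)

# Encode a string into tokens and vocabulary indices
encoding = tokenizer.encode("Jim Henson was a puppeteer")
print(encoding.tokens)
print(encoding.ids)
```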


15 of 44

Datasets library

The full data-processing pipeline goes beyond tokenization and models to include data access and preprocessing at the beginning and model evaluation at the end.

We have recently released a new library 🤗Datasets to improve the situation on both ends of the pipeline.


Pipeline diagram: Data → Tokenization → Prediction → Metrics, covered respectively by the Datasets, Tokenizers, Transformers and Datasets libraries.

16 of 44

Datasets library

Datasets is a lightweight and extensible library to easily access and process datasets and evaluation metrics for Natural Language Processing (NLP).

Features:

  • One-line access to 150+ datasets and metrics – open/collaborative hub
  • Built-in interoperability with NumPy, pandas, PyTorch and TensorFlow 2
  • Lightweight and fast with a transparent and Pythonic API
  • Thrives on large datasets: Wikipedia (18 GB) takes only 9 MB of RAM when memory-mapped
  • Smart caching: never wait for your data to be processed several times
  • Link: https://github.com/huggingface/datasets


17 of 44

Datasets: code example
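A minimal sketch of loading a dataset and a metric in one line each (assuming GLUE/MRPC as an example; the exact snippet on the slide may differ):

```python
from datasets import load_dataset, load_metric

# One-line download, caching and memory-mapping of a dataset from the hub
dataset = load_dataset("glue", "mrpc", split="train")
print(dataset[0])        # a single example as a plain Python dict
print(dataset.features)  # the typed schema of the dataset

# One-line access to the matching evaluation metric
metric = load_metric("glue", "mrpc")
```

Note that load_metric reflects the library's API at the time of the talk; metric loading later moved to a separate library.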


💥 Check it out at 💥 https://github.com/huggingface/datasets

18 of 44

Datasets: datasets hub


💥 Check it out at 💥 huggingface.co

19 of 44

Tools for generation


20 of 44

Decoding methods for language generation with Transformers

Since February 2020 (v2.4.0), the Transformers library includes a method to generate from any model that has output embeddings, supporting many decoding methods.
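As a rough illustration (assuming a recent library version and GPT-2 as an example model; the prompt is made up), the main decoding strategies are selected through keyword arguments of this generation method:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

input_ids = tokenizer.encode("The conversational agent replied that", return_tensors="pt")

# Greedy decoding: always pick the single most likely next token
greedy_output = model.generate(input_ids, max_length=40)

# Beam search: keep the num_beams most likely hypotheses at each step
beam_output = model.generate(input_ids, max_length=40, num_beams=5,
                             no_repeat_ngram_size=2, early_stopping=True)

# Top-k / nucleus (top-p) sampling: sample from a truncated next-token distribution
sample_output = model.generate(input_ids, max_length=40, do_sample=True,
                               top_k=50, top_p=0.95)

print(tokenizer.decode(sample_output[0], skip_special_tokens=True))
```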


21 of 44

Decoding methods for language generation with Transformers

Beam search reduces the risk of missing hidden high-probability word sequences by keeping the num_beams most likely hypotheses at each time step and eventually choosing the hypothesis with the highest overall probability.


22 of 44

Decoding methods for language generation with Transformers

Beam search reduces the risk of missing hidden high-probability word sequences.


23 of 44

Decoding methods for language generation with Transformers

In open-ended generation, beam search might not be the best option:

  • Beam search works well when the length of the desired generation is more or less predictable (machine translation, summarization…)
  • Beam search suffers from repetitive generation. This is hard to control in story generation: there is a trade-off between forcing "no repetition" and repeating cycles of identical n-grams
  • Ari Holtzman et al. (2019): high-quality human language does not follow a distribution of high-probability next words. Humans want generated text to surprise them, not to be boring/predictable.


24 of 44

Decoding methods for language generation with Transformers


25 of 44

Decoding methods for language generation


26 of 44

Decoding methods for language generation

For more information:


27 of 44

Thanks for listening!


28 of 44

Concepts

What is Transfer Learning?


29 of 44

What is Transfer Learning?

Adapted from NAACL 2019 Tutorial: https://tinyurl.com/NAACLTransfer


30 of 44

Sequential Transfer Learning

Learn on one task/dataset, transfer to another task/dataset


Diagram: Pretraining (the computationally intensive step) yields a general-purpose model – word2vec, GloVe, skip-thought, InferSent, ELMo, ULMFiT, GPT, BERT, DistilBERT, … – which is then adapted to downstream tasks such as text classification, word labeling, question answering, ...

31 of 44

Training: The rise of language modeling pretraining

Many currently successful pretraining approaches are based on language modeling: learning to predict Pϴ(text) or Pϴ(text | other text)
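Concretely, for left-to-right language modeling, the model with parameters θ is trained to maximize the probability it assigns to observed text, factorized token by token:

```latex
P_\theta(\text{text}) = \prod_{t=1}^{T} P_\theta(w_t \mid w_1, \dots, w_{t-1})
```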

Advantages:

  • Doesn’t require human annotation – self-supervised
  • Many languages have enough text to learn high-capacity models
  • Versatile – can be used to learn both sentence and word representations with a variety of objective functions


32 of 44

Pretraining Transformers models (BERT, GPT…)


33 of 44

Sequential Transfer Learning

Learn on one task/dataset, transfer to another task/dataset


Diagram (continued): Pretraining (the computationally intensive step) yields a general-purpose model – word2vec, GloVe, skip-thought, InferSent, ELMo, ULMFiT, GPT, BERT, DistilBERT, … – and adaptation is the data-efficient step that turns it into a task-specific, high-performance model for text classification, word labeling, question answering, ...

34 of 44

Model: Adapting for target task

General workflow:

  1. Remove the pretraining task head (if not used for the target task)
  2. Add target task-specific elements on top/bottom:
     - simple: linear layer(s)
     - complex: a full LSTM on top

Sometimes very complex: Adapting to a structurally different task

Ex: Pretraining with a single input sequence and adapting to a task with several input sequences (e.g. translation, conditional generation...)
  ➯ Use the pretrained model to initialize as much as possible of the target model
  ➯ Ramachandran et al., EMNLP 2017; Lample & Conneau, 2019
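Going back to the simple case, a minimal sketch of adding a linear task head on top of a pretrained encoder (assuming a recent version of Transformers and bert-base-uncased as an example; the class name is made up for illustration):

```python
import torch.nn as nn
from transformers import AutoModel

class TextClassifier(nn.Module):
    """Pretrained encoder body + a new, randomly initialized task-specific head."""
    def __init__(self, model_name="bert-base-uncased", num_labels=2):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)  # reused pretrained weights
        self.head = nn.Linear(self.encoder.config.hidden_size, num_labels)  # new head

    def forward(self, input_ids, attention_mask=None):
        outputs = self.encoder(input_ids, attention_mask=attention_mask)
        cls_vector = outputs[0][:, 0]  # hidden state of the first token
        return self.head(cls_vector)   # one score per class
```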


35 of 44

Downstream Tasks and Model Adaptation: Quick Examples


36 of 44

A – Transfer Learning for text classification


Diagram: the tokenizer splits "Jim Henson was a puppeteer" into word pieces (Jim, Henson, was, a, puppet, ##eer) and converts them to vocabulary indices; the pretrained model maps these to hidden-state vectors; an adaptation head (classifier) turns them into output scores, e.g. True: 0.7886, False: -0.223.
37 of 44

A – Transfer Learning for text classification

Remarks:

  • The error rate goes down quickly! After one epoch we already have >90% accuracy.
    ⇨ Fine-tuning is highly data-efficient in transfer learning
  • We took our pre-training & fine-tuning hyper-parameters straight from the literature on related models.
    ⇨ Fine-tuning is often robust to the exact choice of hyper-parameters
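For reference, a fine-tuning run of this kind could be sketched as follows (illustrative only: the dataset, model name and hyper-parameters are assumptions, not the ones behind the numbers above):

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# Tokenize a small slice of a classification dataset
dataset = load_dataset("imdb", split="train[:2000]")
dataset = dataset.map(lambda ex: tokenizer(ex["text"], truncation=True, padding="max_length"),
                      batched=True)

# One epoch of fine-tuning with hyper-parameters taken from the literature
args = TrainingArguments(output_dir="finetuned", num_train_epochs=1,
                         per_device_train_batch_size=16, learning_rate=2e-5)
trainer = Trainer(model=model, args=args, train_dataset=dataset)
trainer.train()
```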


38 of 44

Trends and limits of Transfer Learning in NLP

38

39 of 44

Model size and Computational efficiency

  • Recent trends
    • Going big on model size: over 1 billion parameters has become the norm for SOTA models


Examples: Google GShard (600B parameters), GPT-3 (175B parameters).

40 of 44

Model size and Computational efficiency

Why is this a problem?

  • Narrowing the research competition field
    • What is the place of academia in today's NLP? Fine-tuning? Analysis and BERTology? Critique?
  • Environmental costs

  • Is bigger-is-better a scientific research program?


“Energy and Policy Considerations for Deep Learning in NLP” - Strubell, Ganesh, McCallum - ACL 2019

41 of 44

Model size and Computational efficiency

Reducing the size of a pretrained model

Three main techniques are currently being investigated:

  • Distillation – DistilBERT: 95% of BERT's performance in a model 40% smaller and 60% faster
  • Pruning
  • Quantization – from FP32 to INT8
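As a concrete illustration of the last point, dynamic quantization of a Transformer's linear layers with PyTorch can be sketched as follows (a sketch under assumed settings, not the exact recipe behind the numbers above):

```python
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased")

# Dynamic quantization: Linear-layer weights are stored in INT8,
# activations are quantized on the fly at inference time
quantized_model = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)
```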


42 of 44

The generalization problem:

  • Models are brittle: fail when text is modified, even with meaning preserved
  • Models are spurious: memorize artifacts and biases instead of truly learning

(Figure: side-by-side examples of brittle and spurious model behavior)

Robin Jia and Percy Liang, “Adversarial Examples for Evaluating Reading Comprehension Systems,” arXiv:1707.07328, 2017, http://arxiv.org/abs/1707.07328

R. Thomas McCoy, Junghyun Min, and Tal Linzen, “BERTs of a Feather Do Not Generalize Together: Large Variability in Generalization across Models with Similar Test Set Performance,” arXiv:1911.02969, 2019, http://arxiv.org/abs/1911.02969

  • The inductive bias question


43 of 44

Shortcomings of language modeling in general

Need for grounded representations

  • Limits of the distributional hypothesis: certain types of information are difficult to learn from raw text
    • Human reporting bias: not stating the obvious (Gordon and Van Durme, 2013)
    • Common sense isn’t written down
    • Facts about named entities
    • No connection to other modalities (image, audio…)
  • Possible solutions:
    • Incorporate structured knowledge (e.g. databases - ERNIE: Zhang et al 2019)
    • Multimodal learning (e.g. visual representations - VideoBERT: Sun et al. 2019)
    • Interactive/human-in-the-loop approaches (e.g. dialog: Hancock et al. 2018)


44 of 44

Current transfer learning performs adaptation once.

  • Ultimately, we’d like to have models that continue to retain and accumulate knowledge across many tasks (Yogatama et al., 2019).
  • No distinction between pretraining and adaptation; just one stream of tasks.

Main challenge:

Catastrophic forgetting.

Different approaches from the literature:

  • Memory
  • Regularization
  • Task-specific weights, etc.

  • Continual and meta-learning
