1 of 10

Pre-Trained Models.

By Christine Muthee.

2 of 10

OUTLINE.

  • What are Pre-trained Models?
  • Why do we need them?
  • How do we obtain them?
  • Tips on using pre-trained Models.

2

3 of 10

1.

WHAT ARE PRETRAINED MODELS?

4 of 10

A pre-trained model is a neural network model that has been trained on a large dataset whose learnt parameters can be reused.

4

5 of 10

CHARACTERISTICS OF PRETRAINED MODELS.

Prior Knowledge

They have learned general features (eg edges, shapes, textures, or grammar, context) from vast datasets.

This knowledge serves as a foundation for new tasks.

Saved Weights

The parameters are saved after the initial training and can be loaded to initialize a model.

Open Source *

Many pre-trained models are openly available in model zoos or libraries (e.g. PyTorch, Hugging Face Hub, TensorFlow Hub).

5

6 of 10

BIG CONCEPT

Transfer learning: the practice of taking a pre-trained model and adapting it to a new but related task.

6

7 of 10

2.

WHY DO WE NEED THEM?

8 of 10

Importance of Pretrained Models.

Reduced Training Time and Cost.

Avoiding OverFitting.

Better Performance on small Datasets.

Feature Extraction.

Benchmarks.

8

Source: CS231N CNN for Visual Rec

9 of 10

3. SOURCES OF PRE-TRAINED MODELS.

  1. PyTorch Model Zoo.

9

And many more …

10 of 10

TIPS ON USING PRETRAINED MODELS.

  • Understand the Source Dataset.
  • Leverage Transfer Learning (can be parameter efficient).
  • Choose the right model for your Task.
  • Use Pre Trained Models as Primary Feature Extractors.
  • Experiment with different Fine-Tuning Strategies.

10