Pre-Trained Models.
By Christine Muthee.
OUTLINE.
2
1.
WHAT ARE PRETRAINED MODELS?
A pre-trained model is a neural network model that has been trained on a large dataset whose learnt parameters can be reused.
4
“
CHARACTERISTICS OF PRETRAINED MODELS.
Prior Knowledge
They have learned general features (eg edges, shapes, textures, or grammar, context) from vast datasets.
This knowledge serves as a foundation for new tasks.
Saved Weights
The parameters are saved after the initial training and can be loaded to initialize a model.
Open Source *
Many pre-trained models are openly available in model zoos or libraries (e.g. PyTorch, Hugging Face Hub, TensorFlow Hub).
5
BIG CONCEPT
Transfer learning: the practice of taking a pre-trained model and adapting it to a new but related task.
6
2.
WHY DO WE NEED THEM?
Importance of Pretrained Models.
Reduced Training Time and Cost.
Avoiding OverFitting.
Better Performance on small Datasets.
Feature Extraction.
Benchmarks.
8
Source: CS231N CNN for Visual Rec
3. SOURCES OF PRE-TRAINED MODELS.
9
And many more …
TIPS ON USING PRETRAINED MODELS.
10