Neural Networks and Fuzzy Systems
The Multilayer Perceptron
Rizoan Toufiq
Assistant Professor
Department of Computer Science & Engineering
Rajshahi University of Engineering & Technology
Altering The Perceptron Model
Altering The Perceptron Model
Altering The Perceptron Model
The New Model
The New Learning Rule
The New Learning Rule
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Hidden Layer to output layer
We now need to know what δpjis for each of the units-if we know
this, then we can decrease E .
Using (4.6) and chain rule,
The New Learning Rule
The Mathematics
Hidden Layer to output layer
We now need to know what δpjis for each of the units-if we know
this, then we can decrease E .
Using (4.6) and chain rule,
The New Learning Rule
The Mathematics
Hidden Layer to output layer
The New Learning Rule
The Mathematics
Input layer to Hidden Layer
The New Learning Rule
The Mathematics
Input layer to Hidden Layer
Equations (4.12) and (4.15) together define how we can train our multilayer networks. �
The New Learning Rule
Advantage of Sigmoid Function
The Multilayer Perceptron Algorithm
The Multilayer Perceptron Algorithm
The Multilayer Perceptron Algorithm
The XOR problem Revisited
The hidden unit acts as a feature detector, �
The XOR problem Revisited
The XOR problem Revisited
One of the more interesting cases is when there is no direct connection from the input to the output.
The XOR problem Revisited
One of the more interesting cases is when there is no direct connection from the input to the output.
The XOR problem Revisited
The learning rule is not guaranteed to produce convergence, however, and it is possible for the network to fall into a situation in�which it is unable to learn the correct output.
The network shown in figure 4.7 will correctly respond to the input patterns 00 and 10, but fails to produce the correct output for the�patterns 01 or 11. �
This local minimum occurs infrequently-about 1%of the time in the XOR problem.
Visualising Network Behavior
Visualising Network Behavior
Visualising Network Behavior
Multilayer perceptrons as classifiers
Multilayer perceptrons as classifiers
Multilayer perceptrons as classifiers
Multilayer perceptrons as classifiers
if we add another layer of perceptron's �
Multilayer perceptrons as classifiers
We never need more than three layers in a network, a statement that is referred to as the Kolmogorov theorem.
Multilayer perceptrons as classifiers
We never need more than three layers in a network, a statement that is referred to as the Kolmogorov theorem.
Multilayer perceptrons as classifiers
We never need more than three layers in a network, a statement that is referred to as the Kolmogorov theorem.
Multilayer perceptrons as classifiers
Generalization
One of the major features of neural networks is their ability to generalise, that is, to successfully classify patterns that have not been previously presented.
Neural networks are good at interpolation, but not so good at extrapolation.
Fault Tolerance
Learning Difficulties
local minimum
Underfitting
Overfitting
Divergency
there is only a slight “lip” to cross before reaching an actual deeper minimum
There are alternative approaches to minimizing these occurrences, which are outlined below.
�
Learning Difficulties
Other Learning Problems
The method of gradient descent is intrinsically slow to converge in a complex landscape
One solution: The addition of the momentum term
Other solution: take account of second order effects in the gradient descent algorithm.
Summary
Rizoan Toufiq, Assistant Professor, Dept. of CSE, RUET
42
Read Task
Rizoan Toufiq, Assistant Professor, Dept. of CSE, RUET
43
B1: Neural Computing - An Introduction - R Beale and T Jackson, Publisher: Adam Hilger, 1990 IOP Publishing Ltd.
Chapter 4: The Multilayer Perceptron
Query???
Rizoan Toufiq, Assistant Professor, Dept. of CSE, RUET
44