1 of 19

INTRODUCTION TOMachine Learning�

2 of 19

CHAPTER 1: �Introduction

3 of 19

Why “Learn” ?

  • Machine learning is programming computers to optimize a performance criterion using example data or past experience.
  • There is no need to “learn” to calculate payroll
  • Learning is used when:
    • Human expertise does not exist (navigating on Mars),
    • Humans are unable to explain their expertise (speech recognition)
    • Solution changes in time (routing on a computer network)
    • Solution needs to be adapted to particular cases (user biometrics)

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

3

4 of 19

What We Talk About When We Talk About“Learning”

  • Learning general models from a data of particular examples
  • Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce.
  • Example in retail: Customer transactions to consumer behavior:

People who bought “Blink” also bought “Outliers” (www.amazon.com)

  • Build a model that is a good and useful approximation to the data.

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

4

5 of 19

Data Mining

  • Retail: Market basket analysis, Customer relationship management (CRM)
  • Finance: Credit scoring, fraud detection
  • Manufacturing: Control, robotics, troubleshooting
  • Medicine: Medical diagnosis
  • Telecommunications: Spam filters, intrusion detection
  • Bioinformatics: Motifs, alignment
  • Web mining: Search engines
  • ...

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

5

6 of 19

What is Machine Learning?

  • Optimize a performance criterion using example data or past experience.
  • Role of Statistics: Inference from a sample
  • Role of Computer science: Efficient algorithms to
    • Solve the optimization problem
    • Representing and evaluating the model for inference

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

6

7 of 19

Applications

  • Association
  • Supervised Learning
    • Classification
    • Regression
  • Unsupervised Learning
  • Reinforcement Learning

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

7

8 of 19

Learning Associations

  • Basket analysis:

P (Y | X ) probability that somebody who buys X also buys Y where X and Y are products/services.

Example: P ( chips | beer ) = 0.7

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

8

9 of 19

Classification

  • Example: Credit scoring
  • Differentiating between low-risk and high-risk customers from their income and savings

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

9

Discriminant: IF income > θ1 AND savings > θ2

THEN low-risk ELSE high-risk

10 of 19

Classification: Applications

  • Aka Pattern recognition
  • Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair style
  • Character recognition: Different handwriting styles.
  • Speech recognition: Temporal dependency.
  • Medical diagnosis: From symptoms to illnesses
  • Biometrics: Recognition/authentication using physical and/or behavioral characteristics: Face, iris, signature, etc
  • ...

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

10

11 of 19

Face Recognition

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

11

Training examples of a person

Test images

ORL dataset,

AT&T Laboratories, Cambridge UK

12 of 19

Regression

  • Example: Price of a used car
  • x : car attributes

y : price

y = g (x | θ )

g ( ) model,

θ parameters

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

12

y = wx+w0

13 of 19

Regression Applications

  • Navigating a car: Angle of the steering
  • Kinematics of a robot arm

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

13

α1= g1(x,y)

α2= g2(x,y)

α1

α2

(x,y)

  • Response surface design

14 of 19

Supervised Learning: Uses

  • Prediction of future cases: Use the rule to predict the output for future inputs
  • Knowledge extraction: The rule is easy to understand
  • Compression: The rule is simpler than the data it explains
  • Outlier detection: Exceptions that are not covered by the rule, e.g., fraud

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

14

15 of 19

Unsupervised Learning

  • Learning “what normally happens”
  • No output
  • Clustering: Grouping similar instances
  • Example applications
    • Customer segmentation in CRM
    • Image compression: Color quantization
    • Bioinformatics: Learning motifs

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

15

16 of 19

Reinforcement Learning

  • Learning a policy: A sequence of outputs
  • No supervised output but delayed reward
  • Credit assignment problem
  • Game playing
  • Robot in a maze
  • Multiple agents, partial observability, ...

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

16

17 of 19

Resources: Datasets

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

17

18 of 19

Resources: Journals

  • Journal of Machine Learning Research www.jmlr.org
  • Machine Learning
  • Neural Computation
  • Neural Networks
  • IEEE Transactions on Neural Networks
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Annals of Statistics
  • Journal of the American Statistical Association
  • ...

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

18

19 of 19

Resources: Conferences

  • International Conference on Machine Learning (ICML)
  • European Conference on Machine Learning (ECML)
  • Neural Information Processing Systems (NIPS)
  • Uncertainty in Artificial Intelligence (UAI)
  • Computational Learning Theory (COLT)
  • International Conference on Artificial Neural Networks (ICANN)
  • International Conference on AI & Statistics (AISTATS)
  • International Conference on Pattern Recognition (ICPR)
  • ...

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

19