Data Mining: � Concepts and Techniques�
1
1
Chapter 1. Introduction
2
Why Data Mining?
3
Evolution of Sciences
4
Evolution of Database Technology
5
Chapter 1. Introduction
6
What Is Data Mining?
7
Knowledge Discovery (KDD) Process
8
Data Cleaning
Data Integration
Databases
Data Warehouse
Task-relevant Data
Selection
Data Mining
Pattern Evaluation
Example: A Web Mining Framework
9
Data Mining in Business Intelligence
10
Increasing potential
to support
business decisions
End User
Business
Analyst
Data
Analyst
DBA
Decision Making
Data Presentation
Visualization Techniques
Data Mining
Information Discovery
Data Exploration
Statistical Summary, Querying, and Reporting
Data Preprocessing/Integration, Data Warehouses
Data Sources
Paper, Files, Web documents, Scientific experiments, Database Systems
Example: Mining vs. Data Exploration
11
KDD Process: A Typical View from ML and Statistics
12
Input Data
Data Mining
Data Pre-Processing
Post-Processing
Data integration
Normalization
Feature selection
Dimension reduction
Pattern discovery
Association & correlation
Classification
Clustering
Outlier analysis
… … … …
Pattern evaluation
Pattern selection
Pattern interpretation
Pattern visualization
Example: Medical Data Mining
13
Chapter 1. Introduction
14
Multi-Dimensional View of Data Mining
15
Chapter 1. Introduction
16
Data Mining: On What Kinds of Data?
17
Chapter 1. Introduction
18
Data Mining Function: (1) Generalization
19
Data Mining Function: (2) Association and Correlation Analysis
20
Data Mining Function: (3) Classification
21
Data Mining Function: (4) Cluster Analysis
22
Data Mining Function: (5) Outlier Analysis
23
Time and Ordering: Sequential Pattern, Trend and Evolution Analysis
24
Structure and Network Analysis
25
Evaluation of Knowledge
26
Chapter 1. Introduction
27
Data Mining: Confluence of Multiple Disciplines
28
Data Mining
Machine
Learning
Statistics
Applications
Algorithm
Pattern
Recognition
High-Performance
Computing
Visualization
Database
Technology
Why Confluence of Multiple Disciplines?
29
Chapter 1. Introduction
30
Applications of Data Mining
31
Chapter 1. Introduction
32
Major Issues in Data Mining (1)
33
Major Issues in Data Mining (2)
34
Chapter 1. Introduction
35
A Brief History of Data Mining Society
36
Conferences and Journals on Data Mining
37
Where to Find References? DBLP, CiteSeer, Google
38
Chapter 1. Introduction
39
Summary
40
Recommended Reference Books
41