�
Data Mining_Anoop Chaturvedi
1
Swayam Prabha
Course Title
Multivariate Data Mining- Methods and Applications
Lecture 19
ICA Algorithms and Exploratory Factory Analysis
By
Anoop Chaturvedi
Department of Statistics, University of Allahabad
Prayagraj (India)
Slides can be downloaded from https://sites.google.com/view/anoopchaturvedi/swayam-prabha
Data Mining_Anoop Chaturvedi
2
Data Mining_Anoop Chaturvedi
3
Data Mining_Anoop Chaturvedi
4
Data Mining_Anoop Chaturvedi
5
Multiple component extraction
Parallel Algorithm:
Deflation method extracts independent components sequentially one-by-one.
Parallel method extracts all independent components simultaneously.
Data Mining_Anoop Chaturvedi
6
Data Mining_Anoop Chaturvedi
7
Data Mining_Anoop Chaturvedi
8
Data Mining_Anoop Chaturvedi
9
Nonquadratic functions and their first two derivatives | |||
Density | G(y) | | |
log cosh | | | |
Exp | | | |
Data Mining_Anoop Chaturvedi
10
Data Mining_Anoop Chaturvedi
11
Data Mining_Anoop Chaturvedi
12
Plots of simulated signals:
Data Mining_Anoop Chaturvedi
13
Mixed Signals
Data Mining_Anoop Chaturvedi
14
Data Mining_Anoop Chaturvedi
15
Data Mining_Anoop Chaturvedi
16
Unmixed Signals using FastICA
Data Mining_Anoop Chaturvedi
17
Original Signals Reconstructed Signals
Reconstruction has done a good job here except that the algorithm cannot recover the exact amplitude of the source.
Exploratory Factor Analysis (EFA):
Data Mining_Anoop Chaturvedi
18
Data Mining_Anoop Chaturvedi
19
Data Mining_Anoop Chaturvedi
20
Data Mining_Anoop Chaturvedi
21
Data Mining_Anoop Chaturvedi
22
Example: Texture measurements of a pastry-type food.
Data set: https://openmv.net/info/food-texture
Oil: percentage Oil in the pastry
Density: Product’s density
Crispy: Crispiness measurement on a scale from 7 to 15
Fracture: Angle, in degrees, through which the pastry can be slowly bent before it fractures.
Hardness: A measure of the amount of force required before breakage occurs.
50 rows and 5 columns
factanal() function of R is used
Data Mining_Anoop Chaturvedi
23
Data Mining_Anoop Chaturvedi
24
Data Mining_Anoop Chaturvedi
25
Data Mining_Anoop Chaturvedi
26
If two variables have large loadings for the same factor, they have something in common.
Factor 1 accounts for pastry, which is dense and can be bent a lot before it breaks.
Factor 2 accounts for pastry that is crispy and hard to break.
Soft pastry ⇒ factor 1
Hard pastry ⇒ factor 2