1 of 26

Data Mining_Anoop Chaturvedi

1

Swayam Prabha

Course Title

Multivariate Data Mining- Methods and Applications

Lecture 18

Latent Variable Model for Blind Source Separation

By

Anoop Chaturvedi

Department of Statistics, University of Allahabad

Prayagraj (India)

Slides can be downloaded from https://sites.google.com/view/anoopchaturvedi/swayam-prabha

2 of 26

Latent Variable Model for Blind Source SeparationLatent variable ⇒ Used to give a formal representation of ideas or concepts that cannot be well defined or measured directly. �Examples: Fuzzy concepts such as ‘general intelligence’, ‘verbal ability’, ‘ambition’, ‘socio economic status’, ‘quality of life’ and ‘happiness’.���

Data Mining_Anoop Chaturvedi

2

Person has high level of intelligence (latent variable)

He / She did well in a test (Observable variable)

3 of 26

  • A casual relationship may exist between latent variables and observable variables.
  • Unobservable concepts are constructed from certain observed variables.
  • Latent variables are usually formed as linear combination of some observable variables.
  • Observed variables are proxies for unobserved concepts.
  • Latent variables play important role in Blind source separation problems.�

Data Mining_Anoop Chaturvedi

3

4 of 26

Blind Source Separation (BSS)

  • Both the sources and the mixing methodology are unknown.
  • Mixture signals are available for further separation process.
  • Data consist of multiple time series of mixed signals.
  • Objective is to recover all individual sources from the mixed signal, or to separate a particular source.
  • Involves decomposition of unknown mixture of non-Gaussian signals into its independent component signals.�

Data Mining_Anoop Chaturvedi

4

5 of 26

  •  

Data Mining_Anoop Chaturvedi

5

6 of 26

Data Mining_Anoop Chaturvedi

6

 

7 of 26

������

More Applications

Biomedical Signal Processing:

EEG: Separating brainwave signals from different regions of the brain for analysis of neural activity.

  • To separate EEG, ERP, MEG recording into individual source signals.
  • ICA can be used to separate the signals so that an accurate representation of brain activity may be observed.

Data Mining_Anoop Chaturvedi

7

8 of 26

EEG → relate certain types of behaviour to changes in the electrical activity of the cerebral cortex.

ERP → Event-related potential is fine tuned EEG’s resulting from simulation of visual, auditory or sensory systems.

MEG → strength of magnetic fields generated by cortical activity.

f MRI (functional magnetic resonance) → experiments used to study human brain.

Data Mining_Anoop Chaturvedi

8

9 of 26

Removing noise on the ECG:

ICA can be used for removing artifacts from the ECG

To access health condition of the fetus using ECG of mother.

(ECG: mixture of fetus and maternal heart rate)

Separation of musical signals:

ICA provides a fairly accurate separation, although some accidental/unwanted signals remain.

Agricultural remote sensing images

Web image & classification ⇒ Classify web image into mountains, agriculture land, sea forest area etc.

Data Mining_Anoop Chaturvedi

9

10 of 26

Astronomy: In Celestial sources a pixel value can be considered as a mixture of different sources. BSS can be used to display interesting features and helps astronomers to improve description of celestial sources.

Audio Signal Processing:

Music Source Separation: Decomposing mixed music tracks into individual instruments or vocal tracks.

Surveillance: Separating sounds of interest (e.g., footsteps, conversations) from background noise in surveillance recordings.

Data Mining_Anoop Chaturvedi

10

11 of 26

Classification of microarray gene profiles

Gene expression data is considered a linear combination of independent components having specific biological interpretations.

Biomedical scientists can use ICA to explore gene expression features and discover underlying biological information from large microarray data sets.

Data Mining_Anoop Chaturvedi

11

12 of 26

Telecommunications:

Wireless Communication: Separating signals transmitted over a shared channel in wireless communication systems.

Radio Frequency Identification (RFID): Separating signals from different RFID tags in a crowded environment.

Speech Enhancement and Acoustic Monitoring:

  • Separating speech signals from background noise to enhance speech quality in communication systems.
  • Separating sounds of interest (e.g., animal calls, environmental sounds) from background noise

Data Mining_Anoop Chaturvedi

12

13 of 26

Image Processing:

Image Decomposition: Separating mixed images captured by sensors (e.g., satellite images) into constituent components (e.g., vegetation, buildings, water bodies).

Medical Imaging:

Separating different tissue types in medical images for diagnostic purposes.

Financial Signal Processing:

Separating different market factors contributing to stock price movements for analysis and prediction.

Data Mining_Anoop Chaturvedi

13

14 of 26

Portfolio Management:

Separating signals related to different financial assets to optimize portfolio allocation.

Seismic Signal Processing: Separating seismic signals from different sources (e.g., earthquakes, human activities) for monitoring seismic activity.

Data Mining_Anoop Chaturvedi

14

15 of 26

Independent Component Analysis (ICA)

Seeks to uncover hidden variables in high dimensional data.

Three Assumptions of ICA models

  1. Each measured signal is a linear combination of the sources.
  2. The source signals (Hidden variables) are statistically independent of each other.
  3. The values in each source signal have non-Gaussian distribution.

Data Mining_Anoop Chaturvedi

15

16 of 26

  • The number of inputs and outputs are the same.
  • Linear mixture of an unknown number of unknown variables where mixing coefficient are also unknown.
  • Since outputs are mutually independent, we can not drop components as in Principal Component Analysis.

Data Mining_Anoop Chaturvedi

16

17 of 26

  •  

Data Mining_Anoop Chaturvedi

17

18 of 26

  •  

Data Mining_Anoop Chaturvedi

18

19 of 26

  •  

Data Mining_Anoop Chaturvedi

19

20 of 26

  •  

Data Mining_Anoop Chaturvedi

20

21 of 26

  •  

Data Mining_Anoop Chaturvedi

21

22 of 26

  •  

Data Mining_Anoop Chaturvedi

22

23 of 26

  •  

Data Mining_Anoop Chaturvedi

23

24 of 26

  •  

Data Mining_Anoop Chaturvedi

24

25 of 26

  •  

Data Mining_Anoop Chaturvedi

25

26 of 26

  •  

Data Mining_Anoop Chaturvedi

26