Detection and Tracking of Foot Movement in Kathak Dance
- Pranit Chawla (17EC35017), Department of E&ECE
- Advisor: Prof. Dr. Partha Pratim Das - Mentor: Saptami Ghosh
Introduction
Ladi Variations and foot patterns
Problem Formulation
Training data available
Example of movement
Image Preprocessing: Full - Cropped Image
Detection of feet movement
Literature Review
Computer Vision Approaches (Summary) (Done last semester)
ORB feature point tracking
N_features = 500
ORB feature point tracking
N_features = 5000
Observations from ORB feature point tracking
Background Subtraction and Median Tracking
Median tracking
Background Subtraction
Pre-Processing Techniques
Background Subtraction and Median Tracking
No movement
Left foot moving
Right foot moving
Background Subtraction and Median Tracking (Observations)
Background Subtraction with Optical Flow
Optical Flow for each image in the bottom row
Background Subtraction with Optical Flow
Movement detected by count of pixels
Background Subtraction with Optical Flow
Movement missed
Background Subtraction with Optical Flow (Observations)
2D- CNN : Foot Movement Detection
CNN
FC
In the air? On the ground
2D- CNN : Foot Movement Detection Observations
3D- CNN : Temporal Activity Recognition
Architecture of a 3-D CNN
Img Source: 3D Convolutional Neural Networks for Crop Classification with Multi-Temporal Remote Sensing Images (Ji et al., 2018)
3D- CNN : Temporal Activity Recognition
Moving
Not Moving
3-D CNN
3-D CNN
3D- CNN : Temporal Activity Recognition with Optical Flow
3-D CNN
3-D CNN
Fusion Module
Moving vs Not Moving
Quantitative Results (L2V3D4R1)
| Moving (Predicted) | Not Moving (Predicted) |
Moving (Actual) | 767 (True Positive) | 184 (False Negative) |
Not Moving (Actual) | 41 (False Positive) | 1356 (True Negative) |
Confusion Matrix with number of frames for L2V3D4R1
Accuracy | Precision | Recall | F1 score |
0.90 | 0.95 | 0.80 | 0.87 |
Classification Metrics for L2V3D4R1
Quantitative Results (L2V1D4R1)
| Moving (Predicted) | Not Moving (Predicted) |
Moving (Actual) | 751 (True Positive) | 198 (False Negative) |
Not Moving (Actual) | 166 (False Positive) | 1208 (True Negative) |
Confusion Matrix with number of frames for L2V1D4R1
Accuracy | Precision | Recall | F1 score |
0.84 | 0.82 | 0.79 | 0.80 |
Classification Metrics for L2V1D4R1
Quantitative Results (L2V2D4R1)
| Moving (Predicted) | Not Moving (Predicted) |
Moving (Actual) | 756 (True Positive) | 196 (False Negative) |
Not Moving (Actual) | 27 (False Positive) | 1419 (True Negative) |
Confusion Matrix with number of frames for L2V2D4R1
Accuracy | Precision | Recall | F1 score |
0.91 | 0.97 | 0.79 | 0.87 |
Classification Metrics for L2V2D4R1
3D- CNN : Temporal Activity Recognition with Optical Flow (Results)
Moving
Moving
Moving
Moving
Moving
Not Moving
Moving
Moving
3D- CNN : Temporal Activity Recognition with Optical Flow (Observations)
Tracking path of feet
Tracking of path of foot across frames
Overview of Tracking Pipeline
Histogram Equalisation
Otsu Thresholding
Colour Space Conversion to YCrCb
Refinement by mask multiplication and area thresholding
Skin Colour Based Thresholding
Contour Fitting and Tracking of centre of mass
Morphological Operations based refinement
Colour Space Conversion to YCrCb
Histogram Equalisation across Y channel
Initial
Equalised
Otsu Thresholding
Otsu Thresholding (Results)
Original
Mask
Original
Mask
Skin Based Segmentation
Finding segmentation threshold in HSV space
Skin Based Segmentation (Results)
Skin Based Segmentation (Observations)
Refinement using Morphological Operations
Skin Based Segmentation
Morphological operation based refinement
Refinement using Morphological Operations (Issues)
Figure 1: Single contour formed due to movement and occlusion
Figure 2: Third contour formed on top due to noise
Fixing issues of Morphological Refinement
Fixing issues of Morphological Refinement (Results)
Left contour
Initial mask
Left ones mask
Right ones mask
Right contour
Multiply
Multiply
Contour Fitting and tracking Centre of Mass
Contour Fitting and tracking Centre of Mass (Results)
Step wise Results overview
Qualitative Results on different dancers
Dancer 1
Dancer 4
Dancer 3 (Male)
Quantitative Result on L2V3D4R1
Quantitative Result on L2V2D4R1
Tracking pipeline (Limitations)
Future Experiments
References
Thank you