Workshop DATAI-IESE
May 6th, 2024
Diego Borro
Computer Science PhD
Vision and Robotics research line at CEIT
Research Professor at TECNUN
dborro@ceit.es
AI techniques for image processing
Index
3
Context
Deep Learning
Convolutional Neural Networks
Fully Connected Neural Networks
4
Image processing techniques
Problem | Image information | Complexity | Labeling |
Classification | Low | Easy | Easy |
Detection | Medium | Medium | Medium |
Segmentation | High | Hard | Hard |
Types of problems and techniques in image processing
5
Classification problems
Types of problems and techniques in image processing
6
Classification problems
Initial dataset (ImageNet,…)
CNN pretrained
Weights optimization
Labeled data
Application dataset
CNN trained with own data
Weights optimization
Labeled data
Test dataset
Inference
Confussion matrix
Metrics computation
Initial model weights
Final model weights
Final layers design
Transfer learning
Network modification
Hyperparameters tuning
Types of problems and techniques in image processing
7
Detection problems
Types of problems and techniques in image processing
8
Segmentation problems
Types of problems and techniques in image processing
9
Black box and Explainable AI (XAI)
Techniques for unveiling the black box: XAI and model calibration
10
Techniques for unveiling the black box: XAI and model calibration
eXplainable AI
[1] S. Ali, et al., “Explainable artificial intelligence (xai): What we know and what is left to attain trustworthy artificial intelligence,” Information Fusion, p. 101805, 2023.
[2] Ribeiro, MT., et al., “Why Should I Trust You?": Explaining the Predictions of Any Classifier”, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.
[3] SM Lundberg, SI Lee. “A unified approach to interpreting model predictions”, Advances in neural information processing systems, 2017.
In a FNN (fully-connected neural network), neurons learn representation and patterns that is difficult to extract and present in a human-readable form
They try to understand the importance of features by seeing how predictions change when input features are perturbed, removed of changed (Bias detection!!)
11
eXplainable AI
[4] M. Aouayeb, et. al. “Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition”, ArXiv, 2021.
CNN are focused on image processing problems so, those patterns are images!!
Deeper representation in a CNN capture high-level abstracts or visual concepts
Convolutional layers naturally retain the spatial information of the input data
Shapes and patterns are detecting at successive layers
[4]
Techniques for unveiling the black box: XAI and model calibration
12
Some XAI techniques for CNNs
Name | Focus | Eq |
Layer visualization | Last convolutional layer | |
Saliency maps [5] | Impact in the output respected to input changes (pixels) | |
Grad-CAM [6] | Impact in the output respected to FM changes (high-level features) | |
Attention maps [7] | Image areas where the model pays attention | |
Guided Backpropagation [8] | Impact in the output respected to positive input changes | |
Integrated Gradients [9] | Impact in the output respected to changes in N inputs (pixels) | |
[5] K. Simonyan, A. Vedaldi, and A. Zisserman, “Deep inside convolutional networks: Visualising image classification models and saliency maps,” arXiv preprint arXiv:1312.6034, 2013.
[6] R. R. Selvaraju, A. Das, R. Vedantam, M. Cogswell, D. Parikh, and D. Batra, “Grad-cam: Why did you say that?” Nov. 2016.
[7] Alexey Dosovitskiy y col. “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale”. En: CoRR abs/2010.11929 (2020). arXiv: 2010.11929. url: https://arxiv.org/ abs/2010.11929.
[8] J.T. Springenberg, A. Dosovitskiy, T. Brox, and M. Riedmiller, “Striving for simplicity: the all convolutional net”, Proceedings of the International Conference on Learning Representations (ICLR 2015).
[9] M. Sundararajan, A. Taly, and Q. Yan, “Axiomatic Attribution for Deep Networks”, Proceedings of the 34th International Conference on Machine Learning (ICML’17), Vol. 70, pp. 3319-3328. August 2017.
Techniques for unveiling the black box: XAI and model calibration
13
Which technique is better?
Techniques for unveiling the black box: XAI and model calibration
Test dataset
Representation
CNN
Black-box model
XAI Explanations
Test ground truth masks
Convert to masks
IoU
Metrics evaluation
IoU
IoU
IoU
IoU
IoU
Comparison and test dataset results
14
Model calibration
[10] Guo, Chuan, Pleiss, Geoff, Sun, Yu,Weinberger, Kilian Q, (2017). On calibration of modern neural networks. International Conference on Machine Learning 1321-1330.
Techniques for unveiling the black box: XAI and model calibration
15
Projects examples
Projects examples and conclusions
16
Conclusions
Projects examples and conclusions
Workshop DATAI-IESE
May 6th, 2024
Diego Borro
Computer Science PhD
Vision and Robotics research line at CEIT
Research Professor at TECNUN
dborro@ceit.es
AI techniques for image processing