Dynamical System Modeling and Stability Investigation�DSMSI-2023
Dedicated to the 77th anniversary of the outstanding Ukrainian scientist
professor Denys Khusainov
December 19-21, 2023, Kyiv, Ukraine
USE OF CONVOLUTIONAL NEURAL NETWORKS FOR IDENTIFYING ADDITIONAL FEATURES ON A DIGITAL IMAGE OF HUMAN FACE
Kateryna Merkulova and Bohdan Pavliukh
Taras Shevchenko National University of Kyiv
�
Introduction
Currently, recognition and identification technologies are very widely used in all areas. Technology designed to detect additional features on a person's face, such as the presence of glasses, a headgear, a medical mask and a beard, can be used to solve a number of important tasks.
The most obvious task that the researched technology can handle is the collection of statistical information (which may have commercial value). Such technology can be used in access control systems to improve security, for example by requiring users to remove a mask or glasses for identification, which can help prevent unauthorized access or bypassing.
The tool for recognizing additional features in a digital image of a human face can also be used for narrowly targeted purposes, for example, to track the observance of masking by visitors to a supermarket (or any other crowded place).
2
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Task definition and solution methods
3
Dynamical System Modeling and Stability Investigation, DSMSI-2023
(1)
(2)
(3)
(4)
Experimental research
The purpose of this study is to determine the effectiveness of using convolutional neural networks for recognizing additional features in images of human faces obtained from streaming video.
It was decided to recognize 4 additional features of the face (presence of glasses, medical mask, headdress and beard) and additionally the gender of the person, accordingly, 5 convolutional neural networks were developed.
For some of the neural networks, datasets were found in public sources (for example, on Kaggle), for others datasets had to be generated independently (due to this, such datasets have fewer images).
4
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Experimental research
Table 1. Accuracy and loss of results when training a CNN to detect the presence of a medical mask
5
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Epoch | medical mask | |||
loss | accuracy | val_loss | val_acc | |
1 | 0.2038 | 0.9253 | 0.1975 | 0.9205 |
2 | 0.0806 | 0.9708 | 0.1073 | 0.9542 |
3 | 0.0509 | 0.9825 | 0.1353 | 0.9470 |
4 | 0.0438 | 0.9852 | 0.1949 | 0.9337 |
5 | 0.0267 | 0.9892 | 0.1483 | 0.9482 |
6 | 0.0268 | 0.9919 | 0.1698 | 0.9386 |
7 | 0.0170 | 0.9955 | 0.1644 | 0.9446 |
8 | 0.0210 | 0.9925 | 0.0994 | 0.9590 |
9 | 0.0143 | 0.9958 | 0.1191 | 0.9590 |
10 | 0.0191 | 0.9931 | 0.1421 | 0.9542 |
11 | 0.0178 | 0.9940 | 0.1900 | 0.9398 |
12 | 0.0100 | 0.9964 | 0.1042 | 0.9651 |
13 | 0.0104 | 0.9976 | 0.0713 | 0.9759 |
14 | 0.0188 | 0.9925 | 0.1316 | 0.9590 |
15 | 0.0131 | 0.9955 | 0.1400 | 0.9554 |
16 | 0.0104 | 0.9967 | 0.1263 | 0.9614 |
17 | 0.0067 | 0.9982 | 0.1707 | 0.9494 |
18 | 0.0039 | 0.9988 | 0.2399 | 0.9373 |
19 | 0.0034 | 0.9997 | 0.1668 | 0.9554 |
20 | 0.0037 | 0.9985 | 0.1537 | 0.9614 |
Experimental research
Figure 1. Accuracy graphs of the developed CNNs for: a) gender identification; b) glasses recognition; c) beard recognition; d) headdress recognition; e) medical mask recognition
6
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Experimental research
Figure 2. Determination of additional features on human faces with the help of developed CNNs: a) a man with a beard in a headdress and a woman in glasses; b) a man with an improperly worn medical mask and sunglasses and a man without additional features on his face; c) a man with a beard and a headdress and a woman with a headdress; d) a man in a medical mask and a man with a beard and glasses.
7
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Experimental research
Table 2. Final results of accuracy and loss on the validation sets for the developed CNNs
8
Dynamical System Modeling and Stability Investigation, DSMSI-2023
CNN purpose | Accuracy, % | Loss, % |
Glasses recognition | 99.72 | 1.39 |
Beard recognition | 85 | 44.28 |
Headdress recognition | 100 | 0.05 |
Medical mask recognition | 96.14 | 15.37 |
Sex identification | 98.62 | 3.77 |
Conclusion
This work is devoted to the study of the effectiveness of using convolutional neural networks for the task of recognizing additional features on a digital image of a human face. For this purpose, CNNs was developed to recognize the following additional features: the presence of glasses, a beard, a headdress, a medical mask, and a person's gender.
All the developed models showed a good result both on training and validation data sets (which is indicated by the presented accuracy tables and graphs), and during their experimental application for streaming video. The final values of accuracy and loss on the validation data sets for developed CNNs are as follows: for the detection of glasses, the accuracy is 99.72%, the loss is 1.39%; for detecting a medical mask, the accuracy is 96.14%, the loss is 15.37%; for headdress detection, the accuracy is 100%, the loss is 0.05% (such a high result is probably related to the not very good content of the input data set); for detecting a beard, the accuracy is 85%, the loss is 44.28%; for gender determination, the accuracy is 98.62%, the loss is 3.77%. Among the shortcomings of the developed models, a decrease in the accuracy of work when the image was taken in poor lighting conditions (which was expected) can be highlighted.
9
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Conclusion
One of the interesting results obtained is the fact that the fewest images in the training sets had a beard and a headdress, and, at the same time, their models gave the opposite result - based on the obtained accuracy results, the model for recognizing the beard has the largest loss, and the model for recognizing headdress has the smallest. This result is most likely related to the small number of images in the training sets.
For further research on this topic, models can be developed to identify other additional features, for example, a mustache, as well as such features of a person as age, race, emotions. It is also possible to increase the number of resulting classes for CNNs, for example, to define different types of headdress or to define more types of glasses.
To improve the quality of the obtained results, it is possible to expand data sets, as well as apply additional methods of image preprocessing, which would help solve some difficulties, for example, reduce the influence of the level of illumination on the result of the work of CNNs.
10
Dynamical System Modeling and Stability Investigation, DSMSI-2023
Thank you for your attention