2 of 11

Introduction

Currently, recognition and identification technologies are very widely used in all areas. Technology designed to detect additional features on a person's face, such as the presence of glasses, a headgear, a medical mask and a beard, can be used to solve a number of important tasks.

The most obvious task that the researched technology can handle is the collection of statistical information (which may have commercial value). Such technology can be used in access control systems to improve security, for example by requiring users to remove a mask or glasses for identification, which can help prevent unauthorized access or bypassing.

The tool for recognizing additional features in a digital image of a human face can also be used for narrowly targeted purposes, for example, to track the observance of masking by visitors to a supermarket (or any other crowded place).

Dynamical System Modeling and Stability Investigation, DSMSI-2023

3 of 11

Task definition and solution methods

Dynamical System Modeling and Stability Investigation, DSMSI-2023

(1)

(2)

(3)

(4)

4 of 11

Experimental research

The purpose of this study is to determine the effectiveness of using convolutional neural networks for recognizing additional features in images of human faces obtained from streaming video.

It was decided to recognize 4 additional features of the face (presence of glasses, medical mask, headdress and beard) and additionally the gender of the person, accordingly, 5 convolutional neural networks were developed.

For some of the neural networks, datasets were found in public sources (for example, on Kaggle), for others datasets had to be generated independently (due to this, such datasets have fewer images).

Dynamical System Modeling and Stability Investigation, DSMSI-2023

5 of 11

Experimental research

Table 1. Accuracy and loss of results when training a CNN to detect the presence of a medical mask

Dynamical System Modeling and Stability Investigation, DSMSI-2023

Epoch	medical mask
Epoch	loss	accuracy	val_loss	val_acc
1	0.2038	0.9253	0.1975	0.9205
2	0.0806	0.9708	0.1073	0.9542
3	0.0509	0.9825	0.1353	0.9470
4	0.0438	0.9852	0.1949	0.9337
5	0.0267	0.9892	0.1483	0.9482
6	0.0268	0.9919	0.1698	0.9386
7	0.0170	0.9955	0.1644	0.9446
8	0.0210	0.9925	0.0994	0.9590
9	0.0143	0.9958	0.1191	0.9590
10	0.0191	0.9931	0.1421	0.9542
11	0.0178	0.9940	0.1900	0.9398
12	0.0100	0.9964	0.1042	0.9651
13	0.0104	0.9976	0.0713	0.9759
14	0.0188	0.9925	0.1316	0.9590
15	0.0131	0.9955	0.1400	0.9554
16	0.0104	0.9967	0.1263	0.9614
17	0.0067	0.9982	0.1707	0.9494
18	0.0039	0.9988	0.2399	0.9373
19	0.0034	0.9997	0.1668	0.9554
20	0.0037	0.9985	0.1537	0.9614

6 of 11

Experimental research

Figure 1. Accuracy graphs of the developed CNNs for: a) gender identification; b) glasses recognition; c) beard recognition; d) headdress recognition; e) medical mask recognition

Dynamical System Modeling and Stability Investigation, DSMSI-2023

7 of 11

Experimental research

Figure 2. Determination of additional features on human faces with the help of developed CNNs: a) a man with a beard in a headdress and a woman in glasses; b) a man with an improperly worn medical mask and sunglasses and a man without additional features on his face; c) a man with a beard and a headdress and a woman with a headdress; d) a man in a medical mask and a man with a beard and glasses.

Dynamical System Modeling and Stability Investigation, DSMSI-2023

8 of 11

Experimental research

Table 2. Final results of accuracy and loss on the validation sets for the developed CNNs

Dynamical System Modeling and Stability Investigation, DSMSI-2023

CNN purpose	Accuracy, %	Loss, %
Glasses recognition	99.72	1.39
Beard recognition	85	44.28
Headdress recognition	100	0.05
Medical mask recognition	96.14	15.37
Sex identification	98.62	3.77

9 of 11

Conclusion

This work is devoted to the study of the effectiveness of using convolutional neural networks for the task of recognizing additional features on a digital image of a human face. For this purpose, CNNs was developed to recognize the following additional features: the presence of glasses, a beard, a headdress, a medical mask, and a person's gender.

All the developed models showed a good result both on training and validation data sets (which is indicated by the presented accuracy tables and graphs), and during their experimental application for streaming video. The final values of accuracy and loss on the validation data sets for developed CNNs are as follows: for the detection of glasses, the accuracy is 99.72%, the loss is 1.39%; for detecting a medical mask, the accuracy is 96.14%, the loss is 15.37%; for headdress detection, the accuracy is 100%, the loss is 0.05% (such a high result is probably related to the not very good content of the input data set); for detecting a beard, the accuracy is 85%, the loss is 44.28%; for gender determination, the accuracy is 98.62%, the loss is 3.77%. Among the shortcomings of the developed models, a decrease in the accuracy of work when the image was taken in poor lighting conditions (which was expected) can be highlighted.

Dynamical System Modeling and Stability Investigation, DSMSI-2023

10 of 11

Conclusion

One of the interesting results obtained is the fact that the fewest images in the training sets had a beard and a headdress, and, at the same time, their models gave the opposite result - based on the obtained accuracy results, the model for recognizing the beard has the largest loss, and the model for recognizing headdress has the smallest. This result is most likely related to the small number of images in the training sets.

For further research on this topic, models can be developed to identify other additional features, for example, a mustache, as well as such features of a person as age, race, emotions. It is also possible to increase the number of resulting classes for CNNs, for example, to define different types of headdress or to define more types of glasses.

To improve the quality of the obtained results, it is possible to expand data sets, as well as apply additional methods of image preprocessing, which would help solve some difficulties, for example, reduce the influence of the level of illumination on the result of the work of CNNs.

Dynamical System Modeling and Stability Investigation, DSMSI-2023

1 of 11

2 of 11

3 of 11

4 of 11

5 of 11

6 of 11

7 of 11

8 of 11

9 of 11

10 of 11

11 of 11