1 of 12

Dynamical System Modeling and Stability Investigation�DSMSI-2023

Dedicated to the 77th anniversary of the outstanding Ukrainian scientist

professor Denys Khusainov

December 19-21, 2023, Kyiv, Ukraine

The system for recognizing useful information of the client's ID-card based on machine learning technologies

Oleksii Bychkov, Liudmyla Zubyk, Dmytro Gololobov, Yaroslav Isaienkov, Ganna Grynkevych, Anastasiia Ivanytska

2 of 12

The relevance of research

2022 IEEE Third International Conference on System Analysis & Intelligent Computing (SAIC)

2

Scientific task: to find the approach to recognizing text from the image of ID card, which will allow to develop the dictionary formation model for interaction with the server

The approaches to evaluating the effectiveness of information technology :

  1. Models based on Tesseract technology of recognition
  2. Models used Paddle Optical Character Recognition ion (OCR)

The actual problems:

  1. the existing approaches to text recognition often require image pre-processing models
  2. the quality of the input image, its brightness, position relative to the camera and size
  3. the complex output format after text recognition

Contemporary scientists:

  1. Q. Uddin, T. Bui, T. Nguyen, A. Jatowt, M. Coustaty, Ch. Padole, U. Sh. Verma, Pr. Gujral, Mr. Kumar, C. Madan Kumar, D. Arulanantham, Ivanytska A., Zubyk L, V. Martsenyuk, O. Bychkov, K. Merkulova, Y. Zhabska, Dakhno, N., Barabash, O., Shevchenko, H., Leshchenko, O., Dudnik A, M. Kozlenko, V. Sendetskyi, [13] E. Zhang, V. A. Putra, K. Leung
  2. M. R. M. Ribeiro, D. Julio, Phan Van Hoai, H.-Th. Duong, N. S. Yusman, Kr. Olejniczak, M. Sulc, D. Yan, S. Shen, D. Wang, F. Borisyuk, A. Gordo, D. Yu, X. Li, X. Zu, Ch. Zhang

3 of 12

2022 IEEE Third International Conference on System Analysis & Intelligent Computing (SAIC)

3

The aim of this study

develop the model for forming the dictionary based on text recognition technologies from document images and increase the accuracy of information transfer in the client-service system

The following tasks:

  1. analysis of text recognition technologies on documents;
  2. select the optimal module from two approaches for recognizing text images
  3. determine the stages of developing the model of the dictionary formation which based of text recognition technologies, which was selected;
  4. evaluate the adequacy of the proposed model on test data.

4 of 12

Text recognition tools based on machine learning technologies

Figure 1: Sequence of data processing recognition system

Dynamical System Modeling and Stability Investigation, DSMSI-2023

5 of 12

Text recognition tools based on machine learning technologies

5.1

where S – number of substitutions;

D– number of deletions;

I – number of insertions;

N – number of characters in reference text.

N=S+D+C (where C- number of correct characters).

5.2

Dynamical System Modeling and Stability Investigation, DSMSI-2023

6 of 12

Table 1�The quality of recognition according to the CER and WER parameters with address date of Chinese user’s information

Dynamical System Modeling and Stability Investigation, DSMSI-2023

File name

OCR address

Real address

CER, %

WER, %

8-1.jpg

'广州市南沙区横沥镇大安 北街77号

'广州市南沙区横沥镇大安 北街77号

98.0

 

2.0

6-1.jpg

广州市番禺区洛浦街东乡 四街四巷七横巷2号

广州市番禺区洛浦街东乡 四街四巷七横巷2号

98.7

1.8

7-1.jpg

广州市番禺区洛溪新吉 祥道一幢之二601房

广州市番禺区洛溪新城吉 祥道一幢之二601房

96.7

3.3

1.jpeg

安徽省涡阳县西阳镇庙 行政村王大庄自然村049 号

安徽省涡阳县西阳镇文庙 行政村王大庄自然村049 号

95.8

4.3

4.jpg

河北省市郭村乡北 平庄村049号

河北省河间市郭村乡北太 平庄村049号

89.5

11.3

7 of 12

Text recognition tools based on machine learning technologies

Figure 2: The result of text recognition by the Paddle OCR system

Dynamical System Modeling and Stability Investigation, DSMSI-2023

8 of 12

Development of the text processing model for dictionary formation

The formation of the dictionary of useful information, which looks like this:

"name":"张*舒",

"sex":"女",

"nation":"汉",

"birth":"197***512",

"address":"广州市番**洛溪新城吉祥道十幢之二601房",

"idcard":"441*******0427

Dynamical System Modeling and Stability Investigation, DSMSI-2023

9 of 12

Development of the text processing model for dictionary formation

The next stages of forming the dictionary of useful information:

Dynamical System Modeling and Stability Investigation, DSMSI-2023

Combining the recognition result into one string

Function to find part of text

Find id_number after the characters '公民身份号码' in the client's passport

Using of regular expression for search id_number

Search data of birthday after symbols '出生','住址'

Find address after characters '住址', '公民身份号码'

Find id_number after '住址'

Formation of the result table styling

10 of 12

Steps for forming system responses when working with the server:

Dynamical System Modeling and Stability Investigation, DSMSI-2023

Read the input image

Run the text recognition algorithm

Use text processing model for dictionary formation

Send valuable information to the server

If the result of forming the dictionary is error, then send the result – error to the document

11 of 12

Conclusions

As the result of the research, the model of forming dictionary basing on text recognition technologies, that uses machine learning technologies, was developed and implemented. The following stages of research were performed.

Firstly, existing approaches to recognition of text information from user documents in client-server systems were analyzed in order to solve the problem of user identification. Two approaches were selected for text recognition algorithm.

Secondly, the comparative analysis of the Optical Character Recognition library using Tesseract Engine and Paddle Optical Character Recognition was held. The feasibility and effectiveness of using Paddle OCR for analyzing documents with Chinese characters was substantiated.

Thirdly, the model of the dictionary formation which based of text recognition technologies was developed and tested.

Finally, the adequacy of the model of the dictionary formation was evaluated. Its effectiveness was verified on test data and the adequacy of the model was assessed based CER and WER. In contrast to existing approaches, the model showed the increase in text recognition accuracy by 1.8-11.3%, depending on the quality of the source image. The result is implemented in client-server applications for product solutions of Zhejiang Jimi IoT Technology Co., Ltd.

Dynamical System Modeling and Stability Investigation, DSMSI-2023

12 of 12

CONTACT USfor collaborative machine learning and data mining analysis��a.titova.wk@gmail.com�l.v.zubyk19@gmail.com �gololobov.dma@meta.ua �yisaienkov@gmail.com�Ggrynkevych@ukr.net