DEVELOPMENT OF A SYSTEM FOR AUTOMATIC DETECTION AND RECOGNITION OF TEXTUAL SYMBOLS USING NEURAL NETWORKS

Authors

Keywords:

Symbol recognition, neural networks, CRAFT, machine learning, classification

Abstract

The aim of the study is to develop software for the automatic detection and recognition of symbols in images using neural networks. The system employs two neural networks: the first analyzes the full-size image, identifies symbols, and determines their coordinates in the QUAD format. The second network processes the extracted image fragments and classifies the symbols based on the training dataset. Text recognition is performed character by character, making the CRAFT model an optimal choice for symbol detection. The symbol recognition model is built using a classical classifier architecture that includes two convolutional layers with pooling and four fully connected layers. The results demonstrate that the combination of the two models ensures a recognition accuracy of 83.87% on the test dataset. Testing confirmed that the main factors influencing accuracy are the quality of the training dataset, learning parameters, and the method of symbol localization. The practical significance of the results lies in the potential application of the developed system for automating text data processing in various fields, such as document management, computer vision, and sign recognition.

References

PyTorch documentation – PyTorch 2.5 documentation. PyTorch. URL: https://pytorch.org/docs/stable/index.html

OpenCV: OpenCV modules. OpenCV documentation index. URL: https://docs.opencv.org/4.x/index.html

Character Region Awareness for Text Detection. arXiv.org. URL: https://doi.org/10.48550/arXiv.1904.01941

K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.

O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In MICCAI, pages 234–241. Springer, 2015.

Papers with Code - ICDAR 2015 Dataset. The latest in Machine Learning | Papers With Code. URL: https://paperswithcode.com/dataset/icdar-2015 (date of access: 21.01.2025).

Papers with Code - ICDAR 2017 Dataset. The latest in Machine Learning | Papers With Code. URL: https://paperswithcode.com/dataset/icdar-2017 (date of access: 21.01.2025).

GitHub - ankush-me/SynthText: Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016. GitHub. URL: https://github.com/ankush-me/SynthText (date of access: 21.01.2025).

GitHub - Belval/TextRecognitionDataGenerator: A synthetic data generator for text recognition. GitHub. URL: https://github.com/Belval/TextRecognitionDataGenerator (дата звернення: 08.12.2024).

Home | AimStack. AimStack. URL: https://aimstack.io/ (date of access: 21.01.2025).

Published

2025-04-16

How to Cite

[1]
Pelekh, H. and Bulatetskyi , V. 2025. DEVELOPMENT OF A SYSTEM FOR AUTOMATIC DETECTION AND RECOGNITION OF TEXTUAL SYMBOLS USING NEURAL NETWORKS. Applied Problems of Computer Science, Security and Mathematics. 4 (Apr. 2025), 33–39.