Development of software for the segmentation of text areas in real-scene images
Authors: Lobanova Viktoriya Aleksandrovna, Ivanova Yuliya Aleksandrovna
Journal: Computer Optics
Section: Image processing, pattern recognition
Issue: Vol. 46, No. 5, 2022.
Free access
This article describes the design and development of a neural network algorithm for segmenting text areas in real-scene images. After a review of the available neural network models, the U-Net model was chosen as the basis, and an algorithm for detecting text areas in real-scene images was proposed and implemented. Experimental training of the network was used to determine its parameters, such as the size of the input images and the number and types of network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by rotating, compressing, and splitting the images. The results obtained surpass those of competing methods in terms of F-measure.
Keywords: deep learning, U-Net architecture, image processing, image segmentation, text areas, real-scene images
Short URL: https://sciup.org/140296225
IDR: 140296225 | DOI: 10.18287/2412-6179-CO-1047
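
The abstract compares methods by F-measure but does not say how it is computed. The sketch below is a minimal illustration, assuming a pixel-wise comparison between a predicted binary text mask and a reference mask; the article may use a region-level or other variant. The function name `f_measure`, the `beta` parameter, and the toy masks are hypothetical and are not taken from the article.

```python
def f_measure(pred_mask, true_mask, beta=1.0):
    """Pixel-wise F-measure between two binary masks.

    Both masks are lists of rows holding 0 (background) or 1 (text).
    This is an assumed evaluation scheme, not the article's stated one.
    """
    tp = fp = fn = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1          # text pixel correctly detected
            elif p and not t:
                fp += 1          # background labelled as text
            elif t and not p:
                fn += 1          # text pixel missed
    if tp == 0:
        return 0.0               # no overlap: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy 2x3 masks, purely illustrative.
predicted = [[1, 1, 0],
             [0, 1, 0]]
reference = [[1, 0, 0],
             [0, 1, 1]]
score = f_measure(predicted, reference)
```

With `beta=1.0` the value is the harmonic mean of precision and recall, so a method scores high only when it both finds most text pixels and avoids labelling background as text.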