Application of deep learning methods in the problems of text image segmentation
Authors: Burikova A.G., Ershov N.M.
Journal: Сетевое научное издание «Системный анализ в науке и образовании» (online scientific journal "System Analysis in Science and Education")
Issue: 2, 2024.
Free access
The paper addresses the problem of text image segmentation, whose purpose is to identify the text blocks in a document image that correspond to columns, headers, footers, etc. Existing image segmentation methods are reviewed, including those designed for finding and extracting text blocks in images. Both classical methods and methods based on artificial neural networks are analyzed. To solve this problem, an approach based on convolutional neural networks and the U-Net model is proposed. A method for automatically generating training examples for the neural network is described. The processes of configuring, training, and testing the model are considered. The results of a numerical study of the trained models on real data are presented.
Image segmentation, pattern recognition, deep learning, convolutional neural networks, U-Net model
Short URL: https://sciup.org/14131164
IDR: 14131164