Method for analyzing the structure of noisy images of administrative documents
Бесплатный доступ
The problem of extracting content elements (fields) from the images of administrative documents via descriptions of anchoring elements is considered. Administrative documents contain static elements and content elements (filled information). The static objects of the document model are the lines of the document structure and the words. Sets of objects united by properties and relationships are described. The text descriptor can contain attributes that distinguish it from similar descriptors. We suggest using combined descriptors of line segments and words. We showed experimentally that the extraction of object sets improves the recognition accuracy of the document fields by 17% and the accuracy of information extraction by 16%. For optical character recognition, we employed SDK Smart Document Engine in the experiment.
Noisy image, document recognition, special text point, descriptor
Короткий адрес: https://sciup.org/147240332
IDR: 147240332 | DOI: 10.14529/mmp220407
Список литературы Method for analyzing the structure of noisy images of administrative documents
- Rusinol M., Frinken V., Karatzas D., Bagdanov A.D., Llados J. Multimodal Page Classification Inadministrative Document Image Streams. International Journal on DocumentAnalysis and Recognition, 2014, vol. 17, no. 4, pp. 331–341. DOI: 10.1007/s10032-014-0225-8
- Jain R., Wigington C. Multimodal Document Image Classification. Document Analysis and Recognition, 2019, vol. 2019, pp.71–77. DOI: 10.1109/ICDAR.2019.00021
- Qasim S.R., Mahmood H., Shafait F. Rethinking Table Recognition Using Graph Neural Networks. Computer Vision and Pattern Recognition, 2019, vol. 1, pp. 142–147. DOI: 10.1109/ICDAR.2019.00031
- Bellavia F. SIFT Matching by Context Exposed. Transactions on Pattern Analysis and Machine Intelligence, 2022, vol. 2022, pp. 1–17. DOI: 10.1109/TPAMI.2022.3161853
- Bay H., Tuytelaars T., Luc Van Goolab. Speeded-Up Robust Features (SURF). Computer Vision and Image Understanding, 2006, vol. 110, no. 3, pp. 404–417. DOI: 10.1016/j.cviu.2007.09.014
- Slavin O., Andreeva E., Paramonov N. Matching Digital Copies of Documents Based on OCR. Control and Modeling Problems, 2019, vol. 2019, pp. 177–181. DOI: 10.1109/CSCMP45713.2019.8976570
- Slavin O., Arlazarov V., Tarkhanov I. Models and Methods Flexible Documents Matching Based on the Recognized Words. Cyber-Physical Systems: Advances in Design and Modelling, 2021, vol. 350, pp. 173–184. DOI: 10.1007/978-3-030-67892-0_15
- Deza M.M., Deza E. Encyclopedia of Distances. Berlin, Springer-Verlag, 2009.
- Matas J., Galambos C., Kittler J. Robust Detection of Lines Using the Progressive Probabilistic Hough Transform. Computer Vision and Image Understanding, 2000, vol. 78, issue 1, pp. 119–137. DOI: 10.1006/cviu.1999.0831
- Grompone von Gioi R., Jakubowicz J., Morel J.M. On Straight Line Segment Detection. Journal of Mathematical Imaging and Vision, 2008, vol. 32, pp. 313–347. DOI: 10.1007/s10851-008-0102-5
- Grompone von Gioi R., Jakubowicz J., Morel J.M., Randall G. LSD: A Fast Line Segment Detector with a False Detection Control. Transactions on Pattern Analysis and Machine Intelligence, 2010, vol. 32, issue 4, pp. 722–732. DOI: 10.1109/TPAMI.2008.300
- Emaletdinova L., Nazarov M. Construction of a Fuzzy Model for Contour Selection. Studies in Systems, Decision and Control, 2022, vol. 417, pp. 243–246. DOI: 10.1007/978-3-030-95116-0_20
- Zlobin P., Chernyshova Y., Sheshkus A., Arlazarov V.V. Character Sequence Prediction Method for Training Data Creation in the Task of Text Recognition. Machine Vision, 2021, vol. 2021, article ID: 120840, 10 p. DOI: 10.1117/12.2623773
- Matalov D., Usilin S., Arlazarov V.V. About Viola–Jones Image Classifier Structure in the Problem of Stamp Detection in Document Images. Machine Vision, 2021, vol. 2021, article ID: 11605, 16 p. DOI: 10.1117/12.2586842
- Arlazarov V., Voysyat Ju.S., Matalov D., Nikolaev D., Usilin S.A. Evolution of the Viola- Jones Object Detection Method: A Survey. Bulletin of the South Ural State University. Mathematical Modelling, Programming and Computer Software, 2021, vol. 14, no. 4, pp. 5–23. DOI: 10.14529/mmp210401
- Roy P.P., Pal U., Llados J. Seal Detection and Recognition: An Approach for Document Indexing. Document Analysis and Recognition, 2015, vol. 2015, article ID: 367879, 15 p. DOI: 10.1109/ICDAR.2009.128
- Katsuhiko U. Extraction of Signature ad Seal Imprint from Bankchecks by Using Color Information. Document Analysis and Recognition, 1995, vol. 1995, pp. 665–668. DOI: 10.1109/ICDAR.1995.601983
- Matalov D., Usilin S., Arlazarov V.V. Modification of the Viola-Jones Approach for the Detection of the Government Seal Stamp of the Russian Federation. Machine Vision, 2019, vol. 2019, article ID: 10411, 11 p. DOI: 10.1117/12.2522793
- Marchenko A.E., Ershov E.I., Gladilin S.A. The System for Parsing a Document Specified by Attributes of Structural Elements and the Rrelations between Structural Elements. Trudy ISA RAN, 2017, vol. 67, no. 4, pp. 87–97. (in Russian)