Accelerated preprocessing in task of searching substrings in a string

Бесплатный доступ

Introduction. A rapid development of the systems such as Yandex, Google, etc., has predetermined the relevance of the task of searching substrings in a string, and approaches to its solution are actively investigated. This task is used to create database management systems that support associative search. Besides, it is applicable in solving information security issues and creating antivirus programs. Algorithms of searching substring in a string are used in signature-based discovery tasks.Materials and Methods. The solution to the problem is based on the Aho-Corasick algorithm which is a typical technique of searching substrings in a string. At the same time, a new approach regarding preprocessing is employed.Research Results. The possibility of constructing the transition function and suffix references through suffix arrays and special mappings, is shown. The relationship between the prefix tree and suffix arrays was investigated, which provided the development of a fundamentally new method of constructing the transition and error functions...

Еще

String searching, aho-corasick algorithm, prefix tree, suffix array, information search, error function, transition function

Короткий адрес: https://sciup.org/142221962

IDR: 142221962   |   DOI: 10.23947/1992-5980-2019-19-3-290-300

Статья научная