Accelerated preprocessing in task of searching substrings in a string
Автор: Mazurenko A.V., Boldyrikhin N.V.
Журнал: Вестник Донского государственного технического университета @vestnik-donstu
Рубрика: Информатика, вычислительная техника и управление
Статья в выпуске: 3 т.19, 2019 года.
Бесплатный доступ
Introduction. A rapid development of the systems such as Yandex, Google, etc., has predetermined the relevance of the task of searching substrings in a string, and approaches to its solution are actively investigated. This task is used to create database management systems that support associative search. Besides, it is applicable in solving information security issues and creating antivirus programs. Algorithms of searching substring in a string are used in signature-based discovery tasks.Materials and Methods. The solution to the problem is based on the Aho-Corasick algorithm which is a typical technique of searching substrings in a string. At the same time, a new approach regarding preprocessing is employed.Research Results. The possibility of constructing the transition function and suffix references through suffix arrays and special mappings, is shown. The relationship between the prefix tree and suffix arrays was investigated, which provided the development of a fundamentally new method of constructing the transition and error functions...
String searching, aho-corasick algorithm, prefix tree, suffix array, information search, error function, transition function
Короткий адрес: https://sciup.org/142221962
IDR: 142221962 | DOI: 10.23947/1992-5980-2019-19-3-290-300