The application of big data technologies for organization of extracting, flow and store data about non-resident companies

Автор: Papoyan Vladimir, Korenkov Vladimir, Kadochnikov Ivan

Журнал: Сетевое научное издание «Системный анализ в науке и образовании» @journal-sanse

Статья в выпуске: 3, 2019 года.

Бесплатный доступ

Banks need to establish if their clients are tax evasion companies or run real business. The development of national information resource for retrieval and analysis of information on non-resident companies is one of the key to solve the problem. The application of Big Data technologies is necessary for implementation of the efficient resource. Therefore, problems such as extracting, flow and stores data about companies from national registers are considered in the article. The application of technologies and approaches described in this article allow to achive stable performance and support facilitate of the system. The implementation is tested on two registers of companies The Insolvency Service and Companies House.

Еще

Big data, web scraping, data flow, apache kafka, apache nifi

Короткий адрес: https://sciup.org/14122703

IDR: 14122703

Статья научная