The application of big data technologies for organization of extracting, flow and store data about non-resident companies
Автор: Papoyan Vladimir, Korenkov Vladimir, Kadochnikov Ivan
Журнал: Сетевое научное издание «Системный анализ в науке и образовании» @journal-sanse
Статья в выпуске: 3, 2019 года.
Бесплатный доступ
Banks need to establish if their clients are tax evasion companies or run real business. The development of national information resource for retrieval and analysis of information on non-resident companies is one of the key to solve the problem. The application of Big Data technologies is necessary for implementation of the efficient resource. Therefore, problems such as extracting, flow and stores data about companies from national registers are considered in the article. The application of technologies and approaches described in this article allow to achive stable performance and support facilitate of the system. The implementation is tested on two registers of companies The Insolvency Service and Companies House.
Big data, web scraping, data flow, apache kafka, apache nifi
Короткий адрес: https://sciup.org/14122703
IDR: 14122703