Clustering of media content from social networks using bigdata technology

Автор: Rytsarev Igor Andreevich, Kirsh Dmitriy Victorovich, Kupriyanov Alexandr Victorovich

Журнал: Компьютерная оптика @computer-optics

Рубрика: Численные методы и анализ данных

Статья в выпуске: 5 т.42, 2018 года.

Бесплатный доступ

The article deals with one of the key problems of the social network analysis - the problem of classifying accounts based on media content uploaded by users. The main difficulties are the content heterogeneity (both in format and subject) and the large volumes of data, which leads to excessive computational complexity of its processing and often to the complete inefficiency of traditional analysis methods. In the article, we discuss an approach to the clustering of media content from social networks based on textual annotation using BigData technology - a modern and efficient tool that allows to solve the problem of large data volume processing. To carry out computational experiments, a large sample of heterogeneous images (photographs, paintings, postcards, etc.) was collected from real Twitter accounts. The results confirmed the high quality of media content clustering, the average error was around 5 %.

Еще

Технология bigdata, алгоритм k-means, googlenet, cluster analysis, bigdata technology, text annotation, social networks, media content analysis, k-means clustering

Короткий адрес: https://sciup.org/140238453

IDR: 140238453   |   DOI: 10.18287/2412-6179-2018-42-5-921-927

Статья научная