Neural network classification of videos based on a small number of frames
Автор: Smirnov A.V., Parfenov D.D., Tishchenko I.P., Kurshev E.P., Znamenskij S.V.
Журнал: Программные системы: теория и приложения @programmnye-sistemy
Рубрика: Математическое моделирование
Статья в выпуске: 4 (63) т.15, 2024 года.
Бесплатный доступ
The article proposes a method for neural network classification of short videos. The classification problem is considered from the point of view of reducing the number of operations required to categorize videos. The proposed solution consists of using a small number of frames (no more than 10) to perform classification using the lightest neural network architecture of the ResNet family of models. As part of the work, a proprietary training dataset was created, consisting of three classes: “animals”, “cars” and “people”. As a result, a classification accuracy of 79% was obtained, a database of classified videos was formed, and an application with GUI elements was developed for interacting with the classifier and viewing the results.
Video classification, dataset, neural networks, graphical user interface
Короткий адрес: https://sciup.org/143183791
IDR: 143183791 | DOI: 10.25209/2079-3316-2024-15-4-79-96