Texts of different emotional classes and their topic modeling
Authors: Kolmogorova A., Sun Q.
Journal: Vestnik Volgogradskogo gosudarstvennogo universiteta. Seriya 2: Yazykoznanie [Science Journal of Volgograd State University. Linguistics] @jvolsu-linguistics
Article in issue: no. 5, vol. 23, 2024.
Free access
This article examines how different emotional states are verbalized in Russian-language texts. It tests the hypothesis that texts of different emotional classes represent the denotative situation differently, and that this difference shows in their thematic structure and lexical content. The research material comprised eight corpora of Russian-language texts extracted from public pages of the social network VKontakte. Texts were selected by emotional hashtags corresponding to the eight basic emotions in H. Lövheim's model: anger, surprise, shame, enjoyment, disgust, distress, excitement, and fear. The correspondence between emotions and hashtags was established in a preliminary psycholinguistic experiment. To analyze the collection, we used computational topic modeling, which identifies statistically non-random groups of words (topics), and applied the BERTopic neural topic model to the collected data. The analysis showed that the eight emotional classes contain unequal numbers of topics, and that the number of topics does not correlate directly with the amount of data: a relatively small subcorpus may yield many topics, while a large one may yield few. The sets of words (tokens) that make up each topic differ across subcorpora, reflecting the specifics of the denotative situation, which is shaped by the speaker's emotional state. The idea that texts of different emotional classes differ in thematic "granularity" is given a theoretical justification.
Emotions, denotative situation, topic modeling, social network texts, Russian language
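The abstract says that the word sets making up each topic differ between subcorpora. BERTopic (Grootendorst, 2022) selects the representative words of a topic with a class-based TF-IDF score, W(t, c) = tf(t, c) · log(1 + A / tf(t)), where A is the average number of words per class and tf(t) is the frequency of term t across all classes. The sketch below is a minimal pure-Python illustration of that formula, not the authors' pipeline: it treats each emotional class as one class, uses raw counts without normalization, and runs on invented English tokens. The names `c_tf_idf`, `top_terms`, and `toy_corpus` are hypothetical.

```python
import math
from collections import Counter


def c_tf_idf(class_tokens):
    """Score terms per class with class-based TF-IDF.

    class_tokens maps a class label to the list of tokens in that class.
    Returns a dict: class label -> {term: score}.
    """
    # Term frequency inside each class.
    tf_per_class = {label: Counter(tokens) for label, tokens in class_tokens.items()}

    # Term frequency across all classes.
    total_tf = Counter()
    for counts in tf_per_class.values():
        total_tf.update(counts)

    # A: average number of words per class.
    avg_words = sum(len(tokens) for tokens in class_tokens.values()) / len(class_tokens)

    return {
        label: {
            term: tf * math.log(1 + avg_words / total_tf[term])
            for term, tf in counts.items()
        }
        for label, counts in tf_per_class.items()
    }


def top_terms(scores, k=3):
    """Return the k highest-scoring terms per class (ties broken alphabetically)."""
    return {
        label: [term for term, _ in sorted(terms.items(), key=lambda kv: (-kv[1], kv[0]))[:k]]
        for label, terms in scores.items()
    }


# Invented toy data (assumption): two emotional classes, already tokenized.
toy_corpus = {
    "fear": ["dark", "night", "alone", "dark", "noise", "night", "dark"],
    "anger": ["traffic", "boss", "late", "traffic", "noise", "boss", "traffic"],
}

scores = c_tf_idf(toy_corpus)
print(top_terms(scores, k=2))
```

In this toy run, a word shared by both classes ("noise") scores lower than a class-exclusive word with the same in-class count ("alone"), which is how the weighting surfaces vocabulary specific to one class.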
Short URL: https://sciup.org/149147498
IDR: 149147498 | DOI: 10.15688/jvolsu2.2024.5.5
References
- Belyanin V.P., 2000. Osnovy psikholingvisticheskoy diagnostiki (Modeli mira v literature) [Foundations of Psycholinguistic Diagnostics (World Models in Literature)]. Moscow, Trivola Publ. 248 p.
- Blei D.M., Ng A.Y., Jordan M.I., 2003. Latent Dirichlet Allocation. The Journal of Machine Learning Research, vol. 3, pp. 993-1022.
- Grootendorst M., 2022. BERTopic: Neural Topic Modeling with a Class-Based TF-IDF Procedure. DOI: 10.48550/arXiv.2203.05794
- Hakak N., Mohd M., Kirmani M., Mohd M., 2017. Emotion Analysis: A Survey. Proceedings of the International Conference on Computer, Communications and Electronics, 1–2 July, pp. 397-402. DOI: 10.1109/COMPTELIX.2017.8004002
- Kolmogorova A., Kalinin A., Malikova A., 2019. Tipologiya i kombinatorika verbalnykh markerov razlichnykh emotsionalnykh tonalnostey v internet-tekstakh na russkom yazyke [Types and Combinatorics of Verbal Markers of Different Emotional Tonalities in Russian-Language Internet Texts]. Vestnik Tomskogo gosudarstvennogo universiteta [Tomsk State University Journal], vol. 448, pp. 48-58. DOI: 10.17223/15617793/448/6
- Kositsina Yu.V., 2013. Statiko-dinamicheskaya model tematicheskoy organizatsii monologicheskogo dialektnogo teksta: avtoref. dis.... kand. filol. nauk [Statical and Dynamical Model of Topical Organization of Monological Text in Dialects. Cand. philol. sci. diss.]. Kemerovo. 213 p.
- Li H., Pang N., Guo S., Wang H., 2007. Research on Textual Emotion Recognition Incorporating Personality Factor. ROBIO 2007: IEEE International Conference on Robotics and Biomimetics, pp. 2222-2227.
- Lövheim H., 2012. A New Three-Dimensional Model for Emotions and Monoamine Neuro-Transmitters. Medical Hypotheses, vol. 78, pp. 341-348.
- Picard R., 1997. Affective Computing. Cambridge, The MIT Press. 306 p.
- Sia S., Dalmia A., Mielke S.J., 2020. Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics Too! DOI: 10.48550/arXiv.2004.14914
- Shakhovskiy V.I., 2010. Emotsii: dolingvistika, lingvistika, lingvokulturologiya [Emotions: Protolinguistics, Linguistics, Lingvoculturology]. Moscow, Librokom Publ. 128 p.