Russian text author's gender identification in forensic examination: probability-and-statistics method
Authors: Radbil Timur B., Markina Marina V.
Journal: Вестник Волгоградского государственного университета. Серия 2: Языкознание @jvolsu-linguistics
Section: Development and Functioning of the Russian Language
Article in issue: No. 5, Vol. 20, 2021.
Free access
The article reports intermediate results in developing and improving a computerized model of Russian text authorization based on the combined application of probabilistic and statistical methods. The study aims to describe the new capabilities of the system as applied to diagnostic forensic examinations, specifically to detecting the gender of the alleged author of a text. The work presents the next stage of fine-tuning and testing an improved version of the computer program “CTA” (computerized text authorization), which at this stage was adapted to determine and compare stable relative frequencies of correlation coefficients (ratios of specified linguistic phenomena at different levels of the language system) in texts written by men and by women. The research material consists of continuously updated primary bases of literary texts of the 19th and 21st centuries (4 bases, respectively). The work shows that texts written by men and by women differ significantly in such correlation coefficients as average word length, average sentence length, objectivity coefficient, quality coefficient, activity coefficient, dynamism coefficient, connectivity coefficient, etc. Experimental verification of the results has shown that the accuracy of gender determination at this stage of the study is approximately 65%. This figure can be improved significantly by increasing the volume and refining the quality of the databases and/or by using new models for calculating the correlation coefficients (Spearman’s model, etc.).
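As an illustration of the two simplest coefficients named in the abstract, the following minimal Python sketch computes average word length and average sentence length for a text. It is not the CTA program: the function names, the regular-expression tokenization, and the sentence-splitting rule are assumptions made here for illustration, and the abstract does not say how CTA defines or computes these values.

```python
import re

# Hypothetical helpers for illustration only; not taken from the CTA program.
WORD_RE = re.compile(r"[^\W\d_]+")   # runs of Unicode letters (Cyrillic included)
SENT_SPLIT_RE = re.compile(r"[.!?…]+")  # naive sentence boundary (assumption)


def average_word_length(text: str) -> float:
    """Mean number of letters per word; 0.0 for a text with no words."""
    words = WORD_RE.findall(text)
    return sum(len(w) for w in words) / len(words) if words else 0.0


def average_sentence_length(text: str) -> float:
    """Mean number of words per sentence; 0.0 for a text with no sentences."""
    # Keep only fragments that contain at least one letter.
    sentences = [s for s in SENT_SPLIT_RE.split(text) if WORD_RE.search(s)]
    words = WORD_RE.findall(text)
    return len(words) / len(sentences) if sentences else 0.0


sample = "Мама мыла раму. Кот спал!"
print(average_word_length(sample), average_sentence_length(sample))
```

This tokenizer splits hyphenated words into parts and treats every run of terminal punctuation as a sentence boundary, so abbreviations and ellipses would need more careful handling in real forensic work.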
Keywords: text authorization, computer text authorization, gender, forensic studies in text authorization, automatic text processing, probability-and-statistics method, applied linguistics
Short URL: https://sciup.org/149139434
IDR: 149139434 | DOI: 10.15688/jvolsu2.2021.5.4