Automatic Speech Recognition coupled LLM
Автор: Yurij G. Sidorov, Vladimir L. Malykh, Aleksey N. Kalinin, Olga S. Yelistratova
Журнал: Программные системы: теория и приложения @programmnye-sistemy
Рубрика: Медицинская информатика
Статья в выпуске: 6 (69) т.16, 2025 года.
Бесплатный доступ
One of the barriers preventing the widespread use of speech medical data entry in HIS is the insufficient consumer quality of texts obtained after transcription. Not all medical terms and words of the general lexicon are recognized correctly, the coordination of words by gender, number and case is disrupted, the text is not well formatted from the point of view of grammar. All this requires further revision of the text. Another difficult problem is the need to bring the text to the structure of the medical document in HIS. The document structure can be quite complex, contain many elements, and have requirements for the type and format of the data in the structure elements. Speech input can only be partially used to generate a document, and the missing data can be taken from a custom template. To solve these problems, we propose to use LLM as a corrector of speech transcription results, an integrator of text data and text data from a template, and a structurer of the resulting data. The paper proposes a solution architecture for the input of speech medical data based on the composition of the transcription system and LLM. A methodology for conducting solution tests is proposed, including the preparation of a dataset and a metric for calculating the quality of the solution. The implementation of the solution based on a free and proprietary component is described. The results can be used in the development and evaluation of AI systems used for speech data input, and not only in medicine.
Medical information systems, MIS, artificial intelligence, AI, speech input, transcription system, large language models, LLM
Короткий адрес: https://sciup.org/143185295
IDR: 143185295 | УДК: 61:007.51:316 | DOI: 10.25209/2079-3316-2025-16-6-197-219