Optimization strategies and evaluation methods for fine-tuning large language models

Основное

Автор: Denisenko V.V., Chesnikov L.S.

Журнал: Международный журнал гуманитарных и естественных наук @intjournal

Статья в выпуске: 4-1 (91), 2024 года.

Бесплатный доступ

This paper provides a comprehensive review of optimization strategies and evaluation methods for fine-tuning large language models. Key aspects such as the selection of loss function, optimizer, and learning rate tuning are discussed, along with evaluation methods including comparative experiments and specialized analysis techniques. Conclusions are drawn regarding the significance of fine-tuning in the development of NLP and its future directions.

Fine-tuning, language models, optimization, evaluation, loss function, optimizer, learning rate, overfitting, sensitivity analysis, cross-validation

Короткий адрес: https://sciup.org/170204831

IDR: 170204831 | DOI: 10.24412/2500-1000-2024-4-1-180-184

Статья научная