Applying machine learning for segmentation of skeletal muscle ultrasound images
Автор: Mischenko E.A., Demin I.Yu.
Журнал: Проблемы информатики @problem-info
Рубрика: Прикладные информационные технологии
Статья в выпуске: 3 (64), 2024 года.
Бесплатный доступ
This paper discusses the use of machine learning techniques for segmenting ultrasound images of human skeletal muscles. Ultrasound is a non-invasive diagnostic tool that provides ample information about the structure and condition of organs and tissues. It is widely used in diagnostics, and modern ultrasound machines can produce high-resolution scans. The study’s significance lies in the difficulty and time required to interpret medical materials, as well as the potential for subjectivity and errors. One promising approach to assist clinicians is the use of machine learning models and methods. These techniques are commonly employed for tasks such as data analysis, diagnosis and prognosis, and medical material classification and segmentation. Machine learning models can aid inexperienced practitioners and expedite the provision of high-quality medical care. The aim of the paper is to create an accurate and efficient healthy skeletal muscle segmentation model using machine learning for medical imaging ultrasound method. Convolutional neural networks were used to build a model for segmentation of ultrasound images of skeletal muscles. U-net network architecture with different number of convolutional layers was used, as well as U-net++ network, which is a modification of the classical U-net. The U-Net architecture is one of the classical CNNs for image segmentation tasks, and it has been actively applied to biomedical images. Among the main advantages of the network is its ability to train well on a small amount of data. The U-Net++ network was proposed as a modification of the U-Net network architecture, which was designed to improve the network performance on medical image segmentation tasks. U-Net++ is based on the idea that the model will be more efficient and faster trained if the feature maps of the encoder and decoder are semantically similar. To train the neural network, a dataset of ultrasound images of healthy skeletal muscles from open sources was used, because to obtain a high-quality model it is not only necessary to tune the neural network competently, but also to train it on a sufficient, high quality and diverse dataset. The images were annotated manually using binary masks to indicate muscle tissue boundaries. A binary mask was created for each image, replacing the muscle layer boundaries with white pixels and the rest with black pixels. These masks serve as truth labels when training neural networks. The dataset was divided into training and validation samples, and the training data was augmented. To prevent data leakage, augmentation is performed after dividing the data into samples. Increasing the validation sample is not recommended as it will not improve the training accuracy. The final model was tested using a dataset of ultrasound images of skeletal muscles of volunteers obtained using a Verasonics acoustic system. The results of segmentation using the U-Net network, U-Net++ network, and their variants with increased number of convolution layers were compared. All training processes used the same dataset of skeletal muscle ultrasound images, which was divided into training and validation in the ratio of 8/2. Accuracy and loU were used as metrics for evaluation. Callbacks such as Early Stopping, which stops training when the validation error is unchanged and ReduceLROnPlateau designed to adaptively adjust the learning rate when there is no improvement on the monitored metric were used, both calls track the required parameters for a certain number of epochs. These callbacks were used to prevent overlearning. In the context of this work, where the task was to classify a small number of classes and small size images were used, the three-convolution network did not perform better than the two- convolution network. This is because the more complex features that a three-convolution network can learn were not necessary for the task at hand. Under such conditions, the simpler two convolution network can learn more efficiently because it has fewer parameters and requires less computational resources. In addition, the simpler network is less prone to overlearning, which is an important factor in tasks with small amounts of data. A neural network model was trained, analyzed and tested, and a model that can segment ultrasound images of skeletal muscles was developed. In this study, it was shown that neural networks of U-Net and U-Net++ architectures can be effectively used for segmentation of ultrasound images of skeletal muscles. The main result of the study is to obtain an effective neural network model and to confirm the potential of its application for segmenting ultrasound images of skeletal muscles. The application of neural networks in medicine has a number of advantages, including improved diagnostic accuracy, reduced labour costs for physicians and increased speed of data processing.
Machine learning, skeletal muscle, ultrasound images segmentation, convolutional neural network
Короткий адрес: https://sciup.org/143183465
IDR: 143183465 | DOI: 10.24412/2073-0667-2024-3-47-57