Статьи журнала - Компьютерная оптика

Все статьи: 2553

Automatic target recognition algorithm for low-count terahertz images

Automatic target recognition algorithm for low-count terahertz images

Antsiperov Viacheslav Evgenievich

Статья научная

The paper presents the results of developing an algorithm for automatic target recognition in broadband (0.1-10) terahertz images. Due to the physical properties of terahertz radiation and associated hardware, such images have low contrast, low signal-to-noise ratio and low resolution - i.e. all the characteristics of a low-count images. Therefore, standard recognition algorithms designed for conventional images work poorly or are not suitable at all for the problem considered. We have developed a fundamentally different approach based on clustering 2D point clouds in accordance with a set of predefined patterns. As a result, we reduce the problem of target recognition to the problem of maximizing the image data likelihood with respect to the classes of model objects up to the size and position. The resulting recognition algorithm has a structure close to that of the well-known EM algorithm; its formal scheme is at the end of the paper.

Бесплатно

Benign and malignant breast tumors classification based on texture analysis and backpropagation neural network

Benign and malignant breast tumors classification based on texture analysis and backpropagation neural network

Wisudawati Lulu Mawaddah, Madenda Sarifuddin, Wibowo Eri Prasetyo, Abdullah Arman Adel

Статья научная

Breast cancer is a leading cause of death in women due to cancer. According to WHO, in 2018, it is estimated that 627.000 women died from breast cancer, that is approximately 15 % of all cancer deaths among women [3]. Early detection is a very important factor to reduce mortality by 25 - 30 %. Mammography is the most commonly used technique in detecting breast cancer using a low-dose X-ray system in the examination of breast tissue that can reduce false positives. A Computer-Aided Detection (CAD) system has been developed to effectively assist radiologists in detecting masses on mammograms that indicate the presence of breast tumors. The type of abnormality in mammogram images can be seen from the presence of microcalcifications and the presence of mass lesions. In this research, a new approach was developed to improve the performance of CAD System for classifying benign and malignant tumors. Areas suspected of being masses (RoI) in mammogram images were detected using an adaptive thresholding method and mathematical morphological operations. Wavelet decomposition is performed on the Region of Interest (RoI) and the feature extraction process is performed using a GLCM method with 4 statistical features, namely, contrast, correlation, entropy, and homogeneity. Classification of benign and malignant tumors using the MIAS database provided an accuracy of 95.83 % with a sensitivity of 95.23 % and a specificity of 96.49 %. A comparison with other methods illustrates that the proposed method provides better performance.

Бесплатно

Bidirectional Encoder representation from Image Transformers for recognizing sunflower diseases from photographs

Bidirectional Encoder representation from Image Transformers for recognizing sunflower diseases from photographs

Baboshina V.A., Lyakhov P.A., Lyakhova U.A., Pismennyy V.A.

Статья научная

This paper proposes a modern system for recognizing sunflower diseases based on Bidirectional Encoder representation from Image Transformers (BEIT). The proposed system is capable of recognizing various sunflower diseases with high accuracy. The presented research results demonstrate the advantages of the proposed system compared to known methods and contemporary neural networks. The proposed visual diagnostic system for sunflower diseases achieved 99.57 % accuracy on the sunflower disease dataset, which is higher than that of known methods. The approach described in the work can serve as an auxiliary tool for farmers, assisting them in promptly identifying diseases and pests and taking timely measures to treat plants. This, in turn, helps in preserving and enhancing the yield. This work can have a significant impact on the development of agriculture and the fight against the global food shortage problem.

Бесплатно

Bragg-fresnel optics and supermirrors

Bragg-fresnel optics and supermirrors

Erko A., Vidal B.

Статья научная

The main principles and some applications of Bragg-Fresnel multilayer optics and X-ray supermirrors are described. An elliptical Bragg-Fresnel multilayer lens (BFML), designed and fabricated in the IMT RAS has been used for 2-dimensional focusing of the white X-ray synchrotron beam. For the beam energy of about 12 KeV the spot size checked with the knife edge method was about 1 mm. Applications of BFML and supermirrors in x-ray imaging are discussed.

Бесплатно

Brain tumor classification using deep convolutional neural networks

Brain tumor classification using deep convolutional neural networks

Nurtay M., Kissina M., Tau A., Akhmetov A., Alina G., Mutovina N.

Статья научная

This study presents a comparative analysis of various convolutional neural network (CNN) models for brain tumor detection on MRI medical images. The primary aim was to assess the effectiveness of different CNN architectures in accurately identifying brain tumors. Multiple models were trained, including a custom-designed CNN with its specific layer architecture, and models based on Transfer Learning utilizing pre-trained neural networks: ResNet-50, VGG-16, and Xception. Performance evaluation of each model in terms of accuracy metrics such as precision, recall, F1-score, and confusion matrix on a test dataset was carried out. The dataset used in this study was obtained from the openly accessible Kaggle competition "Brain Tumor Detection from MRI." This dataset consisted of four classes: glioma, meningioma, no tumor (healthy), and pituitary, ensuring a balanced representation. Testing four models revealed that the custom CNN architecture, utilizing separable convolutions and batch normalization, achieved an average ROC AUC score of 0.99, outperforming the other models. Moreover, this model demonstrated an accuracy of 0.94, indicating its robust performance in brain tumor classification on MRI images.

Бесплатно

Brain tumor segmentation by deep learning transfer methods using MRI images

Brain tumor segmentation by deep learning transfer methods using MRI images

Shchetinin E.Yu.

Статья научная

Brain tumor segmentation is one of the most challenging tasks of medical image analysis. The diagnosis of patients with gliomas is based on the analysis of magnetic resonance images and manual segmentation of tumor boundaries. However, due to its time-consuming nature, there is a need for a fast and reliable automatic segmentation algorithm. In recent years, deep learning methods applied to brain tumor segmentation have shown promising results. In this paper, a deep neural network model based on U-Net neural network architecture is proposed for brain glioma segmentation. It is proposed to use deep convolutional neural network models pre-trained on the ImageNet dataset as U-Net encoders. Among such models, VGG16, VGG19, Mobilenetv2, Inception, Efficientnetb7, InceptionResnetV2, DenseNet201, DenseNet121 were used. The computational experimental analysis performed in the paper on a set of MRI brain images showed that the best encoder model for the above deep models was the DenseNet121 model with the following values of segmentation metrics Mean IoU of 91.34 %, Mean Dice of 94.26 %, Accuracy of 94.22 %. The paper also comparatively analyses the results of the proposed segmentation method with several works of other authors. The comparative analysis of the segmentation results of the studied MRI images showed that the DenseNet121 model either surpassed or was comparable to the models proposed in the refereed papers in terms of segmentation accuracy metrics.

Бесплатно

Building detection by local region features in SAR images

Building detection by local region features in SAR images

Ye Shi Ping, Chen Chao Xiang, Nedzved Alexander, Jiang Jun

Статья научная

The buildings are very complex for detection on SAR images, where the basic features of those are shadows. There are many different representations for SAR shadow. As result it is no possible to use convolutional neural network for building detection directly. In this article we give property analysis of SAR shadows of different type buildings. After that, each region (ROI) prepared for training of building detection is corrected with its own SAR shadow properties. Reconstructions of ROI will be put in a modified YOLO network for building detection with better quality result.

Бесплатно

Bulk cargo volume measurement for moving dump trucks with a single-layer LiDAR and a camera

Bulk cargo volume measurement for moving dump trucks with a single-layer LiDAR and a camera

Bocharov D.A., Kokhan V.V., Konyushenko I.D., Resniansky A.Y., Nikolaev I.P., Nikolaev D.P.

Статья научная

The paper addresses the problem of non–contact bulk cargo volume estimation for moving dump trucks. A common scanning method that lets to evaluate the volume of cargo of complex surface for a moving truck implies two single-layer (2D) Light Detection and Ranging (LiDAR) sensors: one is used to scan a vehicle in a plane perpendicular to its movement and the second – to estimate vehicle displacements and restore scans positions on an axis along vehicle movement direction. While LiDAR sensors provide reliable measurement signals in controlled environments their efficacy drastically decreases under challenging outdoor conditions: sand dust, fog, rain heavy precipitation cause false detections and distort LiDARs signal. Thus, vehicle displacements estimated with a highly corrupted LiDAR signal can not be used for a reliable measurement as they may lead to significant volume calculation errors. Partially this is solved in multi-echo lidar where distorted data could be separated from the relevant. In contrast to the single-echo 2D LiDAR, image data from industrial cameras is less sensitive to sand dust or fog. In the paper we propose a novel bulk cargo estimation method that implies only one 2D LiDAR and for vehicle displacements estimation utilizes a camera and computer vision methods. As we demonstrate on a diverse dataset of 730 pairs of dump truck passes from an operating sand pit, the proposed method is more accurate than the two 2D LiDARs baseline while requiring a significantly cheaper sensor. In case if a camera is already present in the volume measurement system and utilized for loaded material classification then the proposed method lets to reduce the cost of solution by the cost of one lidar.

Бесплатно

Calculation of effective mode field area of photonic crystal fiber with digital image processing algorithm

Calculation of effective mode field area of photonic crystal fiber with digital image processing algorithm

Tan Yili, Wang Honglian, Wang Yourong

Статья научная

Photonic crystal fiber as a new type of optical fiber has been extensively applied because of its unique properties. The effective mode area of optical fiber is an important parameter, which has a great influence on the performance of optical fiber. In this study, digital image processing algo-rithm was used for preprocessing to improve the accuracy of calculation of mode field area. Then the effective mode field area of optical fiber was calculated using Matlab based Gauss fitting method. Take single-mode fiber G.652 as an example, the effective mode field area was calculated using the traditional algorithm and digital image processing algorithm respectively. It was found that the results obtained using digital image processing algorithm were within the allowed error range, suggesting the effectiveness of the algorithm. Then the calculation of the effective mode area of the triangular lattice photonic crystal fiber further verified the reliability of the algorithm.

Бесплатно

Camera parameters estimation from pose detections

Camera parameters estimation from pose detections

Shalimova Ekaterina Alekseevna, Shalnov Evgeny Vadimovich, Konushin Anton Sergeevich

Статья научная

Some computer vision tasks become easier with known camera calibration. We propose a method for camera focal length, location and orientation estimation by observing human poses in the scene. Weak requirements to the observed scene make the method applicable to a wide range of scenarios. Our evaluation shows that even being trained only on synthetic dataset, the proposed method outperforms known solution. Our experiments show that using only human poses as the input also allows the proposed method to calibrate dynamic visual sensors.

Бесплатно

Central Russia heavy metal contamination model based on satellite imagery and machine learning

Central Russia heavy metal contamination model based on satellite imagery and machine learning

Uzhinskiy Alexander Vladimirovich, Vergel Konstantin Nikolaevich

Статья научная

Atmospheric heavy metal contamination is a real threat to human health. In this work, we examined several models trained on in situ data and indices got from satellite images. During 2018-2019, 281 samples of naturally growing mosses were collected in the Vladimir, Yaroslavl, and Moscow regions in Russia. The samples were analyzed using Neutron Activation Analysis to get the contamination levels of 18 heavy metals. The Google Earth Engine platform was used to calculate indices from satellite images that represent summarized information about sampling sites. Statistical and neural models were trained on in situ data and the indices. We focused on the classification task with 8 levels of contamination and used balancing techniques to extend the training data. Three approaches were tested: variations of gradient boosting, multilayer perceptron, and Siamese networks. All these approaches produced results with minute differences, making it difficult to judge which one is better in terms of accuracy and graphical outputs. Promising results were shown for 9 heavy metals with an overall accuracy exceeding 89 %. Al, Fe, and Sb contamination was predicted for 3,000 and 12,100 grid nodes on a 500 km2 area in the Central Russia region for 2019 and 2020. The results, methods, and perspectives of the adopted approach of using satellite data together with machine learning for HM contamination prediction are presented.

Бесплатно

Classification of benign and malignant solid breast lesions on the ultrasound images based on the textural features: the importance of the perifocal lesion area

Classification of benign and malignant solid breast lesions on the ultrasound images based on the textural features: the importance of the perifocal lesion area

Kolchev A.A., Pasynkov D.V., Egoshin I.A., Kliouchkin I.V., Pasynkova O.O.

Статья научная

The amount of ultrasound (US) breast exams continues to grow because of the wider endorsement of breast cancer screening programs. When a solid lesion is found during the US the primary task is to decide if it requires a biopsy. Therefore, our goal was to develop a noninvasive US grayscale image analysis for benign and malignant solid breast lesion differentiation. We used a dataset consisting of 105 ultrasound images with 50 benign and 55 malignant non-cystic lesions. Features were extracted from the source image, the image of the gradient module after applying the Sobel filter, and the image after the Laplace filter. Subsequently, eight gray-level co-occurrence matrices (GLCM) were constructed for each lesion, and 13 Haralick textural features were calculated for each GLCM. Additionally, we computed the differences in feature values at different spatial shifts and the differences in feature values between the inner and outer areas of the lesion. The LASSO method was employed to determine the most significant features for classification. Finally, the lesion classification was carried out by various methods. The use of LASSO regression for feature selection enabled us to identify the most significant features for classification. Out of the 13 features selected by the LASSO method, four described the perilesional tissue, two represented the inner area of the lesion and five described the image of the gradient module. The final model achieved a sensitivity of 98%, specificity of 96%, and accuracy of 97%. Considering the perilesional area, Haralick feature differences, and the image of the gradient module can provide crucial parameters for accurate classification of US images. Features with a low AUC index (less than 0.6 in our case) can also be important for improving the quality of classification.

Бесплатно

Coherent field phase retrieval using a phase Zernike filter

Coherent field phase retrieval using a phase Zernike filter

Kotlyar Victor Victorovich, Khonina Svetlana Nikolaevna, Soifer Victor Alexandrovich, Wang Yangtiang, Zhao Datzu

Статья научная

Aberrations of the coherent wavefront are analyzed using a phase Zernike filter. Developed iterative methods allow us to design a filter that decomposes the analyzed light field into a set of diffraction orders with amplitudes proportional to the circular Zernike polynomials. We also apply the algorithm to the calculation of the light field phase from measurements of the modules of decomposition coefficients. Operation of a 25-channel filter is simulated.

Бесплатно

Comparative Performance Evaluation of Classical Methods and a Deep Learning Approach for Temperature Prediction in Fiber Optic Specklegram Sensors

Comparative Performance Evaluation of Classical Methods and a Deep Learning Approach for Temperature Prediction in Fiber Optic Specklegram Sensors

Vélez F.J., Arango J.D., Aristizábal V.H., Trujillo C.A., Herrera-Ramírez J.

Статья научная

In this study, an algorithm based on convolutional neural networks is employed as an interrogation method for a fiber specklegram sensor. This algorithm is compared with conventional interrogation methods, including correlation between images, measurement of optical power, and radial moments. Fiber specklegram sensors have room for improvement as conventional methods only consider a single characteristic of the specklegram for variable prediction, thus failing to leverage the full spectrum of information within the specklegram. Consequently, the approach put forth here introduces a convolutional neural network for the extraction of specklegram features, accompanied by an artificial neural network for variable regression. The specklegrams used in this investigation are obtained through simulating temperature disturbances in a multimode fiber using the Finite Elements Method. The results reveal prediction RMSE errors ranging from 10.26°C for the first radial moment to 1.42°C for the proposed algorithm. These findings underscore the effectiveness of the proposed strategy in enhancing sensor performance and robustness, all while upholding their cost-efficiency.

Бесплатно

Comparative analysis of neural network models performance on low-power devices for a real-time object detection task

Comparative analysis of neural network models performance on low-power devices for a real-time object detection task

Zagitov Artur, Chebotareva Elvira, Toschev Alexander, Magid Evgeni

Статья научная

A computer vision based real-time object detection on low-power devices is economically attractive, yet a technically challenging task. The paper presents results of benchmarks on popular deep neural network models, which are often used for this task. The results of experiments provide insights into trade-offs between accuracy, speed, and computational efficiency of MobileNetV2 SSD, CenterNet MobileNetV2 FPN, EfficientDet, YoloV5, YoloV7, YoloV7 Tiny and YoloV8 neural network models on Raspberry Pi 4B, Raspberry Pi 3B and NVIDIA Jetson Nano with TensorFlow Lite. We fine-tuned the models on our custom dataset prior to benchmarking and used post-training quantization (PTQ) and quantization-aware training (QAT) to optimize the models’ size and speed. The experiments demonstrated that an appropriate algorithm selection depends on task requirements. We recommend EfficientDet Lite 512×512 quantized or YoloV7 Tiny for tasks that require around 2 FPS, EfficientDet Lite 320×320 quantized or SSD Mobilenet V2 320×320 for tasks with over 10 FPS, and EfficientDet Lite 320×320 or YoloV5 320×320 with QAT for tasks with intermediate FPS requirements.

Бесплатно

Comparative analysis of reflection symmetry detection methods in binary raster images with skeletal and contour representations

Comparative analysis of reflection symmetry detection methods in binary raster images with skeletal and contour representations

Seredin Oleg Sergeevich, Kushnir Olesia Aleksandrovna, Fedotova Sofia Antonovna

Статья научная

The study is a comparative analysis of two fast reflection symmetry axis detection methods: an algorithm to refine the symmetry axis found with a chain of skeletal primitives and a boundary method based on the Fourier descriptor. We tested the algorithms with binary raster images of plant leaves (FLAVIA database). The symmetry axis detection quality and performance indicate that both methods can be used to solve applied problems. Neither method demonstrated any significant advantage in terms of accuracy or performance. It is advisable to integrate both methods for solving real-life problems.

Бесплатно

Comparison of convolutional networks and transformers for generating 3D-models via binary space partitioning from a single object image

Comparison of convolutional networks and transformers for generating 3D-models via binary space partitioning from a single object image

Gribanov D.N., Kilbas I.A., Mukhin A.V., Paringer R.A., Kupriyanov A.V.

Статья научная

This study explores the use of transformer architecture as an image encoder model for the task of 3D mesh generation from a single image. Traditionally, models based on autoencoder architecture perform such tasks, where an encoder produces a latent representation that a decoder subsequently converts into a 3D model. When processing image-based input, the ResNet18 convolutional network is a commonly used encoder. In this paper, we investigate replacing the convolutional network with a transformer-based approach while using binary space partitioning (BSP) for 3D object generation. Our experiments demonstrate that a transformer-based architecture, specifically the Compact Convolutional Transformer (CCT), can achieve performance comparable to its convolutional counterpart and exceed it both in quantitative metrics and visual quality. The best CCT-based model achieves a Chamfer Distance (CD) of 1.59 and a Light Field Distance (LFD) of 3907, whereas the convolutional variant attains a CD of 1.64 and an LFD of 3981. The CCT-based model also demonstrates superior 3D reconstruction quality on test samples. Additionally, the transformer model requires four times fewer parameters to achieve these results, though computational resources are two times higher in terms of Multiply-Accumulate operations (MACs). These findings indicate that the transformer-based model is more parameter-efficient and can achieve superior results compared to traditional convolutional networks in single-view reconstruction tasks.

Бесплатно

Comparison of hyperspectral and multi-spectral imagery to building a spectral library and land cover classification performance

Comparison of hyperspectral and multi-spectral imagery to building a spectral library and land cover classification performance

Boori Mukesh Singh, Paringer Rustam Aleksandrovich, Choudhary Komal, Kupriyanov Alexander Victorovich

Статья научная

The main aim of this research work is to compare k-nearest neighbor algorithm(KNN)super-vised classification with migrating means clustering unsupervised classification (MMC) method on the performance of hyperspectral and multispectral data for spectral land cover classes and de-velop their spectral library in Samara, Russia. Accuracy assessment of the derived thematic maps was based on the analysis of the classification confusion matrix statistics computed for each classi-fied map, using for consistency the same set of validation points. We were analyzed and compared Earth Observing-1 (EO-1) Hyperion hyperspectral data to Landsat 8 Operational Land Imager (OLI) and Advance Land Imager (ALI) multispectral data. Hyperspectral imagers, currently avail-able on airborne platforms, provide increased spectral resolution over existing space based sensors that can document detailed information on the distribution of land cover classes, sometimes spe-cies level. Results indicate that KNN (95, 94, 88 overall accuracy and .91, .89, .85 kappa coeffi-cient for Hyp, ALI, OLI respectively) shows better results than unsupervised classification (93, 90, 84 overall accuracy and .89, .87, .81 kappa coefficient for Hyp, ALI, OLI respectively). Develop-ment of spectral library for land cover classes is a key component needed to facilitate advance ana-lytical techniques to monitor land cover changes. Different land cover classes in Samara were sampled to create a common spectral library for mapping landscape from remotely sensed data. The development of these libraries provides a physical basis for interpretation that is less subject to conditions of specific data sets, to facilitate a global approach to the application of hyperspectral imagers to mapping landscape. In addition, it is demonstrated that the hyperspectral satellite image provides more accurate classification results than those extracted from the multispectral satellite image. The higher classification accuracy by KNN supervised was attributed principally to the ability of this classifier to identify optimal separating classes with low generalization error, thus producing the best possible classes’ separation.

Бесплатно

Computational and experimental studies on SnO2 thin films at various temperatures

Computational and experimental studies on SnO2 thin films at various temperatures

Gurushankar K., Grishina M., Gohulkumar M., Kannan Karthik

Статья научная

Tin oxide (SnO2) thin films was prepared by dip-coating technique at various bath temperatures (313, 333, 353 and 373 K) and annealed at 673 K in this study. And the obtained results were studied and correlated with the computational method. Scanning electron microscopy (SEM) investigation demonstrated that the prepared samples are spherical with agglomeration. The elemental analysis (EDAX) confirms the presence of Sn and O. Further, the SnO2 thin films microstructures are simulated, their thermodynamic and surface properties have been calculated. Micro-Raman spectra were recorded for the prepared samples. Micro-Raman results exhibit the first-order Raman mode E1g (475 cm-1) indicating that the grown SnO2 belongs to the rutile structure. In addition, the envelope method used for studying optical characteristics of the thin films from the transmittance spectra. The semiconducting nature of the films has been noticed from linear I-V characteristics. Furthermore, the electrical conductivity studies suggest that the highest conductivity samples acquire the lowest activation energy and their values are also in the semiconducting range.

Бесплатно

Computer controlling of writing beam in laser microfabrication of diffractive optics

Computer controlling of writing beam in laser microfabrication of diffractive optics

Korolkov V., Shimansky R., Cherkashin V., Denk D.

Статья научная

Laser microfabrication of diffractive optics with continuous relief is based on the direct local action of focused laser radiation on the recording material. Control of writing beam parameters (beam power, spot size, waist position) is one of the main tasks in microfabrication using laser writing systems. Method of the control defines the correspondence between the fabricated microrelief of the diffractive optical element and a designed one. Complexity of this task consists in the necessity to take into account a wide range of factors: laser irradiation noises, non-linear characteristic curve of recording material, finiteness of spot size, influence of power modulation and surrounding on beam energy absorption, influence of beam waist position according to recording layer, dependence of characteristic curve of recording material on beam scanning speed, etc. In the present paper we consider a number of methods for computer controlling of writing beam making it possible to compensate or reduce the influence of these factors and improve the quality of DOE microfabrication. The results of experimental application of the developed methods to circular laser writing systems are discussed.

Бесплатно

Журнал