Статьи журнала - Компьютерная оптика

Все статьи: 2511

An efficient algorithm for non-rigid object registration

An efficient algorithm for non-rigid object registration

Makovetskii Artyom Yurievch, Voronin Sergei Mikhailovich, Kober Vitalii Ivanovich, Voronin Aleksei Vyacheslavovich

Статья научная

An efficient algorithm for registration of two non-rigid objects based on geometrical transformation of the template object to target object is proposed. The transformation is considered as warping of the template onto the target. To choose the most suitable transformation from all possible warps, a registration algorithm should satisfy deformation constraints referred to as regularization of non-rigid objects. In this work, we use variational functionals for affine transformations. With the help of computer simulation, the proposed method for searching the optimal geometrical transformation is compared with that of common algorithms.

Бесплатно

An efficient algorithm for overlapping bubbles segmentation

An efficient algorithm for overlapping bubbles segmentation

Bettaieb Afef, Filali Nabila, Filali Taoufik, Ben Aissia Habib

Статья научная

Image processing is an effective method for characterizing various two-phase gas/liquid flow systems. However, bubbly flows at a high void fraction impose significant challenges such as diverse bubble shapes and sizes, large overlapping bubble clusters occurrence, as well as out-of-focus bubbles. This study describes an efficient multi-level image processing algorithm for highly overlapping bubbles recognition. The proposed approach performs mainly in three steps: overlapping bubbles classification, contour segmentation and arcs grouping for bubble reconstruction. In the first step, we classify bubbles in the image into a solitary bubble and overlapping bubbles. The purpose of the second step is overlapping bubbles segmentation. This step is performed in two subsequent steps: at first, we classify bubble clusters into touching and communicating bubbles. Then, the boundaries of communicating bubbles are split into segments based on concave point extraction. The last step in our algorithm addresses segments grouping to merge all contour segments that belong to the same bubble and circle/ellipse fitting to reconstruct the missing part of each bubble. An application of the proposed technique to computer generated and high-speed real air bubble images is used to assess our algorithm. The developed method provides an accurate and computationally effective way for overlapping bubbles segmentation. The accuracy rate of well segmented bubbles we achieved is greater than 90 % in all cases. Moreover, a computation time equal to 12 seconds for a typical image (1 Mpx, 150 overlapping bubbles) is reached.

Бесплатно

An efficient block-based algorithm for hair removal in dermoscopic images

An efficient block-based algorithm for hair removal in dermoscopic images

Zaqout Ihab Salah

Статья научная

Hair occlusion in dermoscopy images affects the diagnostic operation of the skin lesion. Segmentation and classification of skin lesions are two major steps of the diagnostic operation required by dermatologists. We propose a new algorithm for hair removal in dermoscopy images that includes two main stages: hair detection and inpainting. In hair detection, a morphological bottom-hat operation is implemented on Y-channel image of YIQ color space followed by a binarization operation. In inpainting, the repaired Y-channel is partitioned into 256 non-overlapped blocks and for each block, white pixels are replaced by locating the highest peak, using a histogram function and a morphological close operation. The proposed algorithm reports a true positive rate (sensitivity) of 97.36 %, a false positive rate (fall-out) of 4.25 %, and a true negative rate (specificity) of 95.75 %. The diagnostic accuracy achieved is recorded at a high level of 95.78 %.

Бесплатно

An improved gray-scale transformation method for pseudo-color image enhancement

An improved gray-scale transformation method for pseudo-color image enhancement

Gao Haibo, Zeng Wenjuan, Chen Jifeng

Статья научная

Image enhancement is a very important process of image preprocessing and it plays a critical role in the improvement of image quality and the follow-up image analysis, which makes the research of image enhancement algorithm a hot research field. Image enhancement not only needs to strengthen image determination and recognition, but also needs to avoid the consequential color distortion. Pseudo-color enhancement is the technique to map different gray scales of a black-and-white image into a color image. As humans have extremely strong ability in distinguishing different colors visually and relatively weak capacity in discriminating gray scales, so, color the gray-scale changes which cannot be differentiated by human eyes so that they can tell them apart. The mapping function in conventional gray-scale transform method is not working well in dark and low-contrast images. So, this paper comes up with an improved gray-scale transformation algorithm. This algorithm can achieve the enhancement, preserve the image colors, process dark and low-contrast images, reinforce the enhancement and improve the blocking effect. The experiment proves that the enhanced image obtained by the method of this paper can have improved average brightness, natural colors and more detail information and it has good application value.

Бесплатно

An improved video watermarking algorithm with extraction using a mobile device camera

An improved video watermarking algorithm with extraction using a mobile device camera

Evsutin O.O., Melman A.S., Podbolotov D.I., Stankevich A.G.

Статья научная

The use of screen-capture-resistant digital watermarks is a promising way to store information invisibly in a video stream for later retrieval by the user using a smartphone camera. However, the development of algorithms that implement this scenario is associated with the problem of balancing between the imperceptibility of embedding and robustness. A serious problem is the extraction of watermarks using a mobile device. Most people use the vertical positioning of the smartphone when shooting, which excludes the possibility of only marked video sequences entering the frame. The extraction algorithm first finds the screen area in the image and then extracts the watermark under various distortion conditions. This study proposes an approach to improve the efficiency of the algorithm for embedding digital watermarks into video data based on rectangular patterns, which provides resistance to screen shooting. The proposed approach to increasing the embedding imperceptibility provided an increase in the PSNR and SSIM values by 17.18 % and 7.90 %, respectively. The use of a neural network at the extraction stage reduced the BER value by 64.64 %.

Бесплатно

Analysis of a robust edge detection system in different color spaces using color and depth images

Analysis of a robust edge detection system in different color spaces using color and depth images

Mousavi Seyed Muhammad Hossein, Lyashenko Vyacheslav, Prasath Surya

Статья научная

Edge detection is very important technique to reveal significant areas in the digital image, which could aids the feature extraction techniques. In fact it is possible to remove un-necessary parts from image, using edge detection. A lot of edge detection techniques has been made already, but we propose a robust evolutionary based system to extract the vital parts of the image. System is based on a lot of pre and post-processing techniques such as filters and morphological operations, and applying modified Ant Colony Optimization edge detection method to the image. The main goal is to test the system on different color spaces, and calculate the system’s performance. Another novel aspect of the research is using depth images along with color ones, which depth data is acquired by Kinect V.2 in validation part, to understand edge detection concept better in depth data. System is going to be tested with 10 benchmark test images for color and 5 images for depth format, and validate using 7 Image Quality Assessment factors such as Peak Signal-to-Noise Ratio, Mean Squared Error, Structural Similarity and more (mostly related to edges) for prove, in different color spaces and compared with other famous edge detection methods in same condition...

Бесплатно

Analysis of logistics distribution path optimization planning based on traffic network data

Analysis of logistics distribution path optimization planning based on traffic network data

H.H. Li, H.R. Fu, W.H. Li

Статья

With the development of economy, the distribution problem of logistics becomes more and more complex. Based on the traffic network data, this study analyzed the vehicle routing problem (VRP), designed a dynamic vehicle routing problem with time window (DVRPTW) model, and solved it with genetic algorithm (GA). In order to improve the performance of the algorithm, the genetic operation was improved, and the output solution was further optimized by hill climbing algorithm. The analysis of example showed that the improved GA algorithm had better performance in path optimization planning, the total cost of planning results was 31.44 % less than that of GA algorithm, and the total cost of planning results increased by 11.48 % considering the traffic network data. The experimental results show that the improved GA algorithm has good performance and can significantly reduce the cost of distribution and that research on VRP based on the traffic network data is more in line with the actual situation of logistics distribution, which is conducive to the further application of the improved GA algorithm in VRP.

Бесплатно

Angular dependence of diffraction efficiency of a dynamic hologram in a reversible photochromatic medium

Angular dependence of diffraction efficiency of a dynamic hologram in a reversible photochromatic medium

Ivakhnik V.V., Nikonov V.I., Shilnikova E.V.

Статья научная

Angular characteristics of a dynamic hologram, recorded by a "fan" of plane waves in a reversible photochromatic medium, are studied under various correlations between the intensities of these waves. It is shown that the maximum increase of the relative diffraction efficiency with the increase of the angle between the reference and object waves is observed for thin holograms. There is such a PCM thickness beginning from which the relative dependence of the hologram diffraction efficiency upon the angle ceases to depend upon the PCM thickness, reaching some maximum distribution. Coincidence is shown of the angular characteristics of the holograms, recorded under the condition that the reference wave intensity is much greater than the object wave intensity, and of the holograms, recorded by a "fan" of plane waves of equal intensity. The hologram diffraction efficiency change is studied under the readout wave deviation from the Bragg angle. The reversibility of photochemical transformations in a number of photochromatic media (PCM) allows them to be used as non-linear media for a dynamic hologram recording [1-4]. In the works, accomplished up to now, on studying the dynamic hologram diffraction efficiency (HDE) in reversible PCM it has been considered that hologram recording is carried out by two plane waves, the surfaces of equal intensity of the recorded interference grating being perpendicular to the photochromatic layer faces [5-6]. In the present work the angular HDE dependence is investigated, knowing which, one can estimate the possibility of recording in such media the holograms of complex spatially modulated fields.

Бесплатно

Application of the fruit fly optimization algorithm to an optimized neural network model in radar target recognition

Application of the fruit fly optimization algorithm to an optimized neural network model in radar target recognition

Liu Min, Sun Zhihong

Статья научная

With the development of computer technology, there are more and more algorithms and models for data processing and analysis, which brings a new direction to radar target recognition. This study mainly analyzed the recognition of high resolution range profile (HRRP) in radar target recognition and applied the generalized regression neural network (GRNN) model for HRRP recognition. In order to improve the performance of HRRP, the fruit fly optimization algorithm (FOA) algorithm was improved to optimize the parameters of the GRNN model. Simulation experiments were carried out on three types of aircraft. The improved FOA-GRNN (IFOA-GRNN) model was compared with the radial basis function (RBF) and GRNN models. The results showed that the IFOA-GRNN model had a better convergence accuracy, the highest average recognition rate (96.4 %), the shortest average calculation time (275 s), and a good recognition rate under noise interference. The experimental results show that the IFOA-GRNN model has a good performance in radar target recognition and can be further promoted and applied in practice.

Бесплатно

Application of unmanned aerial vehicle remote sensing technology in hydrological monitoring for water conservancy projects

Application of unmanned aerial vehicle remote sensing technology in hydrological monitoring for water conservancy projects

Cheng S.N.

Статья научная

The article provides a brief introduction to hydrological monitoring and the methods of water body identification and flow velocity estimation based on unmanned aerial vehicle (UAV) remote sensing technology. Then, a case study of hydrological monitoring using UAV remote sensing technology was conducted on Gangnan Reservoir in Hebei Province. The effectiveness of the convolutional neural network (CNN) algorithm was verified, and measurements were made on the reservoir’s water area and average flow velocity in both flood and non-flood periods during 2020, 2021, and 2022. It was found that the CNN algorithm effectively identified water areas in UAV remote sensing images. Compared to non-flood periods, there was a significant increase in the water area of the reservoir during flood periods, as well as a noticeable increase in average flow velocity upstream; however, there was no significant change in average flow velocity downstream of the reservoir.

Бесплатно

Applied aspects of modern non-blind image deconvolution methods

Applied aspects of modern non-blind image deconvolution methods

Chaganova O.B., Grigoryev A.S., Nikolaev D.P., Nikolaev I.P.

Статья научная

The focus of this paper is the study of modern non-blind image deconvolution methods and their application to practical tasks. The aim of the study is to determine the current state-of-the-art in non-blind image deconvolution and to identify the limitations of current approaches, with a focus on practical application details. The paper proposes approaches to examine the influence of various effects on the quality of restoration, the robustness of models to errors in blur kernel estimation, and the violation of the commonly assumed uniform blur model. We developed a benchmark for validating non-blind deconvolution methods, which includes datasets of ground truth images and blur kernels, as well as a test scheme for assessing restoration quality and error robustness. Our experimental results show that those neural network models lacking any pre-optimization, such as quantization or knowledge distillation, fall short of classical methods in several key properties, such as inference speed or the ability to handle different types of blur. Nevertheless, neural network models have made notable progress in their robustness to noise and distortions. Based on the results of the study, we provided recommendations for more effective use of modern non-blind image deconvolution methods. We also developed suggestions for improving the robustness, versatility and performance quality of the models by incorporating additional practices into the training pipeline.

Бесплатно

Applying neural networks in coordinated group signal transformation to improve image quality

Applying neural networks in coordinated group signal transformation to improve image quality

Lopukhova E.A., Voronkov G.S., Kuznetsov I.V., Ivanov V.V., Kutluyarov R.V., Sultanov A.Kh., Grakhova E.P.

Статья научная

The rapid development of the Internet of Things and wireless sensor networks combined with the introduction of image analysis systems and computer vision technologies has led to the emergence of a new class of systems – multimedia Internet of Things and multimedia wireless sensing networks. The combination of the specifics of the Internet of Things, which requires simultaneous and long-term operation of a large number of autonomous devices, with the need to transmit video data poses the problem of creating new energy-efficient methods of image compression. The paper considers applying coordinated group signal transformation as such an algorithm, which performs compression based on the input signals’ correlation. Correlation between the image color channels makes this possible. However, it was necessary to supplement this method by clustering the original images using machine learning methods for better image reconstruction at reception. The criterion for clustering was the change of image gradient. The use of radial neural network in the clustering algorithm increased the speed of the proposed method. The resulting algorithm provides at least fourfold image compression with high-quality image restoration. Moreover, for multimedia Internet of Things systems, in which quality losses are acceptable, it is possible to provide large compression ratios without increasing computational complexity, i.e., without increasing power consumption.

Бесплатно

Arrhythmia detection using resampling and deep learning methods on unbalanced data

Arrhythmia detection using resampling and deep learning methods on unbalanced data

Shchetinin Eugene Yurievich, Glushkova Anastasia Gennadievna

Статья научная

Due to cardiovascular diseases millions of people die around the world. One way to detect abnormality in the heart condition is with the help of electrocardiogram signal (ECG) analysis. This paper’s goal is to use machine learning and deep learning methods such as Support Vector Machines (SVM), Random Forests, Light Gradient Boosting Machine (LightGBM), Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) and Bidirectional Long Short-Term Memory (BLSTM) to classify arrhythmias, where particular interest represent the rare cases of disease. In order to deal with the problem of imbalance in the dataset we used resampling methods such as SMOTE Tomek-Links and SMOTE ENN to improve the representation ration of the minority classes. Although the machine learning models did not improve a lot when trained on the resampled dataset, the deep learning models showed more impressive results. In particular, LSTM model fitted on dataset resampled using SMOTE ENN method provides the most optimal precision-recall trade-off for the minority classes Supraventricular beat and Fusion of ventricular and normal beat, with recall of 83 % and 88 % and precision of 74 % and 66 % for the two classes respectively, whereas the macro-weighted recall is 92 % and precision is 82 %.

Бесплатно

Asymmetric apodization for the comma aberrated point spread function

Asymmetric apodization for the comma aberrated point spread function

Reddy Andra Naresh Kumar, Sagar Dasari Karuna, Khonina Svetlana Nikolaevna

Статья научная

This paper deals with the study of light flux distributions in the point spread function formed by an optical system with a one-dimensional aperture under the influence of the coma aberration. The traditional design of an asymmetric optical filter improves the resolution of a diffraction-limited optical imaging system. In this approach we explore the control of monochromatic aberrations through pupil engineering with asymmetric apodization. This technique employs the amplitude and phase apodization for the mitigation of the effects of third-order aberrations on the diffracted image. On introducing the coma wave aberration effect, the central peak intensity in the field of diffraction is a function of the edge strips width and the amplitude apodization parameter of a one-dimensional pupil filter, whereas the magnitude of the reduction of optical side-lobes is a function of the degree of phase apodization at the periphery of the aperture. The analytically computed results are illustrated graphically in terms of point spread function curves under various considerations of the coma aberrations and a different degree of amplitude and phase apodization. Hence, for the optimum values of apodization, the axial resolution has been analyzed using well-defined quality criteria.

Бесплатно

Automatic 4-mirrors system for alignment of high-power laser radiation

Automatic 4-mirrors system for alignment of high-power laser radiation

Toporovsky V.V., Alexandrov A.G., Galaktionov I.V., Rukosuev A.L., Kudryashov A.V.

Статья научная

This paper presents the automated system for minimizing the deviation of the path of passage and the divergence of a secondary radiation source with parameters similar to ones of the main beam of a high-power Ti:Sa laser using mirrors in kinematic mounts on the motorized stages. As an alignment laser, the diode laser with a fiber output was used with radiation characteristics coinciding with the parameters of the main beam (wavelength, beam diameter). The successive approximation algorithm was used to minimize the beam deflection. The positioning accuracy and beam size matching were analyzed on the near-field camera and were equaled to 28.6 µm along the X-axis and 26.4 µm along the Y-axis. Beam size mismatch was equaled to 0.151 mm. The pointing accuracy was analyzed on the far-field sensor and equaled 15.34 µrad along the X axis and 12.03 µrad along the Y axis. The curvature of the wavefront was 0.06 µm.

Бесплатно

Automatic mosaic method of remote sensing images based on machine vision

Automatic mosaic method of remote sensing images based on machine vision

Gao S.P., Xia M., Zhang S.J.

Статья научная

Unmanned Aerial Vehicle (UAV) remote sensing is a commonly used technical means in modern science and technology, but currently, remote sensing images captured by UAVs need to be spliced to obtain more comprehensive information. However, current image stitching techniques generally have shortcomings such as a small number of extracted features, low matching accuracy, and poor stability. To address the above issues, this study proposes an improved remote sensing image mosaic model on the bias of the Scale Invariant Feature Transform (SIFT) algorithm. Firstly, in this study, aiming at the problem that traditional SIFT cannot meet the requirements of feature extraction and matching for unconventional remote sensing images and special texture images, normalized cross correlation (NCC) and Forstner operator are introduced to optimize it, namely, a SIFT-NCC model is constructed. On this basis, for remote sensing images with high resolution and a wide range, this study further proposes a remote sensing image automatic mosaic model that combines point features and line features. That is, a linear segment detector (LSD) is introduced to extract the line features of remote sensing images. The experimental verification results of the final SIFT-NCC-LSD show that the matching accuracy for remote sensing images with different characteristics can reach over 95 %. Therefore, SIFT-NCC-LSD has good applicability.

Бесплатно

Automatic target recognition algorithm for low-count terahertz images

Automatic target recognition algorithm for low-count terahertz images

Antsiperov Viacheslav Evgenievich

Статья научная

The paper presents the results of developing an algorithm for automatic target recognition in broadband (0.1-10) terahertz images. Due to the physical properties of terahertz radiation and associated hardware, such images have low contrast, low signal-to-noise ratio and low resolution - i.e. all the characteristics of a low-count images. Therefore, standard recognition algorithms designed for conventional images work poorly or are not suitable at all for the problem considered. We have developed a fundamentally different approach based on clustering 2D point clouds in accordance with a set of predefined patterns. As a result, we reduce the problem of target recognition to the problem of maximizing the image data likelihood with respect to the classes of model objects up to the size and position. The resulting recognition algorithm has a structure close to that of the well-known EM algorithm; its formal scheme is at the end of the paper.

Бесплатно

Benign and malignant breast tumors classification based on texture analysis and backpropagation neural network

Benign and malignant breast tumors classification based on texture analysis and backpropagation neural network

Wisudawati Lulu Mawaddah, Madenda Sarifuddin, Wibowo Eri Prasetyo, Abdullah Arman Adel

Статья научная

Breast cancer is a leading cause of death in women due to cancer. According to WHO, in 2018, it is estimated that 627.000 women died from breast cancer, that is approximately 15 % of all cancer deaths among women [3]. Early detection is a very important factor to reduce mortality by 25 - 30 %. Mammography is the most commonly used technique in detecting breast cancer using a low-dose X-ray system in the examination of breast tissue that can reduce false positives. A Computer-Aided Detection (CAD) system has been developed to effectively assist radiologists in detecting masses on mammograms that indicate the presence of breast tumors. The type of abnormality in mammogram images can be seen from the presence of microcalcifications and the presence of mass lesions. In this research, a new approach was developed to improve the performance of CAD System for classifying benign and malignant tumors. Areas suspected of being masses (RoI) in mammogram images were detected using an adaptive thresholding method and mathematical morphological operations. Wavelet decomposition is performed on the Region of Interest (RoI) and the feature extraction process is performed using a GLCM method with 4 statistical features, namely, contrast, correlation, entropy, and homogeneity. Classification of benign and malignant tumors using the MIAS database provided an accuracy of 95.83 % with a sensitivity of 95.23 % and a specificity of 96.49 %. A comparison with other methods illustrates that the proposed method provides better performance.

Бесплатно

Bidirectional Encoder representation from Image Transformers for recognizing sunflower diseases from photographs

Bidirectional Encoder representation from Image Transformers for recognizing sunflower diseases from photographs

Baboshina V.A., Lyakhov P.A., Lyakhova U.A., Pismennyy V.A.

Статья научная

This paper proposes a modern system for recognizing sunflower diseases based on Bidirectional Encoder representation from Image Transformers (BEIT). The proposed system is capable of recognizing various sunflower diseases with high accuracy. The presented research results demonstrate the advantages of the proposed system compared to known methods and contemporary neural networks. The proposed visual diagnostic system for sunflower diseases achieved 99.57 % accuracy on the sunflower disease dataset, which is higher than that of known methods. The approach described in the work can serve as an auxiliary tool for farmers, assisting them in promptly identifying diseases and pests and taking timely measures to treat plants. This, in turn, helps in preserving and enhancing the yield. This work can have a significant impact on the development of agriculture and the fight against the global food shortage problem.

Бесплатно

Bragg-fresnel optics and supermirrors

Bragg-fresnel optics and supermirrors

Erko A., Vidal B.

Статья научная

The main principles and some applications of Bragg-Fresnel multilayer optics and X-ray supermirrors are described. An elliptical Bragg-Fresnel multilayer lens (BFML), designed and fabricated in the IMT RAS has been used for 2-dimensional focusing of the white X-ray synchrotron beam. For the beam energy of about 12 KeV the spot size checked with the knife edge method was about 1 mm. Applications of BFML and supermirrors in x-ray imaging are discussed.

Бесплатно

Журнал