Обработка изображений, распознавание образов. Рубрика в журнале - Компьютерная оптика

Публикации в рубрике (330): Обработка изображений, распознавание образов
все рубрики
Convolutional Neural Network-Based Low Light Image Enhancement Method

Convolutional Neural Network-Based Low Light Image Enhancement Method

Guo J.

Статья научная

Low-light image augmentation has become increasingly important with the advancement of computer vision technologies in a variety of application settings. However, noise and contrast reduction frequently have an impact on image quality in low-light situations. In this paper, a convolutional neural network-based technique for low-light picture augmentation is put forth. The stability of local binary features under variations in illumination is the study’s initial method of providing directional advice for the enhancement algorithm. Second, the addition of a channel attentiveness mechanism improves the network’s capacity to acquire low-light image features. The proposed model of the study performed better on average in the two dataset tests when compared to the contrast-constrained adaptive histogram equalization algorithm and the bilateral filtering algorithm. Additionally, the recall and DICE coefficient performed better in the tests as well, improving by 16.24 % and 4.98 %, respectively. The proposed method outperformed all others in the picture enhancement studies, according to the experimental findings, proving the validity of this study. The purpose of the study is to offer a reference framework for low-light image enhancing techniques.

Бесплатно

Copy move forgery detection using key point localized super pixel based on texture features

Copy move forgery detection using key point localized super pixel based on texture features

Rajalakshmi C., Alex Dr. M. germanux, Balasubramanian Dr. R.

Статья научная

The most important barrier in the image forensic is to ensue a forgery detection method such can detect the copied region which sustains rotation, scaling reflection, compressing or all. Traditional SIFT method is not good enough to yield good result. Matching accuracy is not good. In order to improve the accuracy in copy move forgery detection, this paper suggests a forgery detection method especially for copy move attack using Key Point Localized Super Pixel (KLSP). The proposed approach harmonizes both Super Pixel Segmentation using Lazy Random Walk (LRW) and Scale Invariant Feature Transform (SIFT) based key point extraction. The experimental result indicates the proposed KLSP approach achieves better performance than the previous well known approaches.

Бесплатно

Crop growth monitoring through Sentinel and Landsat data based NDVI time-series

Crop growth monitoring through Sentinel and Landsat data based NDVI time-series

Boori Mukesh Singh, Choudhary Komal, Kupriyanov Alexander Victorovich

Статья научная

Crop growth monitoring is an important phenomenon for agriculture classification, yield estimation, agriculture field management, improve productivity, irrigation, fertilizer management, sustainable agricultural development, food security and to understand how environment and climate change effect on crops especially in Russia as it has a large and diverse agricultural production. In this study, we assimilated monthly crop phenology from January to December 2018 by using the NDVI time series derived from moderate to high Spatio-temporal resolution Sentinel and Landsat data in cropland field at Samara airport area, Russia. The results support the potential of Sentinel and Landsat data derived NDVI time series for accurate crop phenological monitoring with all crop growth stages such as active tillering, jointing, maturity and harvesting according to crop calendar with reasonable thematic accuracy. This satellite data generated NDVI based work has great potential to provide valuable support for assessing crop growth status and the above-mentioned objectives with sustainable agriculture development.

Бесплатно

Cross-layer optimization technology for wireless network multimedia video

Cross-layer optimization technology for wireless network multimedia video

Xia Wei

Статья научная

With the development of communication technology, wireless Internet has become more and more popular. The traditional network layered protocols cannot meet the increasingly rich network services, especially video. This paper briefly introduced the cross-layer transmission of video in wireless network and the cross-layer optimization algorithm used for improving video transmission quality and improved the traditional cross-layer algorithm. Then, the two cross-layer algorithms were simulated and analyzed on MATLAB software. The results showed that the packet delivery rate, peak signal to noise ratio and downlink throughput of the improved cross-layer algorithm were significantly higher than those of the traditional cross-layer algorithm under the same signal to interference plus noise ratio of receiving users in wireless network; meanwhile, with the increase of signal to interference plus noise ratio of the receiving user, the packet delivery rate and peak signal to noise ratio of the two algorithms increased, and tended to be stable after some signal to interference plus noise ratio, while the throughput of the two algorithms increased linearly. In the established real wireless network, the package delivery rate, peak signal to noise ratio and throughput of video after application of cross-layer algorithm were significantly improved, and the wireless network applying the improved cross-layer algorithm improved more. In summary, compared with the traditional cross-layer algorithm, the improved cross-layer algorithm can better improve the transmission quality of video in wireless network.

Бесплатно

Decision-making support system for the personalization of retinal laser treatment in diabetic retinopathy

Decision-making support system for the personalization of retinal laser treatment in diabetic retinopathy

Ilyasova Nataly Yurievna, Kirsh Dmitriy Victorovich, Demin Nikita Sergeevich

Статья научная

The purpose of research to create automated personalization of diabetic macular edema laser treatment. The results are based on analysis of large semi-structured data, methods and algorithms for fundus image processing. The technology improves the quality of retina laser coagulation in the treatment of diabetic macular edema, which is one of the main reasons for pronounced vision decrease. The proposed technology includes original solutions to establish an optimal localization of multitude burns by determining zones exposed to laser. It also includes the recognition of large amount of unstructured data on the anatomical and pathological locations' structures in the area of edema and data optical coherent tomography. As a result, a uniform laser application on the pigment epithelium of the affected retina is ensured. It will increase the treatment safety and its effectiveness, thus avoiding the use of more expensive treatment methods. Assessment of retinal lesions volume and quality will allow predicting the laser photocoagulation results and will contribute to the improvement of laser surgeon's skills. The architecture of a software complex comprises a number of modules, including image processing methods, algorithms for photocoagulation pattern mapping, and intelligent analysis methods.

Бесплатно

Deep-learning feature extraction with their subsequent selection and support vector machine classification of the breast ultrasound images

Deep-learning feature extraction with their subsequent selection and support vector machine classification of the breast ultrasound images

Kolchev A.A., Pasynkov D.V., Egoshin I.A., Kliouchkin I.V., Pasynkova O.O.

Статья научная

Our study aimed to develop a comprehensive system for discriminating between benign and malignant breast lesions on ultrasound images. The system integrated deep learning (DL) and conventional machine learning techniques. Our database consisted of 494 ultrasound images, comprising 231 benign and 263 malignant breast lesions. In the initial stage, we evaluated the performance of non-modified DL networks, including VGG-16, ResNet-18, and InceptionRes-NetV2. We assessed the results for the entire lesion as well as its inner and outer parts. For training the networks, we employed supervised transfer learning. In the second stage, we utilized a support vector machine (SVM) model for lesion classification. The features obtained from the modified DL networks, where we removed the last layers, were used for training and testing the SVM. In the final stage, we assessed the classification results using SVM, with a focus on selecting the most significant features obtained from the modified DL networks. We employed techniques such as ReliefF, FSCNCA, and LASSO for feature selection. Our three-step approach yielded impressive results, with an accuracy of 0.987, sensitivity of 0.989, and specificity of 0.983. These results significantly outperformed using only DL or DL+SVM without feature selection. Overall, our algorithm demonstrated sufficient accuracy in the clinical task of discriminating between benign and malignant breast lesions on ultrasound images.

Бесплатно

Design of a home video behavior recognition system based on visual privacy security mechanism

Design of a home video behavior recognition system based on visual privacy security mechanism

Zhao D.M.

Статья научная

The rapid development of the Internet and advanced technology has brought great convenience to people’s lives; However, real-time video and other privacy information obtained from computers can be leaked, resulting in economic losses and not conducive to the construction of computer network security. In response to the above issues, this study introduces compressed perception theory and temporal adaptive modules to achieve visual shielding, and based on this, designs a home video behavior system based on visual privacy security mechanism. The research results show that in the comparison of measurement matrices at different levels, the Bernoulli random matrix has the highest recognition accuracy, with recognition accuracy rates of 100 %, 98.73 %, 98.76 %, and 85.62 % from the first layer to the fourth layer, respectively. In the recognition performance results of different video behavior recognition systems in the YouTube database, UCF Sports database, and Hollywood2 database, the average recognition accuracy of the proposed system is the highest in most cases, with 94.6 %, 73.5 %, and 77.1 %, respectively. In summary, the system proposed in the study can achieve accurate recognition of home video behavior after visual masking, and has good results in practical applications.

Бесплатно

Document image analysis and recognition: a survey

Document image analysis and recognition: a survey

Arlazarov Vladimir Viktorovich, Andreeva Elena Igorevna, Bulatov Konstantin Bulatovich, Nikolaev Dmitry Petrovich, Petrova Olga Olegovna, Savelev Boris Igorevich, Slavin Oleg Anatolevich

Статья научная

This paper analyzes the problems of document image recognition and the existing solutions. Document recognition algorithms have been studied for quite a long time, but despite this, currently, the topic is relevant and research continues, as evidenced by a large number of associated publications and reviews. However, most of these works and reviews are devoted to individual recognition tasks. In this review, the entire set of methods, approaches, and algorithms necessary for document recognition is considered. A preliminary systematization allowed us to distinguish groups of methods for extracting information from documents of different types: single-page and multi-page, with text and handwritten contents, with a fixed template and flexible structure, and digitalized via different ways: scanning, photographing, video recording. Here, we consider methods of document recognition and analysis applied to a wide range of tasks: identification and verification of identity, due diligence, machine learning algorithms, questionnaires, and audits. The groups of methods necessary for the recognition of a single page image are examined: the classical computer vision algorithms, i.e., keypoints, local feature descriptors, Fast Hough Transforms, image binarization, and modern neural network models for document boundary detection, document classification, document structure analysis, i.e., text blocks and tables localization, extraction and recognition of the details, post-processing of recognition results. The review provides a description of publicly available experimental data packages for training and testing recognition algorithms. Methods for optimizing the performance of document image analysis and recognition methods are described.

Бесплатно

Efficiency of object identification for binary images

Efficiency of object identification for binary images

Magdeev Radik Gilfanovich, Tashlinskii Alexander Grigorevich

Статья научная

In this paper, a comparative analysis of the correlation-extreme method, the method of contour analysis and the method of stochastic gradient identification in the objects identification for a binary image is carried out. The results are obtained for a situation where possible deformations of an identified object with respect to a pattern can be reduced to a similarity model, that is, the pattern and the object may differ in scale, orientation angle, shift along the base axes, and additive noise. The identification of an object is understood as the recognition of its image with an estimate of the strain parameters relative to the template.

Бесплатно

Evaluation of the change in synthetic aperture radar imaging using transfer learning and residual network

Evaluation of the change in synthetic aperture radar imaging using transfer learning and residual network

Hamdi Imad, Tounsi Yassine, Benjelloun Mohammed, Nassim Abdelkrim

Статья научная

Change detection from synthetic aperture radar images becomes a key technique to detect change area related to some phenomenon as flood and deformation of the earth surface. This paper proposes a transfer learning and Residual Network with 18 layers (ResNet-18) architecture-based method for change detection from two synthetic aperture radar images. Before the application of the proposed technique, batch denoising using convolutional neural network is applied to the two input synthetic aperture radar image for speckle noise reduction. To validate the performance of the proposed method, three known synthetic aperture radar datasets (Ottawa; Mexican and for Taiwan Shimen datasets) are exploited in this paper. The use of these datasets is important because the ground truth is known, and this can be considered as the use of numerical simulation. The detected change image obtained by the proposed method is compared using two image metrics. The first metric is image quality index that measures the similarity ratio between the obtained image and the image of the ground truth, the second metrics is edge preservation index, it measures the performance of the method to preserve edges. Finally, the method is applied to determine the changed area using two Sentinel 1 B synthetic aperture radar images of Eddahbi dam situated in Morocco.

Бесплатно

Experimental assessment of the distance measurement accuracy using the active-pulse television measuring system and a digital terrain model

Experimental assessment of the distance measurement accuracy using the active-pulse television measuring system and a digital terrain model

Kapustin Vyacheslav Valerievich, Zahlebin Alexander Sergeevich, Movchan Andrey Kirillovich, Kuryachiy Mikhail Ivanovich, Krutikov Mikhail Vladimirovich

Статья научная

This paper considers an experimental study of the layout of an active-pulse television measuring system in the problem of assessing the accuracy of measuring the distance to objects using the depth maps. The main technical characteristics and structure of the active-pulse television measuring system layout are described, the description of the multi-zone ranging method used in the experiment is given. The field tests were carried out using a system for terrain orthophotomaps construction by an unmanned aerial vehicle and a geodetic measuring instrument, which is a reference for building a terrain plan and fixing distances between objects on the ground. The technique of carrying out aerial work is described to obtain the necessary data array, on which a digital model and an orthophotomap of the area were subsequently built. Conclusions are drawn about the accuracy of digital terrain models built based on the results of aerial photography from an unmanned aerial vehicle with a geodetic receiver on board and the applicability of these data as reference data for testing a prototype of an active-pulse television measuring system.

Бесплатно

Face anti-spoofing with joint spoofing medium detection and eye blinking analysis

Face anti-spoofing with joint spoofing medium detection and eye blinking analysis

Nikitin Mikhail Yurievich, Konushin Vadim Sergeyevich, Konushin Anton Sergeyevich

Статья научная

Modern biometric systems based on face recognition demonstrate high recognition quality, but they are vulnerable to face presentation attacks, such as photo or replay attack. Existing face anti-spoofing methods are mostly based on texture analysis and due to lack of training data either use hand-crafted features or fine-tuned pretrained deep models. In this paper we present a novel CNN-based approach for face anti-spoofing, based on joint analysis of the presence of a spoofing medium and eye blinking. For training our classifiers we propose the procedure of synthetic data generation which allows us to train powerful deep models from scratch. Experimental analysis on the challenging datasets (CASIA-FASD, NUUA Imposter) shows that our method can obtain state-of-the-art results.

Бесплатно

Facedetectnet: face detection via fully-convolutional network

Facedetectnet: face detection via fully-convolutional network

Gorbatsevich Vladimir Sergeevich, Moiseenko Anastasia Sergeevna, Vizilter Yury Valentinovich

Статья научная

Ace detection is one of the most popular computer vision tasks. There are a lot of face detection approaches proposed including different CNN-based techniques, but the problem of optimal balancing between detection quality and computational speed is still relevant. In this paper we propose new CNN-based solution for face detection called FaceDetectNet. Our CNN architecture is based on ideas of YOLO/DetectNet and GoogleNet architecture supported with some new tools and implementation details created especially for our face detection application. We propose: original iterative proposal clustering (IPC) algorithm for aggregation of output face proposals formed by CNN and the 2-level “weak pyramid” providing better detection quality on the testing sets containing both small and huge images. Our face detection approach is close to previously proposed SSD-based face detection, but the principal difference is that we use the deep features of top hidden CNN layer for forming the face proposals of any size...

Бесплатно

Fusion of information from multiple kinect sensors for 3D object reconstruction

Fusion of information from multiple kinect sensors for 3D object reconstruction

Ruchay Alexey N., Dorofeev Konstantin A., Kolpakov Vladimir I.

Статья научная

In this paper, we estimate the accuracy of 3D object reconstruction using multiple Kinect sen-sors. First, we discuss the calibration of multiple Kinect sensors, and provide an analysis of the ac-curacy and resolution of the depth data. Next, the precision of coordinate mapping between sen-sors data for registration of depth and color images is evaluated. We test a proposed system for 3D object reconstruction with four Kinect V2 sensors and present reconstruction accuracy results. Ex-periments and computer simulation are carried out using Matlab and Kinect V2.

Бесплатно

GPU acceleration of edge detection algorithm based on local variance and integral image: application to air bubbles boundaries extraction

GPU acceleration of edge detection algorithm based on local variance and integral image: application to air bubbles boundaries extraction

Bettaieb Afef, Filali Nabila, Filali Taoufik, Ben Aissia Habib

Статья научная

Accurate detection of air bubbles boundaries is of crucial importance in determining the performance and in the study of various gas/liquid two-phase flow systems. The main goal of this work is edge extraction of air bubbles rising in two-phase flow in real-time. To accomplish this, a fast algorithm based on local variance is improved and accelerated on the GPU to detect bubble contour. The proposed method is robust against changes of intensity contrast of edges and capable of giving high detection responses on low contrast edges. This algorithm is performed in two steps: in the first step, the local variance of each pixel is computed based on integral image, and then the resulting contours are thinned to generate the final edge map. We have implemented our algorithm on an NVIDIA GTX 780 GPU. The parallel implementation of our algorithm gives a speedup factor equal to 17x for high resolution images (1024×1024 pixels) compared to the serial implementation. Also, quantitative and qualitative assessments of our algorithm versus the most common edge detection algorithms from the literature were performed. A remarkable performance in terms of results accuracy and computation time is achieved with our algorithm.

Бесплатно

Generation and study of the synthetic brain electron microscopy dataset for segmentation purpose

Generation and study of the synthetic brain electron microscopy dataset for segmentation purpose

Sokolov N.A., Vasiliev E.P., Getmanskaya A.A.

Статья научная

Advanced microscopy technologies such as electron microscopy have opened up a new field of vision for biomedical researchers. The use of artificial intelligence methods for processing EM data is largely difficult due to the small amount of annotated data at the training stage. Therefore, we add synthetic images to an annotated real EM dataset or use a fully synthetic training dataset. In this work, we present an algorithm for the synthesis of 6 types of organelles. Based on the EPFL dataset, a training set of 1161 real fragments 256×256 (ORG) and 2000 synthetic ones (SYN), as well as their combination (MIX), were generated. The experiment of training models for 6, 5-classes and binary segmentation showed that, despite the imperfections of synthetics, training on a mixed (MIX) dataset gave a significant increase (about 0.1) in the Dice metric for 6 and 5 and same results at binary. The synthetic data strategy gives annotations for free, but shifts the effort to producing sufficiently realistic images.

Бесплатно

Gradient-based technique for image structural analysis and applications

Gradient-based technique for image structural analysis and applications

Asatryan David G.

Статья научная

This paper is devoted to application of gradients field characteristics in selected problems of image intellectual analysis and processing. To analyse the properties and structure of an image several approaches and models based on the use of the gradients field characteristics, are proposed. In this paper, models based on Weibull distribution are considered, an image dominant direction estimation algorithm using the parameters of scattering ellipse of gradients field components is proposed, and a similarity measure of two images with arbitrary dimensions and orientation is proposed. Some examples of applications of these models for estimation of blur and structuredness of an image, for the quality assessment of resizing and rotating algorithms, as well as for detection of a specified object on the image delivered by an unmanned aerial vehicle, are given.

Бесплатно

Head model reconstruction and animation method using color image with depth information

Head model reconstruction and animation method using color image with depth information

Kozlova Yu.kh., Myasnikov V.V.

Статья научная

The article presents a method for reconstructing and animating a digital model of a human head from a single RGBD image, a color RGB image with depth information. An approach is proposed for optimizing the parametric FLAME model using a point cloud of a face corresponding to a single RGBD image. The results of experimental studies have shown that the proposed optimization approach makes it possible to obtain a head model with more prominent features of the original face compared to optimization approaches using RGB images or the same approaches generalized to RGBD images.

Бесплатно

High-speed recursive-separable image processing filters with variable scanning aperture sizes

High-speed recursive-separable image processing filters with variable scanning aperture sizes

Kamenskiy A.V., Kuryachiy M.I., Krasnoperova A.S., Ilyin Yu.V., Akaeva T.M., Boyarkin S.E.

Статья научная

In the process of development of computer technologies, the number of areas of their application naturally grows and, along with it, the complexity of the tasks to be solved, which entails the need for new research. Similar tasks include digital filtering of images in the field of medical technologies and active-pulse television measuring systems. There are many methods and algorithms of digital filtering designed to solve the problem of improving the quality; algorithms that can improve the quality of images while reducing computational costs are widely used. High demands, which are made due to the constant growth in the size of the generated images, as well as the requirement for modern television systems, is real-time operation. When solving practical problems, it is required to use different filter aperture sizes, which provide an increase in quality and preservation of image details. The solution of these problems was the reason for the emergence of adaptive filters that are able to change the parameters in the process of processing the received data, while not spending additional time on processing with an increase in the size of the aperture. The paper presents the principles of constructing adaptive image processing filters, which, by obtaining an input parameter indicating the required dimension of a multi-element aperture, are able to implement the construction of the required aperture. The Laplacian “Truncated Pyramid” filter and the “double pyramid” Laplacian were modified. A feature of these filters is the oddness of the multi-element aperture, so the coefficient used to build the mask is always set to odd. When using these filters, it is possible to use two coefficients that are responsible for increasing the filtration efficiency, since, in their original form, the Laplacian filters have a sum of coefficients equal to zero. The experiment shows a comparison with high-dimensional filters that work when using classical two-dimensional convolution. The next stage of the presented research will be the application of parallel computing techniques, which will increase the speed of the developed filters.

Бесплатно

Image compression and encryption based on wavelet transform and chaos

Image compression and encryption based on wavelet transform and chaos

Gao Haibo, Zeng Wenjuan

Статья научная

With the rapid development of network technology, more and more digital images are transmitted on the network, and gradually become one important means for people to access the information. The security problem of the image information data increasingly highlights and has become one problem to be attended. The current image encryption algorithm basically focuses on the simple encryption in the frequency domain or airspace domain, and related methods also have some shortcomings. Based on the characteristics of wavelet transform, this paper puts forward the image compression and encryption based on the wavelet transform and chaos by combining the advantages of chaotic mapping. This method introduces the chaos and wavelet transform into the digital image encryption algorithm, and transforms the image from the spatial domain to the frequency domain of wavelet transform, and adds the hybrid noise to the high frequency part of the wavelet transform, thus achieving the purpose of the image degradation and improving the encryption security by combining the encryption approaches in the spatial domain and frequency domain based on the chaotic sequence and the excellent characteristics of wavelet transform...

Бесплатно

Журнал