Обработка изображений, распознавание образов. Рубрика в журнале - Компьютерная оптика
Screen recapture detection based on color-texture analysis of document boundary regions
Статья научная
This paper examines a presentation attack detection when a document recaptured from a screen is presented instead of the original document. We propose an algorithm based on analyzing a moiré pattern within document boundary regions as a distinctive feature of the recaptured image. It is assumed that the pattern overlapping the document boundaries is a recapture artifact, not a match between document and background textures. To detect such a pattern, we propose an algorithm that employs the result of the fast Hough transform of the document boundary regions with enhanced pattern contrast. The algorithm performance was measured for the open dataset DLC-2021, which contains images of mock documents as originals and their screen recaptures. The precision of the proposed solution was evaluated as 95.4 %, and the recall as 90.5 %.
Бесплатно
Статья научная
3D mesh segmentation has become an essential step in many applications in 3D shape analysis. In this paper, a new segmentation method is proposed based on a learning approach using the artificial neural networks classifier and the spectral clustering for segmentation. Firstly, a training step is done using the artificial neural network trained on existing segmentation, taken from the ground truth segmentation (done by humane operators) available in the benchmark proposed by Chen et al. to extract the candidate boundaries of a given 3D-model based on a set of geometric criteria. Then, we use this resulted knowledge to construct a new connectivity of the mesh and use the spectral clustering method to segment the 3D mesh into significant parts. Our approach was evaluated using different evaluation metrics. The experiments confirm that the proposed method yields significantly good results and outperforms some of the competitive segmentation methods in the literature.
Бесплатно
Статья
Melanoma skin cancer is one of the most dangerous forms of skin cancer because it grows fast and causes most of the skin cancer deaths. Hence, early detection is a very important task to treat melanoma. In this article, we propose a skin lesion segmentation method for dermoscopic images based on the U-Net architecture with VGG-16 encoder and the semantic segmentation. Base on the segmented skin lesion, diagnostic imaging systems can evaluate skin lesion features to classify them. The proposed method requires fewer resources for training, and it is suitable for computing systems without powerful GPUs, but the training accuracy is still high enough (above 95 %). In the experiments, we train the model on the ISIC dataset – a common dermoscopic image dataset. To assess the performance of the proposed skin lesion segmentation method, we evaluate the Sorensen-Dice and the Jaccard scores and compare to other deep learning-based skin lesion segmentation methods. Experimental results showed that skin lesion segmentation quality of the proposed method are better than ones of the compared methods.
Бесплатно
Статья научная
The SUPPOSe enhanced deconvolution algorithm relies in assuming that the image source can be described by an incoherent superposition of virtual point sources of equal intensity and finding the number and position of such virtual sources. In this work we describe the recent advances in the implementation of the method to gain resolution and remove artifacts due to the presence of fluorescent molecules close enough to the image frame boundary. The method was modified removing the invariant used before given by the product of the flux of the virtual sources times the number of virtual sources, and replacing it by a new invariant given by the total flux within the frame, thus allowing the location of virtual sources outside the frame but contributing to the signal inside the frame.
Бесплатно
Spatiotemporal ecosystem health assessment comparison under the pressure-state-response framework
Статья научная
A spatiotemporal ecosystem health (EH) assessment study is necessary for sustainable development and proper management of natural resources. At present higher rate of human-socio-economic activities, industrialization, and misuse of land are major factors for ecosystem degradation. Therefore this research work used remote sensing (RS) and geographical information system (GIS) technology, under pressure-state-response (PSR) framework with analytic hierarchy process (AHP) weight method based on 29 indicators were analyzed for spatiotemporal EH assessment in Tatarstan and Samara states in Russia from 2010 to 2020. Results indicate continuous degradation of EH in Tatarstan state while in Samara state first decreased and later on an improved ecosystem health condition. This is one of the most innovative analyses work for real-time accurate ecosystem health assessment, mapping, and monitoring as well as protect fragile eco-environment with sustainable development, proper policy-making, and management at any scale and region.
Бесплатно
Super-resolution microscopy based on interpolation and wide spectrum de-noising
Статья научная
In the conventional single-molecule localizations and super-resolution microscopy, the pixel size of a raw image is approximately equal to the standard deviation of the point spread function. Such a raw image is referred to herein as a conventional raw image, based on which better single molecule localization effect and efficiency can be achieved. It is found that both interpolation and de-noising can effectively improve the Signal to Noise Ratio of the conventional raw image. The conventional raw image, the de-noised, the interpolated and the de-noised interpolated are compared and analyzed and compressed sensing is used for super-resolution reconstruction. The simulation results show that both the highest Signal to Noise Ratio and the best super-resolution reconstruction can be obtained by de-noising the interpolated conventional raw image. This method also renders the best super-resolution reconstruction and minimum gradient in the real experiment. De-noising the interpolated conventional raw image is an effective method to improve the super-resolution microscopy.
Бесплатно
Super-resolution microscopy based on wide spectrum denoising and compressed sensing
Статья научная
WSD can effectively remove random noise of a raw image from very low density to ultra-high density fluorescent molecular distribution scenarios. The size of the raw image that WSD can denoise is subject to the used measurement matrix. A large raw image must be divided into blocks so that WSD denoises each block separately. Based on traditional single-molecule localization and super-resolution reconstruction scenarios, wide spectrum denoising (WSD) for blocks of different sizes was studied. The denoising ability is related to block sizes. The general trend is when the block gets larger, the denoising effect gets worse. When the block size is equal to 10, the denoising effect is the best. Using compressed sensing, only 20 raw images are needed for reconstruction. The temporal resolution is less than half a second. The spatial resolution is also greatly improved.
Бесплатно
Switching median filter for suppressing multi-pixel impulse noise
Статья научная
This paper proposes a new switching median filter for suppressing multi-pixel impulse noise in X-ray images. A multi-pixel impulse is understood as a set of several neighboring pixels, the intensity of each significantly exceeds background intensity. Multi-pixel noise can occur, for example, due to the blooming effect, the reason being the limited value of pixel saturation capacity. This article defines the thresholds for the intensity increment relative to the eight immediate neighbors, above which the current pixel is processed by the median filter. The dependence of these thresholds on the number of pixels in an impulse is presented. The proposed algorithm is based on the median filtering process, which consists of several iterations. In this case, the filter has the smallest possible size, which minimizes image distortion during processing. In particular, to exclude a single-pixel impulse, pixel processing is turned on when intensity surge exceeds 3.5 with the grayscale value ranging from 0 to 1. At the same time, to exclude nine-pixel impulses, three iterations are required with the following thresholds: the first iteration with a threshold 2.0; the second iteration also with a threshold 2.0 and the third iteration with a threshold 3.5. The algorithm proposed was tested on real X-ray images corrupted by multi-pixel impulse noise. The algorithm is not only simple, but also reliable and suitable for real-time implementation and application. The efficiency of the technique is shown in comparison with other known filtering methods with respect to the degree of noise suppression. The main result of the testing is that only the proposed method allows excluding multi-pixel noise. Other advantage of the algorithm is its weak effect on the level of Gaussian noise leading to the absence of image blurring (or preserving image details) during processing.
Бесплатно
Статья научная
We discuss a synthesis of stochastic algorithms, obtaining expressions for gradients of Shannon, Renyi and Tsallis mutual information on the basis of the mathematical apparatus of stochastic gradient adaptation of algorithms for estimating image registration parameters. To obtain the expressions, derivatives of the image entropy with respect to the estimated parameters are used. The entropies are calculated using a Parzen window method. A comparative study of the synthesized algorithms in terms of stability and accuracy of the registration parameter estimates, including in conditions of additive noise, is carried out.
Бесплатно
Статья научная
Topography is the study of an area on the earth's surface. This term relates to the land's slope or contour, which is the interval of elevation differences between two adjacent and parallel contour lines. Topography generally presents a three-dimensional model of object surface relief and an identification of land or hilly areas based on horizontal coordinates such as latitude and longitude, and vertical position, namely elevation. The topography is essential information that must be provided in the execution of building or road construction based on the ground contour. The problem which is the ground contour which can provide visualization topography as a three-dimensional (3D) condition of the ground contour is not normal (non-linear). Another problem is that the traditional measurement techniques with wheel rotation only measure distances and cannot represent the trajectory of the ground contour in 3D. The proposed in-depth evaluation of orientation estimation results in the topography accuracy level. This methodology consists of several processes; Inertia and orientation of an object, Distance measurement, Terrestrial topocentric - Euclidean transformation, and Topography visualization. This research designed a prototype and proposed a new visualization method of the ground contours to reconstruct a topography map between three algorithms; Direct Cosine Matrix-3D Coordinate, Madgwick-3D Coordinate, and Complementary Filter. The methodology was tested and evaluated intensively by direct observation at three measurement locations with different difficulty levels. As a result, the Direct Cosine Matrix-3D Coordinate is able to visualize the ground contours by reconstructing a topography map much better than other methods.
Бесплатно
Статья научная
In this study, the diagnostic abilities of intensity parameters of optical coherence tomography angiography (OCTA) images in the early detection of diabetic retinopathy (DR) were determined. 78 normal healthy eyes, 10 diabetic eyes with mild non-proliferative diabetic retinopathy (NPDR), and 10 diabetic eyes with moderate NPDR were employed. Four retinal vascular plexuses were generated by using OCTA, which included the nerve fiber layer vascular plexus (NFLVP), superficial vascular plexus (SVP), intermediate capillary plexus (ICP) and deep capillary plexus (DCP). The parafoveal zone in each OCTA image was divided into four sectors which were the superior, temporal, inferior, and nasal sectors. Five intensity parameters including the mean, median, variance, skewness, and kurtosis of intensities were calculated for each sector. The factor of aging was evaluated among normal healthy subgroups. The diagnostic abilities of intensity parameters were evaluated between normal healthy subjects and diabetic patients with DR. Our results showed that the variance of intensities in superior sector in ICP achieved the highest AUROC value of 0.95 with the sensitivity of 0.87 and the specificity of 1.000 when comparing the diabetic patients with the mild NPDR to normal healthy subjects. The mean intensity in superior sector in ICP achieved the second highest AUROC value of 0.95 with the sensitivity of 0.90 and the specificity of 0.90 when comparing the diabetic patients with the moderate NPDR to normal healthy subjects. The proposed approach could offer a simple way to differentiate diabetic patients with early DR from normal healthy subjects without performing the relatively complicated image processing techniques.
Бесплатно
The study of skeleton description reduction in the human fall-detection task
Статья научная
Accurate and reliable real-time fall detection is a key aspect of any intelligent elderly people care system. A lot of modern RGB-D cameras can provide a skeleton description of a human figure as a compact pose presentation. This makes it possible to use this description for further analysis without access to real video and, thus, to increase the privacy of the whole system. The skeleton description reduction based on the anthropometrical characteristics of a human body is proposed. The experimental study on the TST Fall Detection dataset v2 by the Leave-One-Person-Out method shows that the proposed skeleton description reduction technique provides better recognition quality and increases the overall performance of a Fall-Detection System.
Бесплатно
Time-optimal algorithms focused on the search for random pulsed-point sources
Статья научная
The article describes methods and algorithms related to the analysis of dynamically changing discrete random fields. Time-optimal strategies for the localization of pulsed-point sources having a random spatial distribution and indicating themselves by generating instant delta pulses at random times are proposed. An optimal strategy is a procedure that has a minimum (statistically) average localization time. The search is performed in accordance with the requirements for localization accuracy and is carried out by a system with one or several receiving devices. Along with the predetermined accuracy of localization of a random pulsed-point source, a significant complicating factor of the formulated problem is that the choice of the optimal search procedure is not limited to one-step algorithms that end at the moment of first pulse generation. Moreover, the article shows that even with relatively low requirements for localization accuracy, the time-optimal procedure consists of several steps, and the transition from one step to another occurs at the time of registration of the next pulse by the receiving system. In this case, the situation is acceptable when during the process of optimal search some of the generated pulses are not fixed by the receiving system. The parameters of the optimal search depending on the number of receiving devices and the required accuracy of localization are calculated and described in the paper.
Бесплатно
Traffic extreme situations detection in video sequences based on integral optical flow
Статья научная
Road traffic analysis is an important task in many applications and it can be used in video surveillance systems to prevent many undesirable events. In this paper, we propose a new method based on integral optical flow to analyze cars movement in video and detect flow extreme situations in real-world videos. Firstly, integral optical flow is calculated for video sequences based on optical flow, thus random background motion is eliminated; secondly, pixel-level motion maps which describe cars movement from different perspectives are created based on integral optical flow; thirdly, region-level indicators are defined and calculated; finally, threshold segmentation is used to identify different cars movements. We also define and calculate several parameters of moving car flow including direction, speed, density, and intensity without detecting and counting cars. Experimental results show that our method can identify cars directional movement, cars divergence and cars accumulation effectively.
Бесплатно
Tree-serial parametric dynamic programming with flexible prior model for image denoising
Статья научная
We consider here image denoising procedures, based on computationally effective tree-serial pa-rametric dynamic programming procedures, different representations of an image lattice by the set of acyclic graphs and non-convex regularization of a new type which allows to flexibly set a priori pref-erences. Experimental results in image denoising, as well as comparison with related methods, are provided. A new extended version of multi quadratic dynamic programming procedures for image denoising, proposed here, shows an improved accuracy for images of a different type.
Бесплатно
U-net-bin: hacking the document image binarization contest
Статья научная
Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO' 17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.
Бесплатно
Unsupervised color texture segmentation based on multi-scale region-level Markov random field models
Статья научная
In the field of color texture segmentation, region-level Markov random field model (RMRF) has become a focal problem because of its efficiency in modeling the large-range spatial constraints. However, the RMRF defined on a single scale cannot describe the un-stationary essence of the image, which highly limits its robustness. Hence, by combining wavelet transformation and the RMRF model, we present a multi-scale RMRF (MsRMRF) model in wavelet domainin this paper. In the Bayesian framework, the proposed model seamlessly integrates the multi-scale information stemmed from both the original image and the region-level spatial constraints. Therefore, the new model can accurately describe the characteristics of different kinds of texture. Based on MsRMRF, an unsupervised segmentation algorithm is designed for segmenting color texture images. Both synthetic color texture images and remote sensing images are employed in the comparative experiments, and the experimental results show that the proposed method can obtain more accurate segmentation results than the competitors.
Бесплатно
Vanishing point detection with direct and transposed fast hough transform inside the neural network
Статья научная
In this paper, we suggest a new neural network architecture for vanishing point detection in images. The key element is the use of the direct and transposed fast Hough transforms separated by convolutional layer blocks with standard activation functions. It allows us to get the answer in the coordinates of the input image at the output of the network and thus to calculate the coordinates of the vanishing point by simply selecting the maximum. Besides, it was proved that calculation of the transposed fast Hough transform can be performed using the direct one. The use of integral operators enables the neural network to rely on global rectilinear features in the image, and so it is ideal for detecting vanishing points. To demonstrate the effectiveness of the proposed architecture, we use a set of images from a DVR and show its superiority over existing methods. Note, in addition, that the proposed neural network architecture essentially repeats the process of direct and back projection used, for example, in computed tomography.
Бесплатно
Vehicle wheel weld detection based on improved YOLO V4 algorithm
Статья научная
In recent years, vision-based object detection has made great progress across different fields. For instance, in the field of automobile manufacturing, welding detection is a key step of weld inspection in wheel production. The automatic detection and positioning of welded parts on wheels can improve the efficiency of wheel hub production. At present, there are few deep learning based methods to detect vehicle wheel welds. In this paper, a method based on YOLO v4 algorithm is proposed to detect vehicle wheel welds. The main contributions of the proposed method are the use of k-means to optimize anchor box size, a Distance-IoU loss to optimize the loss function of YOLO v4, and non-maximum suppression using Distance-IoU to eliminate redundant candidate bounding boxes. These steps improve detection accuracy. The experiments show that the improved methods can achieve high accuracy in vehicle wheel weld detection (4.92 % points higher than the baseline model with respect to AP75 and 2.75 % points higher with respect to AP50). We also evaluated the proposed method on the public KITTI dataset. The detection results show the improved method’s effectiveness.
Бесплатно
Video images compression and restoration methods based on optimal sampling
Статья научная
The study proposes video images compression and restoration methods based on multidimensional sampling theory that provide four-fold video compression and subsequent real-time restoration with loss levels below visually perceptible threshold. The proposed methods can be used separately or along with any other video compression techniques, thus providing additional quadruple compression.
Бесплатно