In this paper, a robust voice activity detection algorithm based on a long-term metric using dominant frequency and spectral flatness measure is proposed. The proposed algorithm makes use of the discriminating power of both features to derive the decision rule. This method reduces the average number of speech detection errors. We evaluate its performance using 15 additive noises at different SNRs (-10 dB to 10 dB) and compare it with some of the most recent standard algorithms. Experiments show that our proposed algorithm achieves the best performance in terms of accuracy rate averaged over all SNRs and noises.
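The two per-frame features named in the abstract can be sketched as follows. This is a minimal illustration using textbook definitions, not the authors' implementation: the long-term metric and the decision rule are not given in the abstract, and the function names `spectral_flatness` and `dominant_frequency` are assumptions introduced here.

```python
import numpy as np

def spectral_flatness(frame, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum.

    Values near 1 indicate a flat, noise-like spectrum; values near 0
    indicate a peaky, tonal spectrum.
    """
    power = np.abs(np.fft.rfft(frame)) ** 2 + eps  # eps avoids log(0)
    geometric_mean = np.exp(np.mean(np.log(power)))
    arithmetic_mean = np.mean(power)
    return geometric_mean / arithmetic_mean

def dominant_frequency(frame, sample_rate):
    """Frequency in Hz of the strongest non-DC bin of the magnitude spectrum."""
    spectrum = np.abs(np.fft.rfft(frame))
    spectrum[0] = 0.0  # ignore the DC component
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    return freqs[np.argmax(spectrum)]

# Usage: a pure tone is far less flat than white noise.
sample_rate = 8000
t = np.arange(256) / sample_rate
tone = np.sin(2 * np.pi * 500 * t)
noise = np.random.default_rng(0).standard_normal(256)
print(dominant_frequency(tone, sample_rate))
print(spectral_flatness(tone), spectral_flatness(noise))
```

A detector along these lines would typically track both features over many consecutive frames before deciding; how they are combined is left open here.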

Free

Role of GLCM Features in Identifying Abnormalities in the Retinal Images

Shantala Giraddi, Jagadeesh Pujari, Shivanand Seeri

Scientific article

Accurate detection of exudates in diabetic retinal images is a challenging task. The images can have varying contrast and color characteristics. In this paper, the authors compare the performance of two feature extraction methods: color intensity features and second-order texture features based on the gray-level co-occurrence matrix (GLCM). The authors propose and implement a new approach to GLCM feature calculation in which the input image is divided into a number of smaller blocks and GLCM features are computed on these blocks. The performance of each feature extraction method is evaluated using a Back Propagation Neural Network (BPNN) classifier that classifies each block as either abnormal or normal. With GLCM features, an accuracy of 76.6% was obtained, and with color features an accuracy of 100% was obtained. It was found that color features are better at identifying true positives than GLCM-based texture features. However, the use of GLCM features reduces the occurrence of false positives.
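The block-wise GLCM idea can be illustrated with a short sketch. The block size, the number of gray levels, the horizontal neighbour offset, and the three features chosen (contrast, energy, homogeneity) are all assumptions made for illustration; the abstract does not state which settings or features the authors used.

```python
import numpy as np

def glcm(block, levels=8):
    """Symmetric, normalized co-occurrence matrix for horizontal neighbours."""
    # Quantize 8-bit intensities down to `levels` gray levels.
    q = (block.astype(np.int64) * levels) // 256
    left, right = q[:, :-1].ravel(), q[:, 1:].ravel()
    m = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(m, (left, right), 1.0)  # count each (left, right) pair
    m += m.T                          # make the matrix symmetric
    return m / m.sum()

def glcm_features(p):
    """Contrast, energy and homogeneity of a normalized GLCM."""
    i, j = np.indices(p.shape)
    contrast = np.sum(p * (i - j) ** 2)
    energy = np.sum(p ** 2)
    homogeneity = np.sum(p / (1.0 + (i - j) ** 2))
    return contrast, energy, homogeneity

def blockwise_features(image, block_size=16):
    """One feature row per non-overlapping block of a grayscale image."""
    h, w = image.shape
    rows = []
    for r in range(0, h - block_size + 1, block_size):
        for c in range(0, w - block_size + 1, block_size):
            block = image[r:r + block_size, c:c + block_size]
            rows.append(glcm_features(glcm(block)))
    return np.array(rows)

# Usage: a 32x32 image yields four 16x16 blocks, three features each.
image = np.random.default_rng(0).integers(0, 256, size=(32, 32))
print(blockwise_features(image).shape)
```

Each feature row could then be passed to a block classifier such as the BPNN mentioned above.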

Free

Rough Neuron network for Fault Diagnosis

Yueling ZHAO, Hui jin, Lihong Wang, Shuang WANG

Scientific article

Because the training time of a traditional BP neural network is too long and it cannot handle multiple-valued input vectors, a new BP neural network method based on rough neurons is presented. A rough neuron can be viewed as a pair of neurons: one corresponds to the upper boundary and the other to the lower boundary. The upper and lower neurons exchange information with each other during the calculation of their outputs. First, the continuous attributes in the diagnostic decision system are discretized with particle swarm optimization. Then, the reducts are found based on the attribute dependence of rough sets, and the optimal diagnostic decision is determined. Lastly, according to the optimal decision system, a rough neuron network is designed for fault diagnosis. A practical example is given, showing that the method is feasible and effective.

Free

SIFT-BZM: Pixel Based Forgery Image Detection Using Scale-Invariant Feature Transform and Block Based Zernike Moments

Kshipra Ashok Tatkare, Manoj Devare

Scientific article

A new area of image processing termed "digital image forensics" aims to gather quantifiable proof of a digital image's authenticity and place of origin. Forgery detection looks for copied and pasted portions of an image; however, the detection method may differ depending on whether the copied portion underwent post-processing before being transferred to another party. Combining Zernike moments with the Scale-Invariant Feature Transform (SIFT) aids in the identification of both textured and smooth regions, but this combination is slower than SIFT alone. In the proposed work, a model based on block-based image division and SIFT-based key point detection is therefore developed to detect forged images. The gathered images have poor visual quality and varying dimensions, so they are resized and converted to grayscale. In addition, the pixel values of the images are improved using an optimal Gaussian filter and adaptive histogram equalization, which remove noise and blurring based on the sigma value. Then, the SIFT key point extraction algorithm is used to extract the image's key points and compute the feature vector of each key point. Next, a block-based matching technique splits the preprocessed images into blocks, and each block is diagonally subdivided. The length of the feature vector is computed using the Zernike moments of each block. Both SIFT features and Zernike moment features are matched to identify the manipulated images in the given data. The proposed model provides 100% recall, 98.2% precision, and a 99.09% F1 score. Thus, the proposed model effectively detects forged images in the given data.

Free

SNR Improvement by Photon Noise Filtering in Ocean Color Monitor Satellite Images

Ashok Kumar, Rajiv Kumaran, Harsh C Trivedi

Scientific article

In high radiometric resolution electro-optical imaging payloads of remote sensing satellites, photon noise dominates SNR performance. Photon noise is input-signal dependent and difficult to filter. This paper proposes a photon noise filtering technique for Ocean Color Monitor (OCM) images. Existing filtering techniques are meant for object detection and handle images with poor SNR. As OCM SNR is on the higher side, a custom sigma-filter-based denoising technique is developed. The proposed technique first converts photon noise to signal-independent Gaussian noise. For this variance stabilization, the Anscombe transform is used. Simulations are carried out on various images. The proposed technique provides a 20-50% reduction in overall as well as count-wise RMSE. FFT analysis shows a significant reduction in noise. The proposed technique is of low complexity.
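The variance-stabilization step can be made concrete with a small sketch. The Anscombe transform below is the standard one, 2·sqrt(x + 3/8), which maps Poisson counts to values with approximately unit variance. The sigma filter is a generic textbook version, not the paper's custom filter, and the window radius, the threshold factor `k`, and the simple algebraic inverse are assumptions made for illustration.

```python
import numpy as np

def anscombe(x):
    """Map Poisson-distributed counts to approximately unit-variance values."""
    return 2.0 * np.sqrt(np.asarray(x, dtype=np.float64) + 3.0 / 8.0)

def inverse_anscombe(y):
    """Plain algebraic inverse; slightly biased at very low counts."""
    return (np.asarray(y, dtype=np.float64) / 2.0) ** 2 - 3.0 / 8.0

def sigma_filter(img, radius=1, k=2.0, sigma=1.0):
    """Average only the neighbours within k*sigma of the centre pixel."""
    img = np.asarray(img, dtype=np.float64)
    padded = np.pad(img, radius, mode="reflect")
    out = np.empty_like(img)
    size = 2 * radius + 1
    for r in range(img.shape[0]):
        for c in range(img.shape[1]):
            window = padded[r:r + size, c:c + size]
            mask = np.abs(window - img[r, c]) <= k * sigma  # always includes the centre
            out[r, c] = window[mask].mean()
    return out

def denoise_counts(counts, radius=1, k=2.0):
    """Stabilize, filter with sigma = 1, then map back to the count domain."""
    stabilized = anscombe(counts)
    return inverse_anscombe(sigma_filter(stabilized, radius=radius, k=k, sigma=1.0))

# Usage: after the transform, the standard deviation is close to 1
# regardless of the mean count.
rng = np.random.default_rng(0)
for mean_count in (20, 200, 2000):
    print(mean_count, anscombe(rng.poisson(mean_count, 100_000)).std())
```

Because the stabilized noise has roughly unit standard deviation at every signal level, a single fixed threshold can be used across the whole image.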

Free

SWT-PnP-DnCNN: Medical Image Fusion Using Stationary Wavelet Transform and Plug-and-Play Deep Denoising Model

Amit Pandey, Prabhishek Singh, Akansha Singh, Achyut Shankar, Manoj Diwakar

Scientific article

This paper presents a hybrid medical image fusion (MIF) technique (SWT-PnP-DnCNN) that combines multiscale decomposition, spatial-frequency-driven fusion, and deep denoising priors to efficiently integrate MIF images. The SWT-PnP-DnCNN begins with the Stationary Wavelet Transform (SWT) to decompose input medical images into low-frequency (LFSBs) and high-frequency (HFSBs) subbands. The LFSBs are fused using spatial frequency-based weighted averaging, effectively integrating overall intensity and contrast information. For the HFSBs, a local energy and max-selection strategy is adopted to retain salient edge features from the source images. Following the initial fusion, a Plug-and-Play (PnP) optimization strategy is applied to improve this fused image. This step uses a pretrained DnCNN model as a deep denoiser, serving as an implicit image prior in a model-driven iterative framework. Each iteration alternates between a data consistency step and a denoising step, significantly reducing artifacts and enhancing structural fidelity in the result. The effectiveness of SWT-PnP-DnCNN is demonstrated on benchmark CT-MRI, MRI-PET, and MRI-PET datasets. Extensive evaluation against classical hybrid strategies and recent CNN-based fusion methods shows that SWT-PnP-DnCNN achieves the best performance across standard metrics. We further include mean±std reporting and paired t-tests, confirming statistically significant improvements (p < 0.05). Ablation studies validate each design choice by comparing SWT-only vs. SWT+PnP and evaluating denoiser alternatives, with sensitivity to PnP iterations, regularization strength, and SWT levels. The runtime analysis clarifies feasible deployment, particularly in offline or cloud-based environments. Overall, SWT-PnP-DnCNN emerges as a robust, interpretable, and clinically valuable solution for enhancing MIF in medical imaging applications.

Free

Satellite Image Classification and Segmentation by Using JSEG Segmentation Algorithm

Khamael Abbas, Mustafa Rydh

Scientific article

In this paper, an adopted approach to fully automatic satellite image segmentation, called JSEG ("JPEG image segmentation"), is presented. First, colors in the image are quantized to differentiate regions in the image. Then image pixel colors are replaced by their corresponding color class labels, thus forming a class-map of the image. A criterion for “good” segmentation using this class-map is proposed. Applying the criterion to local windows in the class-map results in the “J-image”, in which high and low values correspond to possible region boundaries and region centers, respectively. A region growing method is then used to segment the image based on the multi-scale J-images. Experiments show that JSEG provides good segmentation and classification results on a variety of images.
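The segmentation criterion can be sketched under the assumption that it is the J value commonly associated with JSEG: the ratio of between-class to within-class scatter of pixel positions, J = (S_T - S_W) / S_W. The abstract does not give the formula, so this definition and the function name `j_value` are assumptions. Evaluating it over a local window around each pixel would give a J-image.

```python
import numpy as np

def j_value(class_map):
    """J = (S_T - S_W) / S_W computed on the pixel positions of a class-map.

    S_T is the total scatter of all positions around their global mean;
    S_W is the sum of the scatters of each class around its own mean.
    """
    rows, cols = np.indices(class_map.shape)
    z = np.stack([rows.ravel(), cols.ravel()], axis=1).astype(np.float64)
    labels = class_map.ravel()
    s_total = np.sum((z - z.mean(axis=0)) ** 2)
    s_within = 0.0
    for label in np.unique(labels):
        pts = z[labels == label]
        s_within += np.sum((pts - pts.mean(axis=0)) ** 2)
    if s_within == 0.0:
        return float("inf")  # degenerate case: every class is a single point
    return (s_total - s_within) / s_within

# Usage: spatially separated classes give a high J,
# thoroughly mixed classes give a J near zero.
separated = np.repeat([[0, 0, 1, 1]], 4, axis=0)
mixed = np.indices((4, 4)).sum(axis=0) % 2
print(j_value(separated), j_value(mixed))
```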

Free

Satellite Image Processing for Land Use and Land Cover Mapping

Ashoka Vanjare, S.N. Omkar, J.Senthilnath

Scientific article

In this paper, the urban growth of the Bangalore region is analyzed and discussed using multi-temporal and multi-spectral Landsat satellite images. Urban growth analysis helps in understanding change in the Bangalore region. Change detection is studied over a period of 39 years, and the region of interest covers an area of 2182 km². The main cause of urban growth is the increase in population. In India, rapid urbanization is being witnessed due to the increase in population, and continuous development has affected the existence of natural resources. Therefore, observing and monitoring natural resources (land use) plays an important role. To analyze change detection, researchers use remote sensing data. Continuous use of remote sensing data helps researchers to analyze change over time. The main objective of this study is to monitor land cover changes in Bangalore district, which covers rural and urban regions, using multi-temporal and multi-sensor Landsat images from the Multispectral Scanner (MSS), Thematic Mapper (TM), and Enhanced Thematic Mapper Plus (ETM+) sensors, captured in the years 1973, 1992, 1999, 2002, 2005, 2008 and 2011. Temporal changes were determined using the maximum likelihood classification method. The classification results contain four land cover classes: built-up, vegetation, water and barren land. The results indicate that the region is densely developed, which has resulted in a decrease of water and vegetation regions. The continuous transformation of barren land to built-up region has affected water and vegetation regions. Overall, from 1973 to 2011 the percentage of urban region increased from 4.6% to 25.43%, mainly due to urbanization.
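Maximum likelihood classification, as named in the abstract, is usually implemented by fitting one Gaussian per class to training pixels and assigning each pixel to the class with the highest likelihood. The sketch below follows that textbook form with equal class priors. The function names, the small covariance regularizer, and the toy two-band data are assumptions for illustration and do not reproduce the study's setup.

```python
import numpy as np

def fit_gaussian_classes(samples, labels):
    """Estimate a mean vector and covariance matrix for each class."""
    params = {}
    for c in np.unique(labels):
        x = samples[labels == c]
        # A small ridge keeps the covariance invertible.
        cov = np.cov(x, rowvar=False) + 1e-6 * np.eye(x.shape[1])
        params[c] = (x.mean(axis=0), cov)
    return params

def classify_max_likelihood(pixels, params):
    """Assign each pixel (one row per pixel) to the most likely class."""
    classes = sorted(params)
    scores = []
    for c in classes:
        mean, cov = params[c]
        diff = pixels - mean
        mahalanobis = np.einsum("ij,jk,ik->i", diff, np.linalg.inv(cov), diff)
        _, logdet = np.linalg.slogdet(cov)
        scores.append(-0.5 * (mahalanobis + logdet))  # log-likelihood up to a constant
    return np.array(classes)[np.argmax(scores, axis=0)]

# Usage with two synthetic spectral classes in a two-band feature space.
rng = np.random.default_rng(0)
train = np.vstack([rng.normal([0, 0], 0.5, (100, 2)),
                   rng.normal([5, 5], 0.5, (100, 2))])
train_labels = np.array([0] * 100 + [1] * 100)
model = fit_gaussian_classes(train, train_labels)
print(classify_max_likelihood(np.array([[0.1, -0.2], [4.8, 5.3]]), model))
```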

Free

Scale Adaptive Object Tracker with Occlusion Handling

Ramaravind K M, Shravan T R, Omkar S N

Scientific article

Real-time object tracking is one of the most crucial tasks in the field of computer vision. Many different approaches have been proposed and implemented to track an object in a video sequence. One possible way is to use the mean shift algorithm, which is considered the simplest and a satisfactorily efficient method for tracking objects despite a few drawbacks. This paper proposes a different approach to solving two typical issues in tracking algorithms like mean shift: (1) adaptively estimating the scale of the object and (2) handling occlusions. The log-likelihood function is used to extract object pixels and estimate the scale of the object. The extreme learning machine is applied to train a radial basis function neural network to search for the object in case of occlusion or local convergence of mean shift. The experimental results show that the proposed algorithm can handle occlusion and estimate object scale effectively with less computational load, making it suitable for real-time implementation.

Free

Scale Space Reduction with Interpolation to Speed up Visual Saliency Detection

Omprakash S. Rajankar, Uttam D.Kolekar

Scientific article

The scale of a salient object in an image is not known a priori; therefore, saliency detection models use multiple-scale analysis to detect salient objects accurately. However, multiple-scale analysis makes saliency detection slow. Fast and accurate saliency detection is essential to obtain the Region of Interest in image processing applications. This paper proposes scale space reduction with interpolation to speed up saliency detection. To demonstrate the concept, this method is integrated with Hypercomplex Fourier Transform saliency detection, which reduced the computational complexity from O(N) to O(N/2).

Free

Scaling of Digital Images by Adaptive and Combined Application of Interpolation Algorithms

Serhiy Balovsyak, Mariana Borcha, Yurii Hnatiuk, Khrystyna Odaiska, Ihor Fodchuk

Scientific article

The article describes the theoretical foundations and software tools for scaling digital images by adaptive and combined application of bilinear and bicubic interpolation algorithms. An analysis of modern algorithms and image scaling tools has been performed. The theoretical foundations of image scaling using interpolation algorithms are described. The root mean square error between the pixel values of the original and scaled images was used as the scaling error. The scaling of images was performed by a complex of two interpolation algorithms. The first algorithm reduces the image scale, after which the second algorithm increases the scale. Such image processing is performed, in particular, in telecommunication systems for transmitting images at reduced scales. A correlation was found between the values of the average spatial period of the image and the relative scaling error, which is equal to the ratio of the scaling errors for different interpolation algorithms. The spatial period of the image was calculated based on its energy spectrum. A regression analysis was performed to determine the dependence of the relative scaling error on the spatial period of the images. It is found that in most cases bicubic interpolation provides a smaller scaling error, but for some images with small spatial period, bilinear interpolation provides a smaller error. It is proposed to increase the scaling accuracy by adaptively selecting the image interpolation algorithm depending on its spatial period. A combined application of interpolation algorithms was performed, which consists of reducing the scale using the bilinear interpolation algorithm and increasing the scale using the bicubic interpolation algorithm. A statistical analysis of the results of image scaling was performed. It was found that the combined application of algorithms in most cases provides a smaller error than the separate application of the bicubic and bilinear interpolation algorithms.

Free

Scene based non-uniformity correction for optical remote sensing imagery

Lolith Gopan, E.Venkateswarlu, Thara Nair, G.P.Swamy, B.Gopala Krishna

Scientific article

In this work, we propose and evaluate different scene-based methods for non-uniformity correction of optical remote sensing data sets. These methods can be used to correct or refine existing radiometric calibrations, thereby improving image quality. The performance of each algorithm on different datasets is analyzed, and a quantitative comparison of different quality parameters (entropy, correlation coefficient, signal-to-noise ratio, peak signal-to-noise ratio and structural similarity index) is carried out to recommend the best method for each scene. For a given data set, the selected method depends on the severity of the non-uniformity, the type of terrain covered, etc.

Free

Score Fusion of SIFT & SURF Descriptors for Face Recognition Using Wavelet Transforms

Musa M.Ameen, Alaa Eleyan

Scientific article

Automatic face recognition is a major research area in computer vision that aims to recognize human faces without human intervention. Significant developments in this field have shown that in many face recognition applications automated techniques outperform humans. The conventional Scale-Invariant Feature Transform (SIFT) and Speeded-Up Robust Features (SURF) are used in face recognition, where they provide high performance. However, this performance can be improved further by transforming the input into different domains before applying the SIFT and SURF algorithms. Hence, we apply the Discrete Wavelet Transform (DWT) or Gabor Wavelet Transform (GWT) to the input face images, which provides denser and additional information to be used by the conventional SIFT or SURF algorithms. The matching scores of SIFT or SURF from each subimage are fused before the final decision is made. Simulations show that the proposed approaches based on wavelet transforms using SIFT or SURF provide very high performance compared to the conventional algorithms.

Free

Score-level-based face anti-spoofing system using handcrafted and deep learned characteristics

Omid. Sharifi

Scientific article

The recognition performance of biometric systems is affected by spoofing attacks made with fake identities. This paper presents a new scheme based on score-level and decision-level fusion to classify individuals as real or fake. The proposed fake detection scheme applies both handcrafted and deep learned techniques to face images to differentiate real and fake individuals. In this approach, a convolutional neural network (CNN) and overlapped histograms of local binary patterns (OVLBP) are used to extract facial features from the images. The matching scores produced by CNN and OVLBP are then combined to form a fused score vector. Finally, the decision on real and attack images is made by combining the decisions of the hybrid scheme using a majority vote of CNN, OVLBP and their fused vector. Experimental results on public spoof databases such as the Print-Attack and Replay-Attack face databases demonstrate the strength of the proposed anti-spoofing method for fake detection.
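The two fusion stages described above can be sketched as follows. The abstract does not state the fusion rule, so the min-max normalization, the weighted-sum rule, and the names `fuse_scores` and `majority_vote` are assumptions chosen for illustration only.

```python
import numpy as np

def min_max_normalize(scores):
    """Rescale scores to [0, 1] so different matchers become comparable."""
    scores = np.asarray(scores, dtype=np.float64)
    span = scores.max() - scores.min()
    if span == 0:
        return np.zeros_like(scores)
    return (scores - scores.min()) / span

def fuse_scores(cnn_scores, ovlbp_scores, weight=0.5):
    """Score-level fusion: weighted sum of the normalized matcher scores."""
    return (weight * min_max_normalize(cnn_scores)
            + (1.0 - weight) * min_max_normalize(ovlbp_scores))

def majority_vote(*decisions):
    """Decision-level fusion: True where more than half the voters say 'real'."""
    votes = np.sum(np.asarray(decisions, dtype=int), axis=0)
    return votes * 2 > len(decisions)

# Usage: three voters (CNN, OVLBP, fused score) decide on three samples.
fused = fuse_scores([0.0, 5.0, 10.0], [1.0, 1.5, 2.0])
cnn_decision = [1, 0, 1]
ovlbp_decision = [1, 1, 0]
fused_decision = (fused > 0.5).astype(int)
print(majority_vote(cnn_decision, ovlbp_decision, fused_decision))
```

With an odd number of voters, as here, the majority vote never ties.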

Free

Seamless Panoramic Image Stitching Based on Invariant Feature Detector and Image Blending

Megha V., Rajkumar K.K.

Scientific article

Image stitching is the method of creating a composite image from several images of the same scene. This paper addresses the issues of generating a seamless panoramic image from a series of photographs of the same scene that vary in scale, orientation and illumination. A feature-based approach is proposed in this paper. The Scale Invariant Feature Transform (SIFT) is used to detect key points in the image. SIFT is both a feature detector and descriptor. The common region between different images is identified by comparing the feature descriptors of each image. A Brute-Force matcher with the KNN algorithm is used for feature matching. The outliers in the matching features are eliminated by the Random Sample Consensus (RANSAC) algorithm. To create a seamless image, an alpha blending operation is applied. Experiments are conducted on UDISD (Unsupervised Deep Image Stitching Data set). The overall performance of the proposed stitching method is evaluated based on metrics such as PSNR, SSIM, RMSE, MSE and UIQI, and the proposed stitching algorithm yields good results with a seamless stitched image.
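The final alpha blending step can be illustrated in isolation. The sketch assumes two grayscale images that have already been aligned (for example by a homography estimated from the RANSAC inliers) and that overlap by a known number of columns. The linear weight ramp and the name `alpha_blend_overlap` are assumptions, since the abstract does not specify the blending weights.

```python
import numpy as np

def alpha_blend_overlap(left, right, overlap):
    """Join two aligned grayscale images, cross-fading across the overlap.

    The weight of the left image falls linearly from 1 to 0 over the
    `overlap` columns shared by both images.
    """
    if overlap <= 0 or overlap > min(left.shape[1], right.shape[1]):
        raise ValueError("overlap must be between 1 and the image width")
    alpha = np.linspace(1.0, 0.0, overlap)  # broadcast along the rows
    blended = alpha * left[:, -overlap:] + (1.0 - alpha) * right[:, :overlap]
    return np.hstack([left[:, :-overlap], blended, right[:, overlap:]])

# Usage: two flat images of different brightness blend without a visible seam.
left = np.full((4, 6), 10.0)
right = np.full((4, 6), 20.0)
panorama = alpha_blend_overlap(left, right, overlap=4)
print(panorama.shape)
print(panorama[0])
```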

Free

Secure Data Transmission in Video Format Based on LSB and Huffman Coding

Shwe Sin Myat Than

Scientific article

The growing need to transmit large amounts of data securely over the internet encourages research into steganography techniques, especially for video files. Steganographic techniques for video offer many advantages for transporting important data because video files are part of people's daily lives and attackers cannot easily notice the hidden data. The high embedding capacity of video files increases the popularity of video steganography among the various media types. Therefore, this paper proposes the simple but advantageous Least Significant Bit (LSB) method, reinforced with the high-compression Huffman chunk coding method, to embed data in a video file using multi-step cryptographic embedding schemes. The intention is to make the system more secure and to increase its embedding capacity. Experiments are carried out with video files and text files of various sizes to show the effectiveness of the proposed methods. The results show superior performance for the proposed algorithm; performance parameters such as Peak Signal to Noise Ratio (PSNR), Mean Square Error (MSE) and Bit Error Rate (BER) are calculated to test the quality of the stego video.
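A minimal sketch of plain LSB embedding in a single video frame, together with the PSNR quality measure, is shown below. It covers only the basic LSB step: the Huffman chunk coding and the multi-step cryptographic scheme from the abstract are not reproduced, and the function names are assumptions introduced for illustration.

```python
import numpy as np

def embed_lsb(cover, payload: bytes):
    """Write the payload bits into the least significant bits of a uint8 frame."""
    bits = np.unpackbits(np.frombuffer(payload, dtype=np.uint8))
    flat = cover.astype(np.uint8).ravel().copy()
    if bits.size > flat.size:
        raise ValueError("payload too large for this cover frame")
    flat[:bits.size] = (flat[:bits.size] & 0xFE) | bits  # clear the LSB, then set it
    return flat.reshape(cover.shape)

def extract_lsb(stego, n_bytes):
    """Read n_bytes back from the least significant bits."""
    bits = stego.ravel()[:n_bytes * 8] & 1
    return np.packbits(bits).tobytes()

def psnr(original, modified):
    """Peak signal-to-noise ratio in dB for 8-bit images."""
    diff = original.astype(np.float64) - modified.astype(np.float64)
    mse = np.mean(diff ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(255.0 ** 2 / mse)

# Usage: hide a short message in a random frame and recover it.
frame = np.random.default_rng(1).integers(0, 256, size=(8, 8, 3), dtype=np.uint8)
stego = embed_lsb(frame, b"secret")
print(extract_lsb(stego, 6))
print(psnr(frame, stego))
```

Since each sample changes by at most one gray level, the mean square error cannot exceed 1, which bounds the PSNR of this scheme below at about 48 dB.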

Free

Secure Transmission and Recovery of Embedded Patient Information from Biomedical Images of Different Modalities through a Combination of Cryptography and Watermarking

Subhajit Koley, Koushik Pal, Goutam Ghosh, Mahua Bhattacharya

Scientific article

In this paper, a new information hiding technique for biomedical images is proposed that combines cryptography and digital watermarking to enhance confidential and authenticated data storage and secure transmission. Here the patient's name and doctor's name are considered the patient's information, which is encrypted using cryptography and embedded in the scan image of that patient through watermarking. The RSA algorithm is used for encryption, and a higher-order-bit LSB replacement technique is used for embedding the information. The private keys are also embedded in the cover image for better security and accurate recovery of the hidden information. The outcome of the proposed methodology shows that the hidden information doesn't affect the cover image and that it can be recovered efficiently even from several noisy images. The strength of the proposed embedding scheme is also supported by several image quality metrics.

Free

Secured Lossy Color Image Compression Using Permutation and Predictions

S.Shunmugan, P.Arockia Jansi Rani

Scientific article

Due to rapid growth in image sizes, visually lossless coding is considered as an alternative to numerically lossless coding to reduce storage size and lower data transmission. In this paper, a lossy compression method for encrypted color images is introduced with undetectable quality loss and a high compression ratio. The proposed method includes the Xinpeng Zhang lossy compression [1], Hierarchical Oriented Prediction (HOP) [2], Uniform Quantization, Negative Sign Removal, Concatenation of 7-bit data and Huffman Compression. The encrypted image is divided into rigid and elastic parts. The Xinpeng Zhang elastic compression is applied to the elastic part and HOP is applied to the rigid part. This method is applied to different test cases and the results are evaluated. The experimental evidence suggests that the proposed method has better coding performance than existing encrypted image compression methods, with a 9.645% reduction in bit rate, and that the result is visually lossless to the eye.

Free

Segment Wise EEG Signal Compression Using LSTM Auto Encoder for Enhanced Efficiency

Uma. M., Mohammed Javidh S., Ruchi Shah, Prabhu Sethuramalingam, M.M. Reddy

Scientific article

Efficient compression of electroencephalogram (EEG) signals is crucial for enabling real-time monitoring, storage, and transmission in various medical and non-medical applications. This paper presents a segment-wise processing approach using temporal modeling-based auto encoders for EEG signal compression. By leveraging models such as Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), and Self-Attention, the proposed method effectively captures temporal dependencies in the EEG data. Segment-wise processing not only enhances compression efficiency but also significantly reduces the processing time of these sequence models. Extensive experiments demonstrate that GRU-based auto encoders offer the best performance, particularly at lower Data Reduction Factors (DRFs), achieving a minimal signal loss of 0.2% at a 50% compression ratio, making it suitable for medical applications. For non-medical scenarios, a higher compression ratio of 75% with a signal loss of 5.4% is found to be acceptable. The results indicate that the proposed approach achieves a favorable balance between compression efficiency, signal fidelity, and computational performance.

Free

Segment-wise Quality Evaluation for Identification of Face Spoofing

Akhilesh Kumar Pandey, Rajoo Pandey

Scientific article

The non-intrusive nature of face-based recognition technology makes it popular on handheld devices. Spoof detection in face-based recognition systems has been an important research topic in the last decade. Among several techniques available in the literature for liveness detection, image quality measure (IQM) based techniques are particularly attractive due to their computational efficiency. In this paper, an approach based on segment-wise computation of image quality measures is proposed to improve the accuracy of detection. Two types of non-overlapping segments are considered here: 1) rectangular segments of identical size, 2) segments based on neighborhood variance. It is found that both approaches exhibit better performance than other techniques without greatly increasing computational complexity. The experiments are carried out with the well-known Replay-Attack database to prove its robustness under different conditions.

Free


Journal