Статьи журнала - International Journal of Image, Graphics and Signal Processing

Все статьи: 1110

Kannada Language Parameters for Speaker Identification with The Constraint of Limited Data

Kannada Language Parameters for Speaker Identification with The Constraint of Limited Data

Nagaraja B.G., H.S. Jayanna

Статья научная

In this paper we demonstrate the impact of language parameter variability on mono, cross and multi-lingual speaker identification under limited data condition. The languages considered for the study are English, Hindi and Kannada. The speaker specific features are extracted using multi-taper mel-frequency cepstral coefficients (MFCC) and speaker models are built using Gaussian mixture model (GMM)-universal background model (UBM). The sine-weighted cepstrum estimators (SWCE) with 6 tapers are considered for multi-taper MFCC feature extraction. The mono and cross-lingual experimental results show that the performance of speaker identification trained and/or tested with Kannada language is decreased as compared to other languages. It was observed that a database free from ottakshara, arka and anukaranavyayagalu results a good performance and is almost equal to other languages.

Бесплатно

Lattice Boltzmann implementation for Fluids Flow Simulation in Porous Media

Lattice Boltzmann implementation for Fluids Flow Simulation in Porous Media

Xinming Zhang

Статья научная

In this paper, the lattice-Boltzmann method is developed to investigate the behavior of isothermal two-phase fluid flow in porous media. The method is based on the Shan–Chen multiphase model of nonideal fluids that allow coexistence of two phases of a single substance. We reproduce some different idealized situations (phase separation, surface tension, contact angle, pipe flow, and fluid droplet motion, et al) in which the results are already known from theory or laboratory measurements and show the validity of the implementation for the physical two-phase flow in porous media. Application of the method to fluid intrusion in porous media is discussed and shows the effect of wettability on the fluid flow. The capability of reproducing critical flooding phenomena under strong wettability conditions is also proved.

Бесплатно

Leaf Vein Extraction Based on Gray-scale Morphology

Leaf Vein Extraction Based on Gray-scale Morphology

Xiaodong Zheng, Xiaojie Wang

Статья научная

Leaf features play an important role in plant species identification and plant taxonomy. The type of the leaf vein is an important morphological feature of the leaf in botany. Leaf vein should be extracted from the leaf in the image before discriminating its type. In this paper a new method of leaf vein extraction has been proposed based on gray-scale morphology. Firstly, the color image of the plant leaf is transformed to the gray image according to the hue and intensity information. Secondly, the gray-scale morphology processing is applied to the image to eliminate the color overlap in the whole leaf vein and the whole background. Thirdly, the linear intensity adjustment is adopted to enlarge the gray value difference between the leaf vein and its background. Fourthly, calculate a threshold with OSTU method to segment the leaf vein from its background. Finally, the leaf vein can be got after some processing on details. Experiments have been conducted with several images. The results show the effectiveness of the method. The idea of the method is also applicable to other linear objects extraction.

Бесплатно

Learning Semantic Image Attributes Using Image Recognition and Knowledge Graph Embeddings

Learning Semantic Image Attributes Using Image Recognition and Knowledge Graph Embeddings

Ashutosh Kumar Tiwari, Sandeep Varma Nadimpalli

Статья научная

Extracting structured knowledge from texts has traditionally been used for knowledge base generation. However, other sources of information, such as images can be leveraged into this process to build more complete and richer knowledge bases. Structured semantic representation of the content of an image and knowledge graph embeddings can provide a unique representation of semantic relationships between image entities. Linking known entities in knowledge graphs and learning open-world images using language models has attracted lots of interest over the years. In this paper, we propose a shared learning approach to learn semantic attributes of images by combining a knowledge graph embedding model with the recognized attributes of images. The proposed model premises to help us understand the semantic relationship between the entities of an image and implicitly provide a link for the extracted entities through a knowledge graph embedding model. Under the limitation of using a custom user-defined knowledge base with limited data, the proposed model presents significant accuracy and provides a new alternative to the earlier approaches. The proposed approach is a step towards bridging the gap between frameworks which learn from large amounts of data and frameworks which use a limited set of predicates to infer new knowledge.

Бесплатно

Learning a Backpropagation Neural Network With Error Function Based on Bhattacharyya Distance for Face Recognition

Learning a Backpropagation Neural Network With Error Function Based on Bhattacharyya Distance for Face Recognition

Naouar Belghini, Arsalane Zarghili, Jamal Kharroubi, Aicha Majda

Статья научная

In this paper, a color face recognition system is developed to identify human faces using Back propagation neural network. The architecture we adopt is All-Class-in-One-Network, where all the classes are placed in a single network. To accelerate the learning process we propose the use of Bhattacharyya distance as total error to train the network. In the experimental section we compare how the algorithm converge using the mean square error and the Bhattacharyya distance. Experimental results indicated that the image faces can be recognized by the proposed system effectively and swiftly.

Бесплатно

Least Significant Bit and Discrete Wavelet Transform Algorithm Realization for Image Steganography Employing FPGA

Least Significant Bit and Discrete Wavelet Transform Algorithm Realization for Image Steganography Employing FPGA

Kalpana Sanjay Shete, Mangal Patil, J. S. Chitode

Статья научная

Steganography is the science that deals with conveying secret information by embedding into the cover object invisibly. In steganography, only the authorized party is aware of the existence of the hidden message to achieve secret communication. The image file is mostly used cover medium amongst various digital files such as image, text, audio and video. The proposed idea of this research work is to develop the robust image steganography. It is implemented using Least Significant Bit and Discrete Wavelet Transform techniques for digital image signal to improve the robustness & evaluate the performance of these algorithms. The parameters such as mean square error (MSE), bit error rate (BER), peak signal to noise ratio (PSNR) and processing time are considered here to evaluate the performance of the proposed work. In the proposed system, PSNR and MSE value ranges from 42 to 46 dB and 1.5 to 3.5 for LSB method respectively. For DWT method these results are further improved as it gives higher PSNR values between 49 to 57 dB and lower MSE values 0.2 to 0.7.

Бесплатно

Left Ventricle Segmentation in Magnetic Resonance Images with Modified Active Contour Method

Left Ventricle Segmentation in Magnetic Resonance Images with Modified Active Contour Method

Maryam Aghai Amirkhizi, Siyamak Haghipour

Статья научная

Desired segmentation of the image is a pivotal problem in image processing. Segmenting the left ventricle (LV) in magnetic resonance images (MRIs) is essential for evaluation of cardiac function. For the segmentation of cardiac MRI several methods have been proposed and implemented. Each of them has advantages and restrictions. A modified region-based active contour model was applied for segmentation of LV chamber. A new semi-automatic algorithm was suggested calculating the appropriate Balloon force according to mean intensity of the region of interest for each image. The database is included of 2,039 MR images collected from 18 children under 18. The results were compared with previous literatures according to two standards: Dice Metric (DM) and Point to Curve (P2C). The obtained segmentation results are better than previously reported values in several literatures. In this study different points were used in cardiac cycle and several slice levels and classified into three levels: Base, Mid. and Apex. The best results were obtained at end diastole (ED) in comparison with end systole (ES), and on base slice than other slices, because of LV bigger size in ED phase and base slice. With segmentation of LV MRI based on novel active contour and application of the suggested algorithm for balloon force calculation, the mean improvement of DM compared to Grosgeorge et al. is 19.6% in ED and 49.5% in ES phase. The mean improvement of P2C compared with the same literature respectively for ED and ES phase is 43.8% and 39.6%.

Бесплатно

Level Set Segmentation of Images using Block Matching Local SVD Operator based Sparsity and TV Regularization

Level Set Segmentation of Images using Block Matching Local SVD Operator based Sparsity and TV Regularization

Kama Ramudu, Gajula Laxmi Bhavani, Manabolu Nishanth, Akula Prakash Raj, Vamshika Analdas

Статья научная

Image segmentation is one of the most important steps in computer vision and image processing. Image segmentation is dividing the image into meaningful regions based on similarity pixels. We propose a new segmentation algorithm based on de-noising of images, good segmentation results depends on the noisy free images. This means that, we may not get the proper segmentation results in the presence of noise. For this, image pre-processing stage is necessary to denoise the image. An image segmentation result depends on the pre-processing results. In this paper, proposed a new integrating approach based on de-noising and segmentation which is called Level Set Segmentation of Images using Block Matching Local SVD Operator Based Sparsity and TV Regularization (BMLSVD-TV). The proposed method is dividing into two stages, in the first stage images are de-noised based on BMLSVDTV algorithm. De-noising images is a crucial aspect of image processing, there are a few factors to keep in mind during image de-noising such as smoothing the flat areas, safeguarding the edges without blurring, and keeping the textures and new artifacts should not be created. Block Matching, Updating of basis vector, Sparsity regularization, and TV regularization. This method searches for blocks that are comparable to each other in block matching. The data in the array demonstrates a high level of correlation after the matching blocks are grouped together. The sparse coefficients will be gathered after adequate modification. Most of the noise in the image will be minimized through the sparsity regularization step by employing different de-noising algorithms such as Block matching 3D using fixed basis vectors. The edge information will be retained and the piecewise smoothness of the image will be produced using the TV regularization step. Later, in the second state create a contour on the de-noised image and evolve the contour based on level Set function (LSF) defined. This combined approach gives better performance for segmenting the image regions over existing level set methods. When compared our proposed level set method over state of art level set methods. The proposed segmentation method is superior in terms of no.of iterations, CPU time and area covered over the existing level set methods. By this model, we obtained a good quality of restored image from noisy image and the performance of the image quality assessed by the two important parameters such as PSNR and Mean Square Error (MSE). The higher value of PSNR and lower value of MSE leads to good quality of image. In this research work, the proposed denoising method got higher PSNR values over existing methods. Where recovering the original image content is essential for effective performance, image denoising is a key component. It is used in a variety of applications, including image restoration, visual tracking, image registration, image segmentation, and image classification. This model is the best segmentation method for accurate segmentation of objects based on denoising images when compared with the other models in the field.

Бесплатно

Level Sets based Directed Surface Extraction

Level Sets based Directed Surface Extraction

Xueshu Liu

Статья научная

Directed surface extraction from CT images is the first task in the design of medical equipment. In this paper a new approach based on level set method is proposed to extract the directed surface from CT images. Two level set functions with corresponding speed functions are involved in this study. One is used to cut the desired bone from the input CT model in which the directed surface, usually the outermost surface, and the complex inner surface are both contained. The other is used to remove the complex inner surface. The experimental results show the feasible of the proposed method.

Бесплатно

Leveraging Convolutional Neural Network and Transfer Learning for Cotton Plant and Leaf Disease Recognition

Leveraging Convolutional Neural Network and Transfer Learning for Cotton Plant and Leaf Disease Recognition

Md. Rayhan Ahmed

Статья научная

Automatic Recognition of Diseased Cotton Plant and Leaves (ARDCPL) using Deep Learning (DL) carries a greater significance in agricultural research. The cotton plant and leaves are severely infected by a disease named Bacterial Blight-affected by bacterium, Xanthomonas axonopodis pv. Malvacearum and a new rolling leaf disease affected by an unorthodox leaf roll dwarf virus. Existing research in ARDCPL requires various complicated image preprocessing, feature extraction approaches and cannot ensure higher accuracy in their detection rates. This work suggests a Deep Convolutional Neural Network (CNN) based DCPLD-CNN model that achieves a higher accuracy by leveraging the DL models ability to extract features from images automatically. Due to the enormous success of numerous pre-trained architectures regarding several image classification task, this study also explores eight CNN based pre-trained architectures: DenseNet121, NasNetLarge, VGG16, VGG19, ResNet50, InceptionV3, InceptionResNetV2, and Xception models by Fine-Tuning them using Transfer Learning (TL) to recognize diseased cotton plant and leaves. This study utilizes those pre-trained architectures by adding extra dense layers in the last layers of those models. Several Image Data Augmentation (IDA) methods were used to expand the training data to increase the model's generalization capability and reduce overfitting. The proposed DCPLD-CNN model achieves an accuracy of 98.77% in recognizing disease in cotton plant and leaves. The customized DenseNet121 model achieved the highest accuracy of 98.60% amongst all the pre-trained architectures. The proposed method's feasibility and practicality were exhibited by several simulated experimental results for this classification task.

Бесплатно

Linear Discriminate Analysis based Robust Watermarking in DWT and LWT Domain with PCA based Statistical Feature Reduction

Linear Discriminate Analysis based Robust Watermarking in DWT and LWT Domain with PCA based Statistical Feature Reduction

Sushma Jaiswal, Manoj Kumar Pandey

Статья научная

With aiming to design a novel image watermarking technique, this paper presents a novel method of image watermarking using lifting wavelet transform, discrete wavelet transform, and one-dimensional linear discriminate analysis. In this blind watermarking technique, statistical features of the watermarked image have been incorporated for preparing the training set and testing set. After that, the principal component analysis is applied to reduce the obtained feature set, so the training time is reduced to the desired level and accuracy is enhanced. The one-dimensional linear discriminate analysis is used for binary classification as it has the ability to classify with good accuracy. This technique applies discrete wavelet transform and lifting wavelet transform in two different watermarking schemes for the image transformation. Both transformations give higher tolerance against image distortion than other conventional transformation methods. One of the significant challenges of a watermarking technique is maintaining the proper balance between robustness and imperceptibility. The proposed blind watermarking technique exhibits the imperceptibility of 43.70 dB for Lena image in case of no attack for the first scheme (using LWT) and 44.71 dB for the second scheme (using DWT+LWT). The first watermarking scheme is tested for robustness, and it is seen that the given scheme is performing well against most of the image attacks in terms of robustness. This technique is compared using some existing similar watermarking methods, and it is found to be robust against most image attacks. It also maintains the excellent quality of the watermarked image.

Бесплатно

Local Content Based Image Authentication for Tamper Localization

Local Content Based Image Authentication for Tamper Localization

L. Sumalatha, V. Venkata Krishna, V. Vijaya Kumar

Статья научная

Digital images make up a large component in the multimedia information. Hence Image authentication has attained a great importance and lead to the development of several image authentication algorithms. This paper proposes a block based watermarking scheme for image authentication based on the edge information extracted from each block. A signature is calculated from each edge block of the image using simple hash function and inserted in the same block. The proposed local edge based content hash (LECH) scheme extracts the original image without any distortion from the marked image after the hidden data have been extracted. It can also detect and localize tampered areas of the watermarked image. Experimental results demonstrate the validity of the proposed scheme.

Бесплатно

Local Entropy-based Non Blind Robust Image Watermarking: Case of Medical Images

Local Entropy-based Non Blind Robust Image Watermarking: Case of Medical Images

Lamri Laouamer, Mohannad Alswailim

Статья научная

Medical image, watermarking, spatial domain, local entropy, imperceptibility, robustness

Бесплатно

Local binary pattern family descriptors for texture classification

Local binary pattern family descriptors for texture classification

E. Jebamalar Leavline, D. Asir Antony Gnana Singh, P. Maheswari

Статья научная

Texture classification is widely employed in many computer vision and pattern recognition applications. Texture classification is performed in two phases namely feature extraction and classification. Several feature extraction methods and feature descriptors have been proposed and local binary pattern (LBP) has attained much attraction due to their simplicity and ease of computation. Several variants of LBP have been proposed in literature. This paper presents a performance evaluation of LBP based feature descriptors namely LBP, uniform LBP (ULBP), LBP variance (LBPV), LBP Fourier histogram, rotated LBP (RLBP) and dominant rotation invariant LBP (DRLBP). For performance evaluation, nearest neighbor classifier is employed. The benchmark OUTEX texture database is used for performance evaluation in terms of classification accuracy and runtime.

Бесплатно

Lossless Image Compression Using A Simplified MED Algorithm with Integer Wavelet Transform

Lossless Image Compression Using A Simplified MED Algorithm with Integer Wavelet Transform

Mohamed M. Fouad, Richard M. Dansereau

Статья научная

In this paper, we propose a lossless (LS) image compression technique combining a prediction step with the integer wavelet transform. The prediction step proposed in this technique is a simplified version of the median edge detector algorithm used with JPEG-LS. First, the image is transformed using the prediction step and a difference image is obtained. The difference image goes through an integer wavelet transform and the transform coefficients are used in the lossless codeword assignment. The algorithm is simple and test results show that it yields higher compression ratios than competing techniques. Computational cost is also kept close to competing techniques.

Бесплатно

Lossy Compression Color Medical Image Using CDF Wavelet Lifting Scheme

Lossy Compression Color Medical Image Using CDF Wavelet Lifting Scheme

I.boukli hacene, M. beladghem, A.bessaid

Статья научная

As the coming era is that of digitized medical information, an important challenge to deal with is the storage and transmission requirements of enormous data, including color medical images. Compression is one of the indispensable techniques to solve this problem. In this work, we propose an algorithm for color medical image compression based on a biorthogonal wavelet transform CDF 9/7 coupled with SPIHT coding algorithm, of which we applied the lifting structure to improve the drawbacks of wavelet transform. In order to enhance the compression by our algorithm, we have compared the results obtained with wavelet based filters bank. Experimental results show that the proposed algorithm is superior to traditional methods in both lossy and lossless compression for all tested color images. Our algorithm provides very important PSNR and MSSIM values for color medical images.

Бесплатно

Loudspeaker operation status monitoring system based on power line communication technology

Loudspeaker operation status monitoring system based on power line communication technology

Biyue Diao, Guoping Chen, Feng He

Статья научная

With the rapid development of science and technology, intelligent systems have been applied to various fields. A monitoring system for the operating status of loudspeakers based on power line communication was designed and implemented. In this paper, firstly analyzes the deficiencies of previous research, and then according to the actual situation, it is concluded that the power line communication technology is more suitable for loudspeaker operating status monitoring than other communication technologies. The overall design, hardware design and software design of the entire system was introduced. And in the last, the reliability of the system were proved by many experiments. This system can be used in other applications in addition to the monitoring of the operating status of the loudspeakers.

Бесплатно

Low-Light Image Enhancement Technology Based on Image Categorization, Processing and Retinex Deep Network

Low-Light Image Enhancement Technology Based on Image Categorization, Processing and Retinex Deep Network

Zhengbing Hu, Oksana Shkurat, Krzysztof Przystupa, Orest Kochan, Marharyta Ivakhnenko

Статья научная

Low-light scenes are characterized by the loss of illumination, the noise, the color distortion and serious information degradation. The low-light image enhancement is a significant part of computer vision technology. The low-light image enhancement methods aim to an image recover to a normal-light image from dark one, a noise-free image from a noisy one, a clear image from distorting one. In this paper, the low-light image enhancement technology based on Retinex-based deep network combined with the image processing-based module is proposed. The proposed technology combines the use of traditional and deep learning methodologies, designed within a simple yet efficient architectural framework that focuses on essential feature extraction. The proposed preprocessing module of low-light image enhancement is centered on the unique knowledge and features of an image. The choice of a color model and a technique of an image transformation depends on an image dynamic range to ensure high results in terms of transfer a color, detail integrity and overall visual quality. The proposed Retinex-based deep network has been trained and tested on transformed images by means of preprocessing module that leads to an effective supervised approach to low-light image enhancement and provide superior performance. The proposed preprocessing module is implemented as an independent image enhancement module in a computer system of an image analysis and as the component module in a neural network system of an image analysis. Experimental results on the low light paired dataset show that the proposed method can reduce noise and artifacts in low-light images, and can improve contrast and brightness, demonstrating its advantages. The proposed approach injects new ideas into low light image enhancement, providing practical applications in challenging low-light scenarios.

Бесплатно

Luminance-based Embedding Approach for Color Image Watermarking

Luminance-based Embedding Approach for Color Image Watermarking

Jamal Ali Hussein

Статья научная

In this paper a new non-blind luminance-based color image watermarking technique is proposed. The original 512×512 color host image is divided into 8×8 blocks, and each block is converted to YCbCr color space. A 32×32 monochrome image is used as a watermark and embedded in the selected blocks of the original image. The selected blocks must have log-average luminance that is closer to the log-average luminance of the image. DCT transform is applied to the Y component of each selected block. Each four values of the watermark image are embedded into each selected block of the host image. The watermark values are embedded in the first four AC coefficients leaving the DC value unchanged. The watermark is extracted from the watermarked image using the same selected blocks and DCT coefficients that have been used in the embedding process. This approach is tested against variety of attacks and filters: such as, highpass, lowpass, Gaussian, median, salt and peppers, and JPEG compression. The proposed approach shows a great ability to preserve the watermark against these attacks.

Бесплатно

Lung Tumor Segmentation and Staging from CT Images Using Fast and Robust Fuzzy C-Means Clustering

Lung Tumor Segmentation and Staging from CT Images Using Fast and Robust Fuzzy C-Means Clustering

Rupak Bhakta, A. B. M. Aowlad Hossain

Статья научная

Lung tumor is the result of abnormal and uncontrolled cell division and growth in lung region. Earlier detection and staging of lung tumor is of great importance to increase the survival rate of the suffered patients. In this paper, a fast and robust Fuzzy c-means clustering method is used for segmenting the tumor region from lung CT images. Morphological reconstruction process is performed prior to Fuzzy c-means clustering to achieve robustness against noises. The computational efficiency is improved through median filtering of membership partition. Tumor masks are then reconstructed using surface based and shape based filtering. Different features are extracted from the segmented tumor region including maximum diameter and the tumor stage is determined according to the tumor staging system of American Joint Commission on Cancer. 3D shape of the segmented tumor is reconstructed from series of 2D CT slices for volume measurement. The accuracy of the proposed system is found as 92.72% for 55 randomly selected images from the RIDER Lung CT dataset of Cancer imaging archive. Lower complexity in terms of iterations and connected components as well as better noise robustness are found in comparison with conventional Fuzzy c-means and k-means clustering techniques.

Бесплатно

Журнал