Статьи журнала - International Journal of Image, Graphics and Signal Processing
Все статьи: 1092
Multi Point Search Pattern for Fast Search Motion Estimation of High Resolution Video Coding
Статья научная
Block matching algorithm (BMA) based motion estimation (ME) is most accepted method for removal of temporal redundancy between frames in video coding. With recent advancement in resolution of video, the need of search pattern covering most of macroblocks within search area in frame is increasing. Existing search patterns are tiny and take plenty of time to reach at edge or corner of the search window. With aim of covering nearly every probable candidate macroblocks in all direction and to speed up the search process, multipoint search patterns are presented in this paper. Initial candidate macroblocks are chosen on grid of 12x12 and then search progresses like traditional diamond or hexagon search. Due to multipoint, chances of trapping in incorrect direction is very less and method can exhibit better quality of encoding with optimum number of search points.
Бесплатно
Multi Resolution Analysis for Consonant Classification in Noisy Environments
Статья научная
This paper investigates on the use of Wavelet Transform (WT) to model and recognize the utterances of Consonant – Vowel (CV) speech units in noisy environments. The peculiarity of the proposed method lies in the fact that using WT, non stationary nature of the speech signal can be accurately considered. A hybrid feature extraction namely Normalized Wavelet Hybrid Feature (NWHF) using the combination of Classical Wavelet Decomposition (CWD) and Wavelet Packet Decomposition (WPD) along with z-score normalization technique are studied here. CV speech unit recognition tasks performed for both noisy and clean speech units using Artificial Neural Network (ANN) and k – Nearest Neighborhood (k – NN) are also presented. The result indicates the robustness of the proposed technique based on WT in additive noisy condition.
Бесплатно
Статья научная
This paper mainly studies Multi Band Spectral Subtraction (MBSS) for speech enhancement based on the spectrum representation in the frequency domain with three different scales(linear, log, mel) and their effect on performance measures in presence of additive non-stationary noise at different ranges of input SNR. Since speech is non-stationary signal, noise distribution is non-uniform i.e few frequency components are affected severely than others. A common method to restore the original speech in presence of noise is speech enhancement by suppressing the back ground noise. Multi Band Spectral Subtraction is one among the speech enhancement techniques which performs spectral subtraction by dividing noisy speech spectrum into uniformly spaced non over lapping frequency bands and spectral over subtraction is performed in each band separately. The performance of this method is evaluated in terms of objective measures such as Cepstrum distance, Log Likelihood Ratio, Weighted Spectral Slope distance, segmental SNR and Perceptual Evaluation of Speech Quality.
Бесплатно
Multi class fruit classification using efficient object detection and recognition techniques
Статья научная
In this paper, an efficient approach has been proposed to localize every clearly visible object or region of object from an image, using less memory and computing power. For object detection we have processed every input image to overcome several complexities, which are the main limitations to achieve better result, such as overlap between multiple objects, noise in the image background, poor resolution etc. We have also implemented an improved Convolutional Neural Network based classification or recognition algorithm which has proved to provide better performance than baseline works. Combining these two detection and recognition approaches, we have developed a competent multi-class Fruit Detection and Recognition (FDR) model that is very proficient regardless of different limitations such as high and poor image quality, complex background or lightening condition, different fruits of same shape and color, multiple overlapped fruits, existence of non-fruit object in the image and the variety in size, shape, angel and feature of fruit. This proposed FDR model is also capable of detecting every single fruit separately from a set of overlapping fruits. Another major contribution of our FDR model is that it is not a dataset oriented model which works better on only a particular dataset as it has been proved to provide better performance while applying on both real world images (e.g., our own dataset) and several states of art datasets. Nevertheless, taking a number of challenges into consideration, our proposed model is capable of detecting and recognizing fruits from image with a better accuracy and average precision rate of about 0.9875.
Бесплатно
Статья научная
Misalignment of the camera, some jerk during capture is natural that results some tilt or geometric transformed photo. The accurate recognition on these misaligned facial images is one of the biggest challenges in real time systems. In this paper, a fuzzy enabled multi-parameter based model is presented, which is applied to individual blocks to assign block weights. At first, the model has divided the image into square segments of fixed size. Each segmented division is analyzed under directional, structural and texture features. Fuzzy rule is applied on the obtained quantized values for each segment and to assign weights to each segment. While performing the recognition process, each weighted block is compared with all weighted-feature blocks of training set. A weight-ratio to exactly map and one-to-all map methods are assigned to identify overall matching accuracy. The work is applied on FERET and LFW datasets with rotational, translational and skewed transformation. The comparative observations are taken against KPCA and ICA methods. The proportionate transformation specific observations show that the model has improved the accuracy up to 30% for rotational and skewed transformation and in case of translation the improvement is up to 11%.
Бесплатно
Multi-Metric Based Face Identification with Multi Configuration LBP Descriptor
Статья научная
This paper deals with the performance improvement of a mono modal face identification. A statistical study of various structures of the LBPs (Local Binary Patterns) features associated to two metrics is performed to find out those committing errors on different subjects. Then, during the identification stage, these optimal variants are used, and a simple score level fusion is adopted. The score fusion is done after min-max normalization. The main contribution of this paper consists in the association of multiple LBP schemes with different metrics using simple fusion operation. The overall identification rating up to 99% using AT&T database is achieved.
Бесплатно
Multi-Module Convolutional Neural Network Based Optimal Face Recognition with Minibatch Optimization
Статья научная
Technology is getting smarter day by day and facilitating every part of human life from automatic alarming, automatic temperature, and personalised choice prediction and behaviour recognition. Such technological advancements are using different machine learning techniques for artificial intelligence. Face recognition is also one of the techniques to develop futuristic artificial intelligence-based technology used to get devices equipped with personalised features and security. Face recognition is also used for keeping information of facial data of employees of any company citizens of any country to get tracked and control over crimes in unfair incidents. For making face recognition more reliable and faster, several techniques are evolving every day. One of the fastest and most dependable face recognitions is CNN based face recognition. This work is designed based on the multiple convolutional module-based CNN equipped with batch normalisation and linear rectified unit for normalising and optimising features with minibatch. Faces in CNN’s fully connected layer are classified using the SoftMax classifier. The ORL and Yale face datasets are used for training. The average accuracy achieved is 94.74% for ORL and 96.60% for Yale Datasets. The convolutional neural network training was done for different training percentages, e.g., 66%, 67%, 68%, 69%, 70%, and 80%. The experimental outcomes exhibited that the defined approach had enhanced the face recognition performance.
Бесплатно
MultiBiometric Fusion: Left and Right Irises based Authentication Technique
Статья научная
Biometric science is one of the important applications in the pattern recognition field. There are several modalities used in the biometric applications, among these different traits we choose the iris modality. Therefore, this paper proposes a multi-biometric technique which combines the both units of the iris modality: the left and the right irises. The fusion combines the advantages of the two instances. For the both units of the iris, the segmentation is realized by a modified method and the feature extraction is done by a global approach (the Daubechies wavelets). The Support Vector Machine SVM is used to obtain scores for fusion. Then the scores obtained are normalized by Min-Max method and the fusion is performed at score level by the combination of two methods: a combination method with a classification method. The Fusion is tested using four databases which are: CASIAV4 database, SDUMLA-HMT database, MMU1, and MMU2 databases. The obtained results have confirmed that the multi-biometric systems are better than the mono-modal systems according to their performance.
Бесплатно
Статья научная
The fundamental component of the work contains a summary of the theoretical foundations of the algorithms of the scale-self-similar approach for the analysis of digital Mueller-matrix images of birefringent architectonics of biological tissues. The theoretical consideration of multifractal analysis and determination of singularity spectra of fractal dimensions of coordinate distributions of matrix elements (Mueller-matrix images - MMI) of biological tissue preparations is based on the method of maxima of amplitude modules of the wavelet transform (WTMM). The applied part of the work is devoted to the comparison of diagnostic capabilities for determining the prescription of mechanical brain injury using algorithms of statistical (central statistical moments of the 1st - 4th orders), fractal (approximating curves to logarithmic dependences of power spectra) and multifractal (WTMM) analysis of MMI linear birefringence of fibrillar networks of neurons of nervous tissue. Excellent (~95%) accuracy of differential diagnosis of the prescription of mechanical injury has been achieved.
Бесплатно
Multiple Objects Tracking Using CAMShift Algorithm and Implementation of Trip Wire
Статья научная
In this paper we represent Security application which is developed using concepts of Video Analytics. User can draw Trip wire on video stream with help of Mouse Callback events. Using this application user can restrict any area of total video scene. Direction selection for tripping is also a choice of a user. If any undesired moving object cross this drawn trip wire then motion of this moving object is getting detected and also tracked. If object crosses trip wire in the same direction as that of user selected then Alarm Indication will appear on that moving object. OpenCV library functions are used for motion detection and motion tracking. CAMShift algorithm is implemented for tracking. An experimental result shows Motion detection, Motion Tracking and drawn trip wire on video.
Бесплатно
Myanmar Continuous Speech Recognition System Using Convolutional Neural Network
Статья научная
Translating the human speech signal into the text words is also known as Automatic Speech Recognition System (ASR) that is still many challenges in the processes of continuous speech recognition. Recognition System for Continuous speech develops with the four processes: segmentation, extraction the feature, classification and then recognition. Nowadays, because of the various changes of weather condition, the weather news becomes very important part for everybody. Mostly, the deaf people can’t hear weather news when the weather news is broadcast by using radio and television channel but the deaf people also need to know about that news report. This system designed to classify and recognize the weather news words as the Myanmar texts on the sounds of Myanmar weather news reporting. In this system, two types of input features are used based on Mel Frequency Cepstral Coefficient (MFCC) feature extraction method such MFCC features and MFCC features images. Then these two types of features are trained to build the acoustic model and are classified these features using the Convolutional Neural Network (CNN) classifiers. As the experimental result, The Word Error Rate (WER) of this entire system is 18.75% on the MFCC features and 11.2% on the MFCC features images.
Бесплатно
Статья научная
Super resolution is a technique to enhance the scale of image in digital image processing. The single low resolution and multiple low resolution techniques have been used by many researchers in reconstructing high resolution image. The above resolution increasing techniques are researched under spatial and frequency domain. When increased in the resolution of image, it is very important to retain the quality of image, which is the challenging task in the domain of digital image processing. Here in this paper, the super resolution architecture for single low resolution technique has been proposed to reconstruct the high resolution image by combining interpolation and restoration methods in spatial domain. The modified adaptive bilinear interpolation is proposed for interpolation and contra harmonic mean & adaptive median filter are used for restoration of single low resolution image. The experimentation is done on standard data set show that, the results obtained from modified adaptive bilinear interpolation are competitively improved when compare to other existing single low resolution techniques in interpolation domain.
Бесплатно
Neural Network Synchronous Binary Counter Using Hybrid Algorithm Training
Статья научная
Information processing using Neural Network Counter can result in faster and accurate computation of data due to their parallel processing, learning and adaptability to various environments. In this paper, a novel 4-Bit Negative Edge Triggered Binary Synchronous Up/Down Counter using Artificial Neural Networks trained with hybrid algorithms is proposed. The Counter was built solely using logic gates and flip flops, and then they are trained using different evolutionary algorithms, with a multi objective fitness function using the back propagation learning. Thus, the device is less prone to error with a very fast convergence rate. The simulation results of proposed hybrid algorithms are compared in terms of network weights, bit-value, percentage error and variance with respect to theoretical outputs which show that the proposed counter has values close to the theoretical outputs.
Бесплатно
Статья научная
Fractal analysis is currently in full swing in particular in the medical field because of the fractal nature of natural phenomena (vascular system, nervous system, bones, breast tissue ...). For this, many algorithms for estimating the fractal dimension have emerged. Most of them are based on the principle of box counting. In this work we propose a new method for calculating fractal attributes based on contrast homogeneity and energy that have been extracted from gray level co-occurrence matrix. As application we are investigated in the characterization and classification of mammographic images with SuportVectorMachine classifier. We considered in particular images with tumor masses and architectural disorder to compare with normal ones. We calculate, for comparison the fractal dimension obtained by a reference method (triangular prism) and perform a classification similar to the previous. Results obtained with new algorithm are better than reference method (classification rate is 0.91 vs 0.65). Hence new fractal attributes are relevant.
Бесплатно
New Biometric Approaches for Improved Person Identification Using Facial Detection
Статья научная
Biometrics is measurable characteristics specific to an individual. Face detection has diverse applications especially as an identification solution which can meet the crying needs in security areas. While traditionally 2D images of faces have been used, 3D scans that contain both 3D data and registered color are becoming easier to acquire. Before 3D face images can be used to identify an individual, they require some form of initial alignment information, typically based on facial feature locations. We follow this by a discussion of the algorithms performance when constrained to frontal images and an analysis of its performance on a more complex dataset with significant head pose variation using 3D face data for detection provides a promising route to improved performance.
Бесплатно
New Intelligent-based Approach for the Early Detection of Disorders: Use on Rhinological Data
Статья научная
Medical data are characterized by complexity, inaccuracy, heterogeneity, the presence of hidden dependencies, often their distributions are unknown. Correlations between factors of disorders, including clinical data, parameters of time series, patient’s subjective assessments have a high complexity that cannot be fully comprehended by humans anymore. This problem is extremely important especially in case of the early detection of disorders. Machine learning methods are very useful for such detection task. Special area of interest is a problem of breathing disorders. In the paper, author demonstrates the potential use of computational intelligence tools for rhinologic data processing. Implementation of supervised learning techniques will allow improving accuracy of disorders detection as well as decrease medical insurance company expenses. Proposed intelligent-based approach makes it possible to process a variety of heterogeneous data in the medical domain. A combination of conventional and fractal features for time series of rhinomanometric data as well as inclusion of hydrodynamic characteristics of nasal breathing process provides the best accuracy. Such approach may be modified for other breathing disorders detection.
Бесплатно
New Mean-Variance Gamma Method for Automatic Gamma Correction
Статья научная
Gamma correction is an interesting method for improving image quality in uncontrolled illumination conditions case. This paper presents a new technique called Mean-Variance Gamma (MV-Gamma), which is used for estimating automatically the amount of gamma correction, in the absence of any information about environmental light and imaging device. First, we valued every row and column of image pixels matrix as a random variable, where we can calculate a feature vector of means/variances of image rows and columns. After that, we applied a range of inverse gamma values on the input image, and we calculated the feature vector, for each inverse gamma value, to compare it with the target one defined from statistics of good-light images. The inverse gamma value which gave a minimum Euclidean distance between the image feature vector and the target one was selected. Experiments results, on various test images, confirmed the superiority of the proposed method compared with existing tested ones.
Бесплатно
New automatic target recognition approach based on Hough transform and mutual information
Статья научная
This paper presents a new automatic target recognition approach based on Hough transform and mutual information. The Hough transform groups the extracted edge points in edged images to an appropriate set of lines which helps in features extraction and matching processes in both of target and stored database images. This gives an initial indication about realization and recognition between target image and its corresponding database image. Mutual information is used to emphasize the recognition of the target image and its verification with its corresponding database image. The proposed recognition approach passed through five stages which are: edge detection by Sobel edge detector, thinning as a morphological operation, Hough transformation, matching process and finally measuring the mutual information between target and the available database images. The experimental results proved that, the target recognition is realized and gives more accurate and successful recognition rate than other recent recognition techniques which are based on stable edge weighted HOG.
Бесплатно
Noise Removal From Microarray Images Using Maximum a Posteriori Based Bivariate Estimator
Статья научная
Microarray Image contains information about thousands of genes in an organism and these images are affected by several types of noises. They affect the circular edges of spots and thus degrade the image quality. Hence noise removal is the first step of cDNA microarray image analysis for obtaining gene ex-pression level and identifying the infected cells. The Dual Tree Complex Wavelet Transform (DT-CWT) is preferred for denoising microarray images due to its properties like improved directional selectivity and near shift-invariance. In this paper, bivariate estimators namely Linear Minimum Mean Squared Error (LMMSE) and Maximum A Posteriori (MAP) derived by applying DT-CWT are used for denoising microarray images. Experimental results show that MAP based denoising method outperforms existing denoising techniques for microarray images.
Бесплатно
Noisy Image Decomposition Based On Texture Detecting Function
Статья научная
At present, most of image decomposition models only apply to some ideal images, such as, noise-free, without blurring and super resolution images, and so on. In this paper, they propose a novel decomposition model based on dual method and texture detecting function for noisy image. Firstly, they prove the existence of minimal solutions of the noisy decomposition model functional. Secondly, they write down an alterative implementation algorithm. Finally, they give some numerical experiments, which show that their model can effectively work for Gaussian noisy image decomposition.
Бесплатно