Статьи журнала - International Journal of Image, Graphics and Signal Processing

Все статьи: 1056

Automatically Gradient Threshold Estimation of Anisotropic Diffusion for Meyer’s Watershed Algorithm Based Optimal Segmentation

Automatically Gradient Threshold Estimation of Anisotropic Diffusion for Meyer’s Watershed Algorithm Based Optimal Segmentation

Mithun Kumar PK, Md. Gauhar Arefin, Mohammad Motiur Rahman, A. S. M. Delowar Hossain

Статья научная

Medical image segmentation is a fundamental task in the medical imaging field. Optimal segmentation is required for the accurate judgment or appropriate clinical diagnosis. In this paper, we proposed automatically gradient threshold estimator of anisotropic diffusion for Meyer’s Watershed algorithm based optimal segmentation. The Meyer’s Watershed algorithm is the most significant for a large number of regions separations but the over segmentation is the major drawback of the Meyer’s Watershed algorithm. We are able to remove over segmentation after using anisotropic diffusion as a preprocessing step of segmentation in the Meyer’s Watershed algorithm. We used a fixed window size for dynamically gradient threshold estimation. The gradient threshold is the most important parameter of the anisotropic diffusion for image smoothing. The proposed method is able to segment medical image accurately because of obtaining the enhancement image. The introducing method demonstrates better performance without loss of any clinical information while preserving edges. Our investigated method is more efficient and effective in order to segment the region of interests in the medical images indeed.

Бесплатно

Automating the Process of Work-Piece Recognition and Location for a Pick-and-Place Robot in a SFMS

Automating the Process of Work-Piece Recognition and Location for a Pick-and-Place Robot in a SFMS

R. V. Sharan, G. C. Onwubolu

Статья научная

This paper reports the development of a vision system to automatically classify work-pieces with respect to their shape and color together with determining their location for manipulation by an in-house developed pick-and-place robot from its work-plane. The vision-based pick-and-place robot has been developed as part of a smart flexible manufacturing system for unloading work-pieces for drilling operations at a drilling workstation from an automatic guided vehicle designed to transport the work-pieces in the manufacturing work-cell. Work-pieces with three different shapes and five different colors are scattered on the work-plane of the robot and manipulated based on the shape and color specification by the user through a graphical user interface. The number of corners and the hue, saturation, and value of the colors are used for shape and color recognition respectively in this work. Due to the distinct nature of the feature vectors for the fifteen work-piece classes, all work-pieces were successfully classified using minimum distance classification during repeated experimentations with work-pieces scattered randomly on the work-plane.

Бесплатно

Autonomous Image Segmentation using Density-Adaptive Dendritic Cell Algorithm

Autonomous Image Segmentation using Density-Adaptive Dendritic Cell Algorithm

Vishwambhar Pathak, Praveen Dhyani, Prabhat Mahanti

Статья научная

Contemporary image processing based applications like medical diagnosis automation and analysis of satellite imagery include autonomous image segmentation as inevitable facility. The research done shows the efficiency of an adaptive evolutionary algorithm based on immune system dynamics for the task of autonomous image segmentation. The recognition dynamics of immune-kernels modeled with infinite Gaussian mixture models exhibit the capability to automatically determine appropriate number of segments in presence of noise. In addition, the model using representative density-kernel-parameters processes the information with much reduced space requirements. Experiments conducted with synthetic images as well as real images recorded assured convergence and optimal autonomous model estimation. The segmentation results tested in terms of PBM-index values have been found comparable to those of the Fuzzy C-Means (FCM) for the same number of segments as generated by our algorithm.

Бесплатно

Autonomous Multiple Gesture Recognition System for Disabled People

Autonomous Multiple Gesture Recognition System for Disabled People

Amarjot Singh, John Buonassisi, Sukriti Jain

Статья научная

The paper presents an intelligent multi gesture spotting system that can be used by disabled people to easily communicate with machines resulting into easement in day-to-day works. The system makes use of pose estimation for 10 signs used by hearing impaired people to communicate. Pose is extracted on the basis of silhouettes using timed motion history (tMHI) followed by gesture recognition with Hu-Moments. Signs involving motion are recognized with the help of optical flow. Based on the recognized gestures, particular instructions are sent to the robot connected to system resulting into an appropriate action/movement by the robot. The system is unique as it can act as a assisting device and can communicate in local as well as wide area to assist the disabled person.

Бесплатно

Baker Map's Itinerary Based Image Scrambling Method and Its Watermarking Application in DWT Domain

Baker Map's Itinerary Based Image Scrambling Method and Its Watermarking Application in DWT Domain

Ruisong Ye, Leyi Zhuang

Статья научная

An image scrambling method based on the itinerary of the improved 3D Baker map is proposed in this paper. The standard 3D Baker map is improved by the tent map, so that the itinerary becomes more complicated and can be used to encode the image pixel positions to scramble the image. The scrambling method is applied to the preprocessing in watermarking. The watermark bits are embedded in the discrete wavelet transform (DWT) spectral domain based on the scrambling of watermark, the odd-even adjustment rule and the neighbor mean value. The watermark bits are embedded in medium coefficients in DWT domain of the host image. Experimental results show that the watermarked image looks visually identical to the original one and the watermark can be effectively extracted upon image processing attacks, which demonstrates strong robustness against a variety of attacks.

Бесплатно

Bangla Handwritten Character Recognition using Convolutional Neural Network

Bangla Handwritten Character Recognition using Convolutional Neural Network

Md. Mahbubar Rahman, M. A. H. Akhand, Shahidul Islam, Pintu Chandra Shill, M. M. Hafizur Rahman

Статья научная

Handwritten character recognition complexity varies among different languages due to distinct shapes, strokes and number of characters. Numerous works in handwritten character recognition are available for English with respect to other major languages such as Bangla. Existing methods use distinct feature extraction techniques and various classification tools in their recognition schemes. Recently, Convolutional Neural Network (CNN) is found efficient for English handwritten character recognition. In this paper, a CNN based Bangla handwritten character recognition is investigated. The proposed method normalizes the written character images and then employ CNN to classify individual characters. It does not employ any feature extraction method like other related works. 20000 handwritten characters with different shapes and variations are used in this study. The proposed method is shown satisfactory recognition accuracy and outperformed some other prominent exiting methods.

Бесплатно

Bearing Health Assessment Using Time Domain Analysis of Vibration Signal

Bearing Health Assessment Using Time Domain Analysis of Vibration Signal

Om Prakash Yadav, G. L. Pahuja

Статья научная

Objective: Bearing defects are the most frequently occurring fault in any electrical machine. In this perspective, this manuscript proposed a novel statistical time-domain approach utilizing the vibration signal to detect incipient faults of rolling-element bearing used in three-phase induction motor. Methodology: To detect bearing defects, six time-domain features (TDFs) namely Mean Value (µ), Peak, Root Mean Square (RMS), Crest Factor (CRF), Skewness (SKW) and Kurtosis (K) were extracted from the standard database of the vibration signal. The standard databases of vibration signals were taken from the publicly available datacenter website of Case Western Reserve University (CWRU) relating to healthy, inner raceway and ball defects of bearing. Initially, the mean and standard deviation analysis of each considered TDFs of vibration signals were performed to discriminate the health conditions of bearing. Then, the box or whisker plot method was applied to visualize the variation in each TDF in terms of median and interquartile range (IQR) value for better analysis of bearing defects. Finally, a new index parameter termed as bearing fault index (BFIT) was also computed and this parameter predicts the bearing defects based on the mean of all considered TDFs. Results: The results of the “mean±σ” analysis have depicted that all considered TDFs except µ feature are almost independent to operating loads, and have discerning potential to diagnose bearing defects. The computations of these TDFs are mathematically very simple. The box plot representation of TDFs of vibration databases have shown that peak, RMS, and skewness features outperforms to demarcate bearing health conditions in terms of median and IQR value. The results of quantitative analysis of BFIT parameter have shown that if the magnitude of this parameter is higher than 1.8 then bearing is supposed to be faulty at all operating loads of machine. Thus, the BFIT analysis of TDFs is more simple and reliable to discriminate the health conditions of bearing. As most of the available techniques rely on the multi-processing of vibration data that requires large processing time and complicated mathematical model, so the proposed method prove to be simple and reliable in identifying the incipient bearing defects.

Бесплатно

Bearing fault detection using logarithmic wavelet packet transform and support vector machine

Bearing fault detection using logarithmic wavelet packet transform and support vector machine

Om Prakash Yadav, G. L. Pahuja

Статья научная

Objective: This paper presents an automated approach that combines Fisher ranking and dimensional reduction method as kernel principal component analysis (KPCA) with support vector machine (SVM) to accurately classify the defects of rolling element bearing used in induction motor. Methodology: In this perspective, vibration signal produced by rolling element bearing was decomposed to four levels using wavelet packet decomposition (WPD) method. Thirty one Logarithmic Root Mean Square Features (LRMSF) were extracted from four level decomposed vibration signals. Initially, thirty one features were rank by Fisher score and top ten rank features were selected. For effective detection, top ten features were reduced to a new feature using dimension reduction methods as KPCA and generalized discriminant analysis (GDA). After this, the new feature applied to SVM for binary classification of bearing defects. For analysis of this thirty six standard vibration datasets taken from online available bearing data center website of Case Western Reserve University on bearing conditions like healthy (NF), inner race defect (IR) and ball bearing (BB) defects at different loads. Results: The simulated numerical results show that proposed method KPCA with SVM classifier using Gaussian Kernel achieved an accuracy (AC) of 100, Sensitivity (SE) of 100%, Specificity (SP) of 99.3% and Positive prediction value (PPV) of 99.3% for NF_IRB dataset, and an AC of 100, SE of 99.8%, SP of 100% and PPV of 100% for NF_BBB dataset.

Бесплатно

Bio-chip design using multi-rate system for EEG signal on FPGA

Bio-chip design using multi-rate system for EEG signal on FPGA

Nazifa Tabassum, Sheikh Md. Rabiul Islam, Xu Huang

Статья научная

Digital Signal Processing (DSP) is one of the fastest growing techniques in the electronics industry. The signal-rate system in digital signal processing has evolved the key of fastest speed in digital signal processor. Field Programmable Gate Array (FPGA) offers good solution for addressing the needs of high performance DSP systems. The focus of this paper is on the basic DSP functions, namely filtering signals to remove unwanted frequency bands. Multi-rate Digital Filters (MDFs) are the main theme to build bio-chip design in this paper. For different purposes DSP systems need to change the sampling rate of the signal to achieve some applications. This can be done using multi-rate system where designers can increase or decrease the operating sampling rate. This bio-chip has attractive features like, low requirement of the coefficient word lengths, significant saving in computation time and storage which results in a reduction in its dynamic power consumption. This paper introduces an efficient FPGA realization of multi-rate digital filter with narrow pass-band and narrow transition band to reduce noises and changing the frequency sampling rate by factor which is required according to application. This bio-chip works on bio-signals like EEG signal.

Бесплатно

Bit Serial Architecture for Variable Block Size Motion Estimation

Bit Serial Architecture for Variable Block Size Motion Estimation

Krishna Kaveri Devarinti, T.Sai Lokesh, Gangadhar Vukkesala

Статья научная

H.264/AVC is the latest video coding standard adopting variable block size, quarter-pixel accuracy and motion vector prediction and multi-reference frames for motion estimations. These new features result in higher computation requirements than that for previous coding standards.The computational complexity of motion estimation is about 60% in the H.264/AVC encoder. In this paper most significant bit (MSB first) arithmetic based bit serial Variable Block Size Motion Estimation (VBSME) hardware architecture is proposed. MSB first bit serial architecture main feature is, its early termination SAD computation compared to normal bit serial architectures. With this early termination technique, number computations are reduced drastically. Hence power consumption is also less compared to parallel architectures. An efficient bit serial processing element is proposed and developed 2D architecture for processing of 4x4 block in parallel .Inter connect structure is developed in such way that data reusability is achieved between PEs. Two types of adder trees are employed for variable block size SAD calculation with less number of adders. The proposed architecture can generate up to 41 motion vectors (MVs) for each macroblock. The inter connection complexity between PEs reduced drastically compared to parallel architectures. The architecture supports processing of SDTV (640x480) with 30fps at 172.8 MHz for search range [+8, -7]. We could reduce 14% of computations by using early termination technique.

Бесплатно

Block Texture Pattern Detection Based on Smoothness and Complexity of Neighborhood Pixels

Block Texture Pattern Detection Based on Smoothness and Complexity of Neighborhood Pixels

Amir Farhad Nilizadeh, Ahmad Reza Naghsh Nilchi

Статья научная

In this paper, a novel method for detecting Block Texture Patterns (BTP), based on two measures: smoothness and complexity of neighborhood pixels is proposed. With these two measures, a new classification for texture detection is defined. Texture detection with these measures can be used in many image processing and computer vision applications. As an example, the applicability of BTP on data hiding algorithms is discussed, and the advantages of this classification on these algorithms are shown.

Бесплатно

Block-Based Compressive Sensed Thermal Image Reconstruction using Greedy Algorithms

Block-Based Compressive Sensed Thermal Image Reconstruction using Greedy Algorithms

Usham V. Dias, Milind E. Rane

Статья научная

This paper implements a block based compressive sensing technique for thermal image reconstruction using greedy algorithms. A total of fourteen different sensing patterns were tested for data acquisition. Orthogonal Matching Pursuit (OMP) and Regularized Orthogonal Matching Pursuit (ROMP) with two different thresholds were implemented for image reconstruction with OMP having an edge over ROMP in terms of error and PSNR. ROMP was faster in terms of iterations needed for reconstruction. As the threshold for ROMP was increased the number of iterations needed decreased. Gaussian, Bernoulli and Hadamard patterns were the best for reconstruction. Hadamard matrix, Bernoulli matrix with +/-1 entries and Bernoulli matrix with 0/1 entries have the added advantage of being more conducive for hardware implementation. This paper used Discrete Cosine Transform as the sparsifying basis for reconstruction.

Бесплатно

Blur Classification using Ridgelet Transform and Feed Forward Neural Network

Blur Classification using Ridgelet Transform and Feed Forward Neural Network

Shamik Tiwari, V. P. Shukla, S. R. Biradar, A. K. Singh

Статья научная

The objective of image restoration approach is to recover a true image from a degraded version. This problem can be stated as blind or non-blind depending upon whether blur parameters are known prior to the restoration process. Blind restoration deals with parameter identification before deconvolution. Though there exists multiple blind restorations techniques but blur type recognition is extremely desirable before application of any blur parameters estimation approach. In this paper, we develop a blur classification approach that deploys a feed forward neural network to categories motion, defocus and combined blur types. The features deployed for designing of classification system include mean and standard deviation of ridgelet energies. Our simulation results show the preciseness of proposed method.

Бесплатно

Brain Tissue Classification from Multispectral MRI by Wavelet based Principal Component Analysis

Brain Tissue Classification from Multispectral MRI by Wavelet based Principal Component Analysis

Sindhumol S, Kannan Balakrishnan, Anil Kumar

Статья научная

In this paper, we propose a multispectral analysis system using wavelet based Principal Component Analysis (PCA), to improve the brain tissue classification from MRI images. Global transforms like PCA often neglects significant small abnormality details, while dealing with a massive amount of multispectral data. In order to resolve this issue, input dataset is expanded by detail coefficients from multisignal wavelet analysis. Then, PCA is applied on the new dataset to perform feature analysis. Finally, an unsupervised classification with Fuzzy C-Means clustering algorithm is used to measure the improvement in reproducibility and accuracy of the results. A detailed comparative analysis of classified tissues with those from conventional PCA is also carried out. Proposed method yielded good improvement in classification of small abnormalities with high sensitivity/accuracy values, 98.9/98.3, for clinical analysis. Experimental results from synthetic and clinical data recommend the new method as a promising approach in brain tissue analysis.

Бесплатно

Brain Tumor Boundary Detection by Edge Indication Map Using Bi-Modal Fuzzy Histogram Thresholding Technique from MRI T2-Weighted Scans

Brain Tumor Boundary Detection by Edge Indication Map Using Bi-Modal Fuzzy Histogram Thresholding Technique from MRI T2-Weighted Scans

T. Kalaiselvi, P. Sriramakrishnan, P. Nagaraja

Статья научная

Tumor boundary detection is one of the challenging tasks in the medical diagnosis field. The proposed work constructed brain tumor boundary using bi-modal fuzzy histogram thresholding and edge indication map (EIM). The proposed work has two major steps. Initially step 1 is aimed to enhance the contrast in order to make the sharp edges. An intensity transformation is used for contrast enhancement with automatic threshold value produced by bimodal fuzzy histogram thresholding technique. Next in step 2 the EIM is generated by hybrid approach with the results of existing edge operators and maximum voting scheme. The edge indication map produces continuous tumor boundary along with brain border and substructures (cerebrospinal fluid (CSF), sulcal CSF (SCSF) and interhemispheric fissure) to reach the tumor location easily. The experimental results compared with gold standard using several evaluation parameters. The results showed better values and quality to proposed method than the traditional edge detection techniques. The 3D volume construction using edge indication map is very useful to analysis the brain tumor location during the surgical planning process.

Бесплатно

Brain Tumor Classification Using Back Propagation Neural Network

Brain Tumor Classification Using Back Propagation Neural Network

N. Sumitra, Rakesh Kumar Saxena

Статья научная

The conventional method for medical resonance brain images classification and tumors detection is by human inspection. Operator-assisted classification methods are impractical for large amounts of data and are also non-reproducible. Hence, this paper presents Neural Network techniques for the classification of the magnetic resonance human brain images. The proposed Neural Network technique consists of the following stages namely, feature extraction, dimensionality reduction, and classification. The features extracted from the magnetic resonance images (MRI) have been reduced using principles component analysis (PCA) to the more essential features such as mean, median, variance, correlation, values of maximum and minimum intensity. In the classification stage, classifier based on Back-Propagation Neural Network has been developed. This classifier has been used to classify subjects as normal, benign and malignant brain tumor images. The results show that BPN classifier gives fast and accurate classification than the other neural networks and can be effectively used for classifying brain tumor with high level of accuracy.

Бесплатно

Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

A.B.M. Aowlad Hossain, Jannatul Kamrun Nisha, Fatematuj Johora

Статья научная

Ultrasound based breast screening is gaining attention recently especially for dense breast. The technological advancement, cancer awareness, and cost-safety-availability benefits lead rapid rise of breast ultrasound market. The irregular shape, intensity variation, and additional blood vessels of malignant cancer are distinguishable in ultrasound images from the benign phase. However, classification of breast cancer using ultrasound images is a difficult process owing to speckle noise and complex textures of breast. In this paper, a breast cancer classification method is presented using VGG16 model based transfer learning approach. We have used median filter to despeckle the images. The layers for convolution process of the pretrained VGG16 model along with the maxpooling layers have been used as feature extractor and a proposed fully connected two layers deep neural network has been designed as classifier. Adam optimizer is used with learning rate of 0.001 and binary cross-entropy is chosen as the loss function for model optimization. Dropout of hidden layers is used to avoid overfitting. Breast Ultrasound images from two databases (total 897 images) have been combined to train, validate and test the performance and generalization strength of the classifier. Experimental results showed the training accuracy as 98.2% and testing accuracy as 91% for blind testing data with a reduced of computational complexity. Gradient class activation mapping (Grad-CAM) technique has been used to visualize and check the targeted regions localization effort at the final convolutional layer and found as noteworthy. The outcomes of this work might be useful for the clinical applications of breast cancer diagnosis.

Бесплатно

Breast lesion segmentation and area calculation for MR images

Breast lesion segmentation and area calculation for MR images

Gökçen Çetinel, Sevda Gül

Статья научная

In this paper, our goal is to determine the boundaries of lesion and then calculate the area of existing lesion in breast magnetic resonance (MR) images to provide a useful information to the radiologists. For this purpose, at first stage region growing (RG) method and active contour model (Snake) is applied to the images to make the boundaries of lesion visible. RG method is one of the simplest approaches for image segmentation and provides accurate results with lower computation time due to its seed point initialization step. Snake method molds a closed contour to the boundary of a region in an image and is also popular in medical image segmentation studies. In the presented study, both of these methods are utilized to determine the lesion boundaries. After determining the boundaries of lesion accurately in the second stage of the study, bit-quad method is applied to the segmented images. Bit quad method is used to compute the area and perimeter of binary lesion images based on matching the logical state of regions of image to binary patterns. Finally, to evaluate the performance of the proposed study, computer simulations are performed. It is demonstrated via computer simulations that the lesion area and parameter values are very close to real values. By means of this study it is aimed to support radiologists during diagnosis and assessment of breast lesions.

Бесплатно

Building a Medium Scale Dataset for Non-destructive Disease Classification in Mango Fruits Using Machine Learning and Deep Learning Models

Building a Medium Scale Dataset for Non-destructive Disease Classification in Mango Fruits Using Machine Learning and Deep Learning Models

Vani Ashok, Bharathi R.K., Palaiahnakote Shivakumara

Статья научная

The growing quality and safety concern about fresh agricultural produce among consumers have led to the development of non-destructive quality assessment and testing techniques of fruits and vegetables. Humans judge the quality of fruits based on sensory attributes like taste, aroma etc. The shape, size, color, presence of defects which are external to fruits also influence the degree of consumer acceptability of produce. The traditional time consuming, manual fruit quality inspection is replaced by automated, fast, consistent, non-destructive techniques using computer vision in combination with learning algorithms. But the lack of benchmark datasets for agricultural produce has made an objective comparison of the proposed methods difficult. Hence, the proposed work aims to build a medium scale dataset for mango fruits of “Alphonso” cultivar with three classes: chilling injury, defective and non-defective. The reliability of the proposed dataset consisting of 2279 color images of mango fruits with 736 samples in chilling injury class, 632 samples in defective class and 911 samples in non-defective class, was established using a novel approach of developing a predictive model based on discriminant function analysis (DFA) which assigns group membership to each sample of the dataset. Extensive benchmarking analysis is established on the validated dataset using statistical and deep learning algorithms like support vector machine (SVM) and convolutional neural network (CNN), respectively. SVM achieved significant disease classification accuracy of 95% and 91.52% accuracy was achieved by custom CNN. The results of the proposed work indicate that the proposed color image dataset of mango fruits can be used as a benchmark dataset by other researchers for objective comparison in quality evaluation of mango fruits.

Бесплатно

COVID-19 Automatic Detection from CT Images through Transfer Learning

COVID-19 Automatic Detection from CT Images through Transfer Learning

B. Premamayudu, Chavala Bhuvaneswari

Статья научная

Identification of COVID-19 may help the community and patient to prevent the disease containment and plan to attend disease in right time. Deep neural network models widely used to analyze the medical images of COVID-19 for automatic detection and give the decision support for radiologists to summarize the accurate remarks. This paper proposed deep transfer learning for chest CT scan images to detection and diagnosis of COVID-19. VGG19, InceptionRestNetV3, InceptionV3 and DenseNet201 neural network used for automatic detection of COVID-19 disease form CT scan images (SARS-CoV-2 CT scan Dataset). Four deep transfer learning models were developed, tested and compared. The main objective of this paper is to use pre-trained features and converge pre-trained features with targeted features to improve the classification accuracy. It is observed that DenseNet201 noted the best performance and the classification accuracy is 99.98% for 300 epochs. The findings of the experiments show that the deeper networks struggle to train adequately and give less consistency when there is limited data. The DenseNet201 model adopted for COVID-19 identification from lung CT scans has been intensively optimized with optimal hyper parameters and performs at noteworthy levels with precision 99.2%, recall 100%, specificity 99.2%, and F1 score 99.2%.

Бесплатно

Журнал