International Journal of Image, Graphics and Signal Processing @ijigsp
Статьи журнала - International Journal of Image, Graphics and Signal Processing
Все статьи: 1110

An Autoencoder-Based Deep Learning Model for Fractographic Characterization of Tungsten Heavy Alloys
Статья научная
Fracture surface analysis is crucial in investigating manufacturing failures and material characterization. Traditional manual inspection methods are slow and subjective, prompting the need for efficient automated tools using advanced computer vision techniques. Recent machine learning models for classifying surface fractures show potentials but struggle due to the lack of large, labeled datasets. This study explores the potential application of autoencoders, a self-supervised neural network, to identify unintended fracture surfaces from anomalous manufacturing of tungsten-heavy alloys. The proposed autoencoder-based model achieves 97% accuracy in distinguishing undesirable fracture patterns by analyzing the reconstruction loss of the images, surpassing existing methods. This high accuracy highlights the autoencoder's ability to automatically extract and reduce dimensional features from fracture surfaces effectively. The experimental result obtained on tungsten-heavy alloys demonstrate the model's potential towards developing autoencoder-based automated tools for fractographic analyses across various materials and operational scenarios.
Бесплатно

Статья научная
The identification of breast cancer stages plays a vital role for understanding the aggressiveness of cancer disease and the patient survival as an outcome. The main parameter of breast cancer staging is counting the mitotic cells in biopsy samples of breast cancer tissues. In the present scenario the manually counting of the mitotic cells in histopathology image slides of the tissue examined by the expert under clinical microscope is 10X, 20X ,40X ,100X,400X magnification of the sample. The manual process is laborious, inaccurate, erroneous and tedious, hence the traditional method demands the computerized approach to recognize and identify the cancer stages for the expert to come up with robust decision. In this work we proposed a novel approach for automatic recognition and identification through computer aided diagnosis systems (CAD). In this CAD proposed model the work is divided into five stages. In the first stage histopathological image are preprocessed to enhance the contrast of the mitotic cells and non mitotic cells using image adjustment technique. In second stage the foreground and background is segmented using Otsu segmentation algorithm. In the third stage the Bit plane slicing is applied to separate the mitotic and non mitotic cells. In the fourth stage the number of mitotic cells is counted in the samples. In the fifth stage of the work, based on the number of mitotic cells the cancer stages are determined. In this work, ICPR 2012 database images are adopted for the experimentation. The diagnosis of the stage of the cancer will help the oncologist to take proper decision and also reduces the burden of the work.
Бесплатно

An Automatic Segmentation of Brain Tumor from MRI Scans through Wavelet Transformations
Статья научная
Fully automatic brain tumor detection is one of the critical tasks in medical image processing. The proposed study discusses the tumor segmentation process by means of wavelet transformation and clustering technique. Initially, MRI brain images are preprocessed by various wavelet transformations to sharpen the images and enhance the tumor region. This helps to quicken the clustering technique since tumor region appears good in sharpened CSF region. Finally, a wavelet decomposition method is applied in CSF region and extracts the tumor portion. This proposed method analyzes the performance of various wavelet types such as Haar, Daubechies (db1, db2, db3, db4 and db5), Coiflet, Morlet and Symlet in MRI scans. Experiments with the proposed method were done on 5 volume datasets collected from the popular brain tumor pools are BRATS2012 and whole brain atlas. The quantitative measures of results were compared using the metrics false alarm (FA) and missed alarm (MA). The results demonstrate that the proposed method obtaining better performance in the terms of both quantity and visual appearance.
Бесплатно

An Automatic Video-based Drowning Detection System for Swimming Pools Using Active Contours
Статья научная
Safety in swimming pools is a crucial issue. In this paper, a real time drowning detection method based on HSV color space analysis is presented which uses prior knowledge of the video sequences to set the best values for the color channels. Our method uses a HSV thresholding mechanism along with Contour detection to detect the region of interest in each frame of video sequences. The presented software can detect drowning person in indoor swimming pools and sends an alarm to the lifeguard rescues if the previously detected person is missing for a specific amount of time. The presented algorithm for this system is tested on several video sequences recorded in swimming pools in real conditions and the results are of high accuracy with a high capability of tracking individuals in real time. According to the evaluation results, the number of false alarms generated by the system is minimal and the maximum alarm delay reported by the system is 2.6 sec which can relatively be reliable compared to the acceptable time for rescue and resuscitation.
Бесплатно

Статья научная
The present paper proposes an innovative technique that classifies human age group in to five categories i.e 0 to 12, 13 to 25, 26 to 45, 46 to 60, and above 60 based on the Topological Texture Features (TTF) of the facial skin. Most of the existing age classification problems in the literature usually derive various facial features on entire image and with large range of gray level values in order to achieve efficient and precise classification and recognition. This leads to lot of complexity in evaluating feature parameters. To address this, the present paper derives TTF’s on Second Order image Compressed and Fuzzy Reduced Grey level (SICFRG) model, which reduces the image dimension from 5 x 5 into 2 x 2 and grey level range without any loss of significant feature information. The present paper assumes that bone structural changes do not occur after the person is fully grown that is the geometric relationships of primary features do not vary. That is the reason secondary features i.e TTF’s are identified and exploited. In the literature few researchers worked on TTF for classification of age, but so far no research is implemented on reduced dimensionality model. The proposed Second order Image Compressed and Fuzzy Reduced Grey level (SICFRG) model reduces overall complexity in recognizing and finding histogram of the TTF on the facial skin. The experimental evidence on FG-NET aging database and Google Images clearly indicates the high classification rate of the proposed method.
Бесплатно

An Efficient Brain Tumor Detection Algorithm Using Watershed & Thresholding Based Segmentation
Статья научная
During past few years, brain tumor segmentation in magnetic resonance imaging (MRI) has become an emergent research area in the field of medical imaging system. Brain tumor detection helps in finding the exact size and location of tumor. An efficient algorithm is proposed in this paper for tumor detection based on segmentation and morphological operators. Firstly quality of scanned image is enhanced and then morphological operators are applied to detect the tumor in the scanned image.
Бесплатно

An Efficient CNN Model for Automatic Diagnosis of Cardiomegaly from Chest Radiographic Images
Статья научная
This work presents an algorithm for the automatic detection of cardiomegaly on CXR images. Cardiomegaly is a medical condition in which the heart becomes enlarged than the actual and the efficiency of the heart would decrease and sometimes congestive heart failure occurs. Although there could be numerous reasons, high blood pressure and coronary artery disease are the main causes of cardiomegaly. Hence, the main intention of this work is to develop a CNN based model to efficiently identify the presence of cardiomegaly abnormality. The learning phase of the model is achieved by using CXRs that are extracted from the publically available “chest x-ray14” medical dataset and to compute the proposed model performance, an experimental platform is designed and implemented in the MATLAB tool. We have trained the model with 100, 120, 150, and 200 epochs. But the trained model with 120 epochs shows a revolutionary outcome. The acquired accuracies of 100,120,150 and 200 epochs are 84.69%, 98.00%, 89.09% and 87.64% respectively. However, many approaches have been developed for cardiomegaly identification but the proposed model shows record performance.
Бесплатно

An Efficient Characterization of Gait for Human Identification
Статья научная
In this work, a simple characterization of human gait, which can be used for surveillance purpose, is presented. Different measures, like leg rise from ground (LRFG), the angles created between the legs with the centroid (ABLC), the distances between the control points and centroid (DBCC) have been taken as different features. In this method, the corner points from the edge of the object in the image have been considered. Out of several corner points thus extracted, a set of eleven significant points, termed as control points, that effectively and rightly characterize the gait pattern, have been selected. The boundary of the object has been considered and using control points on the boundary the centroid of those has been found out. Statistical approach has been used for recognition of individuals based on the n feature vectors, each of size 23(collected from LRFG, ABLCs, and DBCCs) for each video frame, where n is the number of video frames in each gait cycles. It has been found that recognition result of our approach is encouraging with compared to other recent methods.
Бесплатно

An Efficient Deblocking Filter Algorithm for Reduction of Blocking Artifacts in HEVC Standard
Статья научная
The international standard of High Efficiency Video Coding (HEVC) improves the compression ratio over %50 compared with previous standards such as H264/AVC which maintain the same perceptual quality. HEVC has achieved significant coding efficiency improvement beyond existing video coding standard by employing several new coding tools. Deblocking filter, Adaptive Loop Filter (ALF) and Sample Adaptive Offset (SAO) are currently introduced for the HEVC standard. The deblocking filter detects the artifacts at the coded block boundaries and attenuates them by employing a selected filter. However, it was shown that the HEVC encoder may produce visible block artifacts on some sequences. In this paper, we propose an efficient deblocking filter to reduce block artifacts. The simulation results indicate that the maximum increase Peak signal-to-noise ratio (PSNR) of the proposed method is 0.09db, comparing to other deblocking filter algorithm.
Бесплатно

An Efficient Embedding Technique in Image Steganography Using Lucas Sequence
Статья научная
Image steganography have been observed as one of the useful techniques to prevent unintended users to understand what information has been communicated over the network. Many crucial methods of image steganography include Discrete Cosine Transformation (DCT) based approaches. In this paper an efficient algorithm in image steganography is proposed extending DCT based approaches and incorporating number theory. The existing DCT algorithm is modified to increase randomness in the embedding technique with the help of Lucas sequence specifically. The effectiveness of the proposed method has been evaluated by computing Mean Square Error (MSE) and Peak Signal to Noise Ratio (PSNR). Results show that the proposed method has higher embedding capacity and increases a significant level of security by using Lucas sequence in addition to the advantages provided by existing DCT algorithm.
Бесплатно

An Efficient Gait Recognition Approach for Human Identification Using Energy Blocks
Статья научная
Human gait recognition is an emerging research topic in the biometrics research field. It has recently gained a wider interest from machine vision research community because of its rich amount of merits. In this paper, a robust energy blocks based approach is proposed. For each silhouette sequence, gait energy image (GEI) is generated. Then it is split into three blocks, namely lower legs, upper-half and head. Further, Radon transform is applied to three energy blocks separately. Then, standard deviation is used to capture the variation in radial axis angle. Finally, support vector machine classifier (SVM) is effectively used for the classification procedures. The more prominent gait covariates such as multi views, backpack, carrying, least number of frames, clothing and different walking speed conditions are effectively addressed in this work by choosing sequential, even, odd and multiple’s of three numbering frames for each sequence. Extensive experiments are conducted on four considerably large, publicly available standard datasets and the promising results are obtained.
Бесплатно

An Efficient Switching Filter Based on Cubic B-Spline for Removal of Salt-and-Pepper Noise
Статья научная
In this paper, an efficient filter method for salt-and-pepper noise removal is proposed. This method is developed by using cubic B-spline. A noise detector is employed to check whether the selected pixel is noisy or noise free. In this method, noise free pixels are left unaltered. Since not every pixel is filtered, undue distortion can be avoided. Noise pixels are subjected to the filtering operation to reconstruct the intensity values of the noisy pixels. The noise free pixels are only considered in the filter operation. The cubic B-spline is used as a fitting function to generate additional values within the noise free pixels. The noisy pixel is replaced by the mean value of these pixel values. The window size is selected as 3 X 3 in the first step. If all pixels within the window are considered to be noise, then change the selected window size to 5 X 5. If all the pixels within this window are considered to be noise, then the noisy pixel is replaced by the previous resultant pixel. Comparison of the given filter with other existing filters is provided in this paper. The results demonstrate that the proposed technique can obtain better performances than other existing denoising techniques. As a result of this, the proposed method removes the noise effectively even at noise level as high as 97%.
Бесплатно

An Efficient System for Medical Image Retrieval using Generalized Gamma Distribution
Статья научная
Efficient diagnosis plays a crucial role for treatment. In many cases of criticalness, radiologists, doctors prefer to the usage of internet technologies in order to search for similar cases. Accordingly in this paper an effective mechanism of Content Based Image Retrieval (CBIR) is presented, which helps the radiologists/doctors in retrieving similar images from the medical dataset. The paper is presented by considering brain medical images from a medical dataset. Feature vectors are to be extracted efficiently so as to retrieve the images of interest. In this paper a two-way approach is adopted to retrieve the images of relevance from the dataset. In the first step the Probability Density Functions (PDF) are extracted and in the second step the relevant images are extracted using correlation coefficient. The accuracy of the model is tested on a database consisting of 1000 MR images related to brain. The effectiveness of the model is tested using Precision, Recall, Error rate and Retrieval efficiency. The performance of the proposed model is compared to Gaussian Mixture Model (GMM) using quality metrics such as Maximum distance, Mean Squared Error, Signal to Noise Ratio and Jaccard quotient.
Бесплатно

Статья научная
Video steganography is used to conserve the confidential information in various security applications. To give advance protection to the secrete message, pixels locations are optimized using nature inspired algorithm. The input video is separated into a sequence of still image frames then key frames are extracted. The proposed Required Pixel Density (RPD) value calculation and feature extraction are carried out on the extracted frames to perform the frame classification. The frame classification is done using proposed Fractional Water-Earth Worm optimization algorithm based Deep Convolutional Neural Network (FrWEWO-Deep CNN) in order to classify the frames as high, low and medium quality. Thus pixel location prediction is carried out using trained Deep CNN then secret image is hide within high quality frame with Wavelet Transform (WT) and Inverse WT (IWT). Peak Signal to Noise Ratio (PSNR) and Correlation Coefficient (CC) are performance evaluation parameters. For efficient video steganography better imperceptibility and robustness are required. Imperceptibility is a scale of PSNR value showing similarity between original and stego video frames. The robustness of video steganography is measured by CC between embedded and extracted secret images. The proposed algorithm gives enhanced performance is compared with previous state of art such as WEWO-Deep RNN. The PSNR value is progressed from 41.8492 to 46.5728 dB and CC value improved from 0.9660 to 0.9847.
Бесплатно

An Efficient and Generalized approach for Content Based Image Retrieval in MatLab
Статья научная
There is a serious flaw in existing image search engines, since they basically work under the influence of keywords. Retrieving images based on the keywords is not only inappropriate, but also time consuming. Content Based Image Retrieval (CBIR) is still a research area, which aims to retrieve images based on the content of the query image. In this paper we have proposed a CBIR based image retrieval system, which analyses innate properties of an image such as, the color, texture and the entropy factor, for efficient and meaningful image retrieval. The initial step is to retrieve images based on the color combination of the query image, which is followed by the texture based retrieval and finally, based on the entropy of the images, the results are filtered. The proposed system results in retrieving the images from the database which are similar to the query image. Entropy based image retrieval proved to be quite useful in filtering the irrelevant images thereby improving the efficiency of the system.
Бесплатно

An Efficient and Simple Switching Filter for Removal of High Density Salt-and-Pepper Noise
Статья научная
This paper presents an efficient and simple adaptive method for high density salt-and-pepper noise removal. A noise detector is utilized to check whether the selected pixel is noisy or noise free. Noise pixels will then be subjected to the second stage of the filtering action, while the noise free pixels are left unaltered. Since not every pixel is filtered, undue distortion can be avoided. The noise free pixels are only considered in the filter operation for finding the value of the processed pixel. The window size is selected as 3 X 3 in the first step. If all pixels within the window are considered to be noise, then change the selected window size to 5 X 5. If all the pixels within the selected 5 x 5 window are considered to be noise, then the processing pixel is replaced by the previous resultant pixel. This technique requires one nonnoise original image as training image. The key point of the filter operation is based on the solution of the equations system X=A-1B in the nonnoise original image. An algorithm to extract the data from the nonnoisy image and form it in the linear equation system is presented. Comparison of the given filter with other existing filters is provided in this paper. The results demonstrate that the proposed technique can obtain better performances than other existing denoising techniques. The proposed method works well for high-density salt & pepper noise even up to a noise density of 97%.
Бесплатно

An Enhanced Approach for Solving Class Imbalance Problem in Automatic Image Annotation
Статья научная
Classifying an object captured in an image is useful for understanding the contents of the image and annotating it exactly with corresponding tags automatically is the problem faced recently. As the real world data set is highly imbalanced it degrades the performance of automatic image annotation and object detection. To prevail over this drawback we have proposed a new system for pattern matching and annotation which is based on the fusion of principles obtained from Fractal Transform and gentle AdaBoost algorithm. This paper, also tries to overcome deterioration in the performance occurring through imbalance dataset, different orientation, scaling in image annotation by choosing an over sampling method for learning the classifier. The proposed IFSMOTE classifier is initially trained up by setting a threshold value which helps to identify the objects correctly and an over-sampling technique based on fractal is used to classify the imbalanced dataset. Experimental results on the Flicker image dataset have shown superior performance results in terms of precision, recall and F-measure. This paper also presents the comparative results of our proposed system with other traditional image annotation algorithm like SVM, SMOTE and FSMOTE.
Бесплатно

An Enhanced Feature Extraction Technique for Diagnosis of Pathological Problems in Mango Crop
Статья научная
Lack of apparent shape and texture features in disease recognition (Powdery Mildew and Anthracnose) of crop is a key challenge of Agriculture domain in the last few decades. The various soft computing techniques exists in computer vision system still there is need of most efficient methods to meet accuracy. In this work An enhanced Wavelet-PCA based Statistical Feature Extraction technique along with Modified Rotation Kernel Transformation (MRKT) based directional features is proposed in order to address the issues arising in different methodologies for plant disease recognition. This enhanced scheme extracts twenty wavelet features in addition to twelve direction features for different plant parts mango flower, fruit and leaf. This research work is an extended part presents in reference 1 by the authors. The feature set of total 32 features is used to train with Artificial Neural Network to diagnose both Powdery Mildew and Anthracnose disease which occur in the form of Fungus and black spots respectively on different parts of mango plant. The results obtained are found with accuracy of 98.50%, 98.75%, and 98.70% respectively for flower, fruit and leaf .
Бесплатно

An Evaluation and Improved Matching Cost of Stereo Matching Method
Статья научная
The main target of stereo matching algorithms is to find out the three dimensional (3D) distance, or depth of objects from a stereo pair of images. Depth information can be derived from images using disparity map of the same scene. There are many applications of computer vision like People tracking, Gesture recognition, Industrial automation and inspection, Security and Biometrics, Three-dimensional modeling, Web and Cloud, Aerial surveys etc. There are large categories of stereo algorithms which are used for finding the disparity or depth. This paper presents a proposed stereo matching algorithm to obtain depth map, enhance and measure. The hybrid mathematical process of the algorithm are color conversion, block matching, guided filtering, Minimum disparity assignment design, mathematical perimeter, zero depth assignment, combination of hole filling and permutation of morphological operator and last non linear spatial filtering. Our algorithm is produce noise less, reliable, smooth and efficient depth map. We obtained the results with ground truth image using Structural Similarity Index Map (SSIM) and Peak Signal to Noise Ratio (PSNR).
Бесплатно

Статья научная
Scream is recognized as constant and ear-splitting non-linguistic verbal communication that has no phonological structure. This research is based on the study to assess the effect of regional accent on distress screams of women of a very specific age group. The primary goal of this research is to identify the components of non-speech sound so that the region of origin of the speaker can be determined. Furthermore, this research can aid in the development of security techniques based on emotions to prevent and report criminal activities where victims used to yell for help. For the time being, we have limited the study to women because women are the primary victims of all types of criminal’s activities. The Non-Speech corpus has been used to explore different parameters of scream samples collected from three different regions by using high-reliability audio recordings. The detailed investigation is based on the vocal characteristics of female speakers. Further, the investigations have been verified with bi-variate, partial correlation and one-way ANOVA to find out the impact of region-based accent non-speech distress signal. Results from the correlation techniques indicate that out of four attributes only jitter varies with respect to the specific region. Whereas ANOVA depicts that there is no significant regional impact on distress non-speech signals.
Бесплатно