scholarly journals Multi-Level Thresholding with Fractional-Order Darwinian PSO and Tsallis Function

A novel optimal multi-level thresholding is proposed using gray scale images for Fractional-order Darwinian Particle Swarm Optimization (FDPSO) and Tsallis function. The maximization of Tsallis entropy is chosen as the Objective Function (OF) which monitors FDPSO’s exploration until the search converges to an optimal solution. The proposed method is tested on six standard test images and compared with heuristic methods, such as Bat Algorithm (BA) and Firefly Algorithm (FA). The robustness of the proposed thresholding procedure was tested and validated on the considered image data set with Poisson Noise (PN) and Gaussian Noise (GN). The results obtained with this study verify that, FDPSO offers better image quality measures when compared with BA and FA algorithms. Wilcoxon’s test was performed by Mean Structural Similarity Index (MSSIM), and the results prove that image segmentation is clear even in noisy dataset based on the statistical significance of the FDPSO with respect to BA and FA.

2020 ◽  
Vol 10 (19) ◽  
pp. 6662
Author(s):  
Ji-Won Baek ◽  
Kyungyong Chung

Since the image related to road damage includes objects such as potholes, cracks, shadows, and lanes, there is a problem that it is difficult to detect a specific object. In this paper, we propose a pothole classification model using edge detection in road image. The proposed method converts RGB (red green and blue) image data, including potholes and other objects, to gray-scale to reduce the amount of computation. It detects all objects except potholes using an object detection algorithm. The detected object is removed, and a pixel value of 255 is assigned to process it as a background. In addition, to extract the characteristics of a pothole, the contour of the pothole is extracted through edge detection. Finally, potholes are detected and classified based by the (you only look once) YOLO algorithm. The performance evaluation evaluates the distortion rate and restoration rate of the image, and the validity of the model and accuracy of the classification. The result of the evaluation shows that the mean square error (MSE) of the distortion rate and restoration rate of the proposed method has errors of 0.2–0.44. The peak signal to noise ratio (PSNR) is evaluated as 50 db or higher. The structural similarity index map (SSIM) is evaluated as 0.71–0.82. In addition, the result of the pothole classification shows that the area under curve (AUC) is evaluated as 0.9.


Author(s):  
Pankaj Upadhyay ◽  
Jitender Kumar Chhabra

Image recognition plays a vital role in image-based product searches and false logo identification on e-commerce sites. For the efficient recognition of images, image segmentation is a very important and is an essential phase. This article presents a physics-inspired electromagnetic field optimization (EFO)-based image segmentation method which works using an automatic clustering concept. The proposed approach is a physics-inspired population-based metaheuristic that exploits the behavior of electromagnets and results into a faster convergence and a more accurate segmentation of images. EFO maintains a balance of exploration and exploitation using the nature-inspired golden ratio between attraction and repulsion forces and converges fast towards a globally optimal solution. Fixed length real encoding schemes are used to represent particles in the population. The performance of the proposed method is compared with recent state of the art metaheuristic algorithms for image segmentation. The proposed method is applied to the BSDS 500 image data set. The experimental results indicate better performance in terms of accuracy and convergence speed over the compared algorithms.


2020 ◽  
Vol 10 (11) ◽  
pp. 2707-2713
Author(s):  
Zheng Sun ◽  
Xiangyang Yan

Intravascular photoacoustic tomography (IVPAT) is a newly developed imaging modality in the interventional diagnosis and treatment of coronary artery diseases. Incomplete acoustic measurement caused by limitedview scanning of the detector in the vascular lumen results in under-sampling artifacts and distortion in the images reconstructed by using the standard reconstruction methods. A method for limited-view IVPAT image reconstruction based on deep learning is presented in this paper. A convolutional neural network (CNN) is constructed and trained with computer-simulated image data set. Then, the trained CNN is used to optimize the cross-sectional images of the vessel which are recovered from the incomplete photoacoustic measurements by using the standard time-reversal (TR) algorithm to obtain the images with the improved quality. Results of numerical demonstration indicate that the method can effectively reduce the image distortion and artifacts caused by the limited-view detection. Furthermore, it is superior to the compressed sensing (CS) method in recovering the unmeasured information of the imaging target with the structural similarity around 10% higher than CS reconstruction.


2020 ◽  
Vol 9 (4) ◽  
pp. 1461-1467
Author(s):  
Indrarini Dyah Irawati ◽  
Sugondo Hadiyoso ◽  
Yuli Sun Hariyani

In this study, we proposed compressive sampling for MRI reconstruction based on sparse representation using multi-wavelet transformation. Comparing the performance of wavelet decomposition level, which are Level 1, Level 2, Level 3, and Level 4. We used gaussian random process to generate measurement matrix. The algorithm used to reconstruct the image is . The experimental results showed that the use of wavelet multi-level can generate higher compression ratio but requires a longer processing time. MRI reconstruction results based on the parameters of the peak signal to noise ratio (PSNR) and structural similarity index measure (SSIM) show that the higher the level of decomposition in wavelets, the value of both decreases.


2020 ◽  
Vol 20 (04) ◽  
pp. 2050032
Author(s):  
Rubul Kumar Bania ◽  
Anindya Halder

Mammography imaging is one of the most widely used techniques for breast cancer screening and analysis of abnormalities. However, due to some technical difficulties during the time of acquisition and digital storage of mammogram images, impulse noise may be present. Therefore, detection and removals of impulse noise from the mammogram images are very essential for early detection and further diagnosis of breast cancer. In this paper, a novel adaptive trimmed median filter (ATMF) is proposed for impulse noise (salt & pepper (SNP)) detection and removal with an application to mammogram image denoising. Automatic switching mechanism for updating the Window of Interest (WoI) size from ([Formula: see text]) to ([Formula: see text]) or ([Formula: see text]) is performed. The proposed method is applied on publicly available mammogram images corrupted with varying SNP noise densities in the range 5%–90%. The performance of the proposed method is measured by various quantitative indices like peak signal to noise ratio (PSNR), mean square error (MSE), image enhancement factor (IEF) and structural similarity index measure (SSIM). The comparative analysis of the proposed method is done with respect to other state-of-the-art noise removal methods viz., standard median filter (SMF), decision based median filter (DMF), decision based unsymmetric trimmed median filter (DUTMF), modified decision based unsymmetric trimmed median filter (MDUTMF) and decision based unsymmetric trimmed winsorized mean filter (DUTWMF). The superiority of the proposed method over other compared methods is well evident from the experimental results in terms of the quantitative indices (viz., PSNR, IEF and SSIM) and also from the visual quality of the denoised images. Paired t-test confirms the statistical significance of the higher PSNR values achieved by the proposed method as compared to the other counterpart techniques. The proposed method turned out to be very effective in denoising both high and low density noises present in (mammogram) images.


2019 ◽  
Vol 28 (1) ◽  
pp. 25-34
Author(s):  
Grzegorz Wieczorek ◽  
Izabella Antoniuk ◽  
Michał Kruk ◽  
Jarosław Kurek ◽  
Arkadiusz Orłowski ◽  
...  

In this paper we present a new segmentation method meant for boost area that remains after removing the tumour using BCT (breast conserving therapy). The selected area is a region on which radiation treatment will later be made. Consequently, an inaccurate designation of this region can result in a treatment missing its target or focusing on healthy breast tissue that otherwise could be spared. Needless to say that exact indication of boost area is an extremely important aspect of the entire medical procedure, where a better definition can lead to optimizing of the coverage of the target volume and, in result, can save normal breast tissue. Precise definition of this area has a potential to both improve the local control of the disease and to ensure better cosmetic outcome for the patient. In our approach we use U-net along with Keras and TensorFlow systems to tailor a precise solution for the indication of the boost area. During the training process we utilize a set of CT images, where each of them came with a contour assigned by an expert. We wanted to achieve a segmentation result as close to given contour as possible. With a rather small initial data set we used data augmentation techniques to increase the number of training examples, while the final outcomes were evaluated according to their similarity to the ones produced by experts, by calculating the mean square error and the structural similarity index (SSIM).


2014 ◽  
Vol 704 ◽  
pp. 373-379
Author(s):  
S.K. Lakshmanaprabu ◽  
U. Sabura Banu

Multiloop fractional order PID controller is tuned using Bat algorithm for two interacting conical tank process. Two interacting conical tank process is modelled using mass balance equations. Two Interacting Conical Tank process is a complex system involving tedious interaction. Straight forward multiloop PID controller design involves various steps to design the controller. Due to easy implementation and quick convergence, Bat algorithm is used in recent past for solving continuous non-linear optimization problems to achieve global optimal solution. Bat algorithm, a swarm intelligence technique will be attempted to tune the multiloop fractional order PID controller for two interacting conical tank process. The multi objective optimized multiloop fractional PID controller is tested for tracking, disturbance rejection for minimum Integral time absolute error.


2012 ◽  
Vol 6 (2) ◽  
pp. 34-52 ◽  
Author(s):  
Roli Bansal ◽  
Priti Sehgal ◽  
Punam Bedi

Presented is an efficient watermarking scheme using Particle Swarm Optimization (PSO) to watermark host fingerprint images with their corresponding facial images in the Discrete Cosine Transform (DCT) domain. PSO is used to find the best DCT coefficients’ locations in the host image where the facial image data can be embedded, so that the distortion produced in the host image is minimum. The objective function for PSO is formulated in terms of the Structural Similarity Index (SSIM) and the Orientation Certainty Level Index (OCL) so as to base it on the simple visual effect of the human visual perception capability and correct minutia prediction ability. The results exhibit better watermarked image quality while retaining the feature set of the original fingerprint. Moreover, the proposed technique is robust so that the extraction of watermark is possible even after the watermarked image is exposed to attacks. As a result, at the receiver’s end, the watermarked fingerprint image and the extracted facial image can be verified for a secure and accurate biometric based personal authentication.


2020 ◽  
pp. 1-11
Author(s):  
Ilona Anna Urbaniak ◽  
Macin Wolter

Due to the amount of medical image data being produced and transferred over networks, employing lossy compression has been accepted by worldwide regulatory bodies. As expected, increasing the degree of compression leads to decreasing image fidelity. The extent of allowable irreversible compression is dependent on the imaging modality and the nature of the image pathology as well as anatomy. Interpolation, which often causes image distortion, has been extensively used to rescale images during radiological diagnosis. This work attempts to assess the quality of medical images after the application of lossy compression followed by rescaling. This research proposes a fullreference objective measure of quality for medical images that considers their deterministic and statistical properties. Statistical features are acquired from the frequency domain of the signal and are combined with elements of the structural similarity index (SSIM). The aim is to construct a model that is specialized for medical images and that could serve as a predictor of quality.


2022 ◽  
Vol 2161 (1) ◽  
pp. 012024
Author(s):  
Padmashree Desai ◽  
C Sujatha ◽  
Saumyajit Chakraborty ◽  
Saurav Ansuman ◽  
Sanika Bhandari ◽  
...  

Abstract Intelligent decision-making systems require the potential for forecasting, foreseeing, and reasoning about future events. The issue of video frame prediction has aroused a lot of attention due to its usefulness in many computer vision applications such as autonomous vehicles and robots. Recent deep learning advances have significantly improved video prediction performance. Nevertheless, as top-performing systems attempt to foresee even more future frames, their predictions become increasingly foggy. We developed a method for predicting a future frame based on a series of prior frames that services the Convolutional Long-Short Term Memory (ConvLSTM) model. The input video is segmented into frames, fed to the ConvLSTM model to extract the features and forecast a future frame which can be beneficial in a variety of applications. We have used two metrics to measure the quality of the predicted frame: structural similarity index (SSIM) and perceptual distance, which help in understanding the difference between the actual frame and the predicted frame. The UCF101 data set is used for testing and training in the project. It is a data collection of realistic action videos taken from YouTube with 101 action categories for action detection. The ConvLSTM model is trained and tested for 24 categories from this dataset and a future frame is predicted which yields satisfactory results. We obtained SSIM as 0.95 and perceptual similarity as 24.28 for our system. The suggested work’s results are also compared to those of state-of-the-art approaches, which are shown to be superior.


Sign in / Sign up

Export Citation Format

Share Document