scholarly journals Diagnosis of Multiple Sclerosis Disease in Brain Magnetic Resonance Imaging Based on the Harris Hawks Optimization Algorithm

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Amal F. A. Iswisi ◽  
Oğuz Karan ◽  
Javad Rahebi

The damaged areas of brain tissues can be extracted by using segmentation methods, most of which are based on the integration of machine learning and data mining techniques. An important segmentation method is to utilize clustering techniques, especially the fuzzy C-means (FCM) clustering technique, which is sufficiently accurate and not overly sensitive to imaging noise. Therefore, the FCM technique is appropriate for multiple sclerosis diagnosis, although the optimal selection of cluster centers can affect segmentation. They are difficult to select because this is an NP-hard problem. In this study, the Harris Hawks optimization (HHO) algorithm was used for the optimal selection of cluster centers in segmentation and FCM algorithms. The HHO is more accurate than other conventional algorithms such as the genetic algorithm and particle swarm optimization. In the proposed method, every membership matrix is assumed as a hawk or an HHO member. The next step is to generate a population of hawks or membership matrices, the most optimal of which is selected to find the optimal cluster centers to decrease the multiple sclerosis clustering error. According to the tests conducted on a number of brain MRIs, the proposed method outperformed the FCM clustering and other techniques such as the k -NN algorithm, support vector machine, and hybrid data mining methods in accuracy.

Multiple sclerosis (MS) is among the world’s most common neurologic disorder. Severity classification of MS disease is necessary for treatment and medication dosage decisions and to understand the disease progression. To the best of authors’ knowledge, this is the first study for the severity classification of MS disease. In this study, Rough set (RS) approach is applied to discern the three classes (mild, moderate, and severe) of the severity of MS disease. Furthermore, the performance of the RS approach is compared with Machine learning (ML) classifiers namely, random forest, K-nearest neighbour, and support vector machine. The performance is evaluated on the dataset acquired from Multiple sclerosis outcome assessments consortium (MSOAC), Arizona, US. The weighted average accuracy, precision, recall, and specificity values for the RS approach are found to be 84.04%, 76.99%, 76.75%, and 83.84% respectively. However, among the ML classifiers, the performance of random forest classifier is found best for which the weighted average accuracy, precision, recall, and specificity values are 62.19 %, 52.65 %, 56.84 %, and 59.87 % respectively. The RS approach is found much superior to ML classifiers and may be used for MS disease severity classification. This study may be helpful for the clinicians to assess the severity of the MS patients and to take medication and dosage decisions.


2016 ◽  
Vol 26 (2) ◽  
pp. 467-478 ◽  
Author(s):  
Krzysztof Siwek ◽  
Stanisław Osowski

Abstract The paper discusses methods of data mining for prediction of air pollution. Two tasks in such a problem are important: generation and selection of the prognostic features, and the final prognostic system of the pollution for the next day. An advanced set of features, created on the basis of the atmospheric parameters, is proposed. This set is subject to analysis and selection of the most important features from the prediction point of view. Two methods of feature selection are compared. One applies a genetic algorithm (a global approach), and the other-a linear method of stepwise fit (a locally optimized approach). On the basis of such analysis, two sets of the most predictive features are selected. These sets take part in prediction of the atmospheric pollutants PM10, SO2, NO2 and O3. Two approaches to prediction are compared. In the first one, the features selected are directly applied to the random forest (RF), which forms an ensemble of decision trees. In the second case, intermediate predictors built on the basis of neural networks (the multilayer perceptron, the radial basis function and the support vector machine) are used. They create an ensemble integrated into the final prognosis. The paper shows that preselection of the most important features, cooperating with an ensemble of predictors, allows increasing the forecasting accuracy of atmospheric pollution in a significant way.


2014 ◽  
Vol 989-994 ◽  
pp. 3814-3817
Author(s):  
Zhi Zhang

The abnormal data of communication networks are complex and diverse, it is difficult to recognize and mine the abnormal data accurately and effectively with traditional methods. In order to improve the recognition accuracy of communication network, a data mining algorithms based on the method of communication network abnormal data recognition is presented. Firstly, the communication network data samples are analyzed for fuzzy c-means clustering in order to obtain the degree of membership matrix. Secondly, the training samples of communication network abnormal data mining are selected according to the membership. At last, the training samples are put into the least square support vector machine learning, which establish the model of abnormal data identification in communication network. The performance of the algorithm was tested by simulation tests, and the results show that, the abnormal data recognition efficiency and accuracy in this paper was improved much more than the traditional identification methods.


2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


Author(s):  
Khalid AA Abakar ◽  
Chongwen Yu

This work demonstrated the possibility of using the data mining techniques such as artificial neural networks (ANN) and support vector machine (SVM) based model to predict the quality of the spinning yarn parameters. Three different kernel functions were used as SVM kernel functions which are Polynomial and Radial Basis Function (RBF) and Pearson VII Function-based Universal Kernel (PUK) and ANN model were used as data mining techniques to predict yarn properties. In this paper, it was found that the SVM model based on Person VII kernel function (PUK) have the same performance in prediction of spinning yarn quality in comparison with SVM based RBF kernel. The comparison with the ANN model showed that the two SVM models give a better prediction performance than an ANN model.


2018 ◽  
Vol 3 (1) ◽  
pp. 001
Author(s):  
Zulhendra Zulhendra ◽  
Gunadi Widi Nurcahyo ◽  
Julius Santony

In this study using Data Mining, namely K-Means Clustering. Data Mining can be used in searching for a large enough data analysis that aims to enable Indocomputer to know and classify service data based on customer complaints using Weka Software. In this study using the algorithm K-Means Clustering to predict or classify complaints about hardware damage on Payakumbuh Indocomputer. And can find out the data of Laptop brands most do service on Indocomputer Payakumbuh as one of the recommendations to consumers for the selection of Laptops.


2012 ◽  
Vol 45 (4) ◽  
pp. 41 ◽  
Author(s):  
M. K. Saha ◽  
Santanu Das ◽  
A. Bandyopadhyay ◽  
S. Bandyopadhyay

Sign in / Sign up

Export Citation Format

Share Document