Use of Particle Swarm Optimization for Feature Selection and Data Mining Methods for Efficient Detection of Automobile Insurance Fraud

One of the key tasks in data mining is selecting relevant features from high-dimensional datasets. Doing so is expected to reduce time and space complexity and, consequently, to improve the performance of data mining algorithms on tasks such as classification. This paper presents an empirical study of how particle swarm optimization, used as a feature selection technique, affects the performance of classification algorithms. Two datasets from different domains were used: an SMS spam detection dataset and sentiment analysis datasets. Particle swarm optimization is applied to the datasets for feature selection. Both the reduced and the raw datasets are then classified separately using a C4.5 decision tree, k-nearest neighbour, and a support vector machine. The results show that the improvement in classifier performance is case-dependent: significant improvements are observed on the sentiment analysis datasets but not on the SMS spam dataset. Although the effect on performance is sometimes only marginal, the results imply that particle swarm optimization feature selection reduces space complexity while maintaining classifier accuracy.

Keywords: classification, feature selection, machine learning, particle swarm optimization, text mining
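
The abstract above describes wrapper-style feature selection: a swarm searches over feature subsets, and each subset is scored by a classifier. A minimal sketch of that idea follows, assuming a binary PSO with a sigmoid transfer function and a leave-one-out 1-nearest-neighbour score. None of these choices are taken from the paper: the function names, the toy data, the fitness weighting `alpha`, and the swarm parameters `w`, `c1`, `c2` are all illustrative assumptions.

```python
import math
import random


def make_toy_data(n_samples=40, n_features=5, seed=1):
    """Hypothetical toy data: feature 0 separates the classes, the rest is noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [label * 4.0 + rng.gauss(0, 0.5)]
        row += [rng.gauss(0, 1) for _ in range(n_features - 1)]
        X.append(row)
        y.append(label)
    return X, y


def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier on the kept features."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for i in range(len(X)):
        best_d, best_label = float("inf"), None
        for k in range(len(X)):
            if k == i:
                continue
            d = sum((X[i][j] - X[k][j]) ** 2 for j in cols)
            if d < best_d:
                best_d, best_label = d, y[k]
        correct += best_label == y[i]
    return correct / len(X)


def fitness(X, y, mask, alpha=0.9):
    """Assumed objective: mostly accuracy, plus a small reward for fewer features."""
    return alpha * loo_1nn_accuracy(X, y, mask) + (1 - alpha) * (1 - sum(mask) / len(mask))


def binary_pso(X, y, n_particles=10, n_iters=30, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Search for a 0/1 feature mask; returns (best_mask, best_fitness)."""
    rng = random.Random(seed)
    n = len(X[0])
    pos = [[rng.randint(0, 1) for _ in range(n)] for _ in range(n_particles)]
    vel = [[rng.uniform(-1, 1) for _ in range(n)] for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(X, y, p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for j in range(n):
                r1, r2 = rng.random(), rng.random()
                # Standard velocity update: inertia + cognitive pull + social pull.
                v = (w * vel[i][j]
                     + c1 * r1 * (pbest[i][j] - pos[i][j])
                     + c2 * r2 * (gbest[j] - pos[i][j]))
                vel[i][j] = max(-6.0, min(6.0, v))  # clamp to keep the sigmoid stable
                # The sigmoid of the velocity is the probability that the bit is 1.
                prob = 1.0 / (1.0 + math.exp(-vel[i][j]))
                pos[i][j] = 1 if rng.random() < prob else 0
            f = fitness(X, y, pos[i])
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit


X, y = make_toy_data()
best_mask, best_fit = binary_pso(X, y)
print("selected feature mask:", best_mask, "fitness:", round(best_fit, 3))
```

In a study like the one described, the classifier inside `fitness` would be replaced by the classifier under evaluation, and the final accuracy would be measured on data not used during the search.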

A Data Mining Model to predict and analyze the events related to Coronary Heart Disease using Decision Trees with Particle Swarm Optimization for Feature Selection

International Journal of Computer Applications ◽

10.5120/8779-2736 ◽

2012 ◽

Vol 55 (8) ◽

pp. 49-55 ◽

Cited By ~ 4

Author(s):

A. SheikAbdullah

Keyword(s):

Data Mining ◽

Coronary Heart Disease ◽

Feature Selection ◽

Particle Swarm Optimization ◽

Heart Disease ◽

Decision Trees ◽

Particle Swarm ◽

Swarm Optimization ◽

Mining Model

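Prediksi Nasabah Yang Berpotensi Membuka Simpanan Deposito Menggunakan Naive Bayes Berbasis Particle Swarm Optimization [Predicting Customers Likely to Open a Time Deposit Using Particle Swarm Optimization-Based Naive Bayes]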

Techno Com ◽

10.33633/tc.v17i2.1648 ◽

2018 ◽

Vol 17 (2) ◽

pp. 208-219

Author(s):

Alvino Dwi Rachman Prabowo ◽

Muljono Muljono

Keyword(s):

Data Mining ◽

Feature Selection ◽

Particle Swarm Optimization ◽

Decision Support ◽

Decision Support System ◽

Support System ◽

Naive Bayes ◽

Particle Swarm ◽

Naïve Bayes ◽

Swarm Optimization

Time deposits remain a primary investment choice for the public, which gives banks an opportunity to devise more efficient, lower-cost marketing and promotion strategies that attract people to invest in their deposit products. Motivated by this problem, this study predicts which customers are likely to open a time deposit using data mining techniques, specifically the Naive Bayes algorithm based on PSO. In this study, PSO is used for feature selection: attributes are assigned weights and the best attributes are selected, in order to improve the accuracy of the Naive Bayes predictions. Predicting customers likely to open a deposit with Naive Bayes alone achieved an accuracy of 82.19%, whereas the PSO-based Naive Bayes achieved an accuracy of 89.70%. Using PSO therefore increased accuracy by 7.51 percentage points, and the PSO-based Naive Bayes algorithm can be used in a decision support system for identifying customers likely to open a deposit, as it was the best-performing model.

Adaptive Particle Swarm Optimization based Credentialed Extreme Learning Machine Classifier (APSO-CELMC) for High Dimensional Datasets

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j1029.08810s19 ◽

2019 ◽

Vol 8 (10S) ◽

pp. 157-163

Keyword(s):

Data Mining ◽

Feature Selection ◽

Particle Swarm Optimization ◽

Extreme Learning Machine ◽

Particle Swarm ◽

Selection Task ◽

Swarm Optimization ◽

Adaptive Particle Swarm Optimization ◽

Computer Science Research ◽

Learning Machine

Data mining is a key research field in computer science. Feature selection is performed once the dataset has been cleansed, and optimization algorithms are considered helpful for this task. The selected features, in turn, contribute considerably to classifier performance. Machine learning classifiers perform comparatively better than traditional data mining classification algorithms. In this work, an adaptive particle swarm optimization algorithm is employed to perform the feature selection task, and an extreme learning machine classifier is augmented with credential weights. Twenty datasets are used for performance analysis. The results indicate that the Adaptive Particle Swarm Optimization based Credentialed Extreme Learning Machine Classifier (APSO-CELMC) performs better in terms of predictive accuracy and classification time.

An Improved Intelligent Approach to Enhance the Sentiment Classifier for Knowledge Discovery Using Machine Learning

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327910999200528114552 ◽

2020 ◽

Vol 10 (4) ◽

pp. 582-593

Author(s):

Midde Venkateswarlu Naik ◽

D. Vasumathi ◽

A.P. Siva Kumar

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Global Warming ◽

Particle Swarm Optimization ◽

Sentiment Analysis ◽

Optimization Technique ◽

Particle Swarm ◽

Sentiment Classification ◽

Support Vector ◽

Swarm Optimization

Aims: The proposed research work presents an evolutionary enhanced method for sentiment or emotion classification of unstructured review text in the big data field. Sentiment analysis plays a vital role for the current generation in extracting valid decision points about aspects such as movie ratings, educational institutes, or political ratings. The proposed hybrid approach combines optimal feature selection using Particle Swarm Optimization (PSO) with sentiment classification using a Support Vector Machine (SVM). Its performance is evaluated with statistical measures such as precision, recall, sensitivity, and specificity, and compared with existing approaches. Earlier authors have achieved sentiment classifier accuracy of up to 94% on English text. In the proposed scheme, the sentiment classifier reached an average accuracy of 99% across different datasets by tuning SVM parameters, such as the constant C and the kernel gamma value, in association with the PSO optimization technique. The proposed method used three publicly available datasets: airline sentiment, weather, and global warming. The experimental results were obtained by training and testing with 10-fold cross-validation (FCV), and a confusion matrix was used to assess sentiment classifier accuracy.

Background: Sentiment analysis plays a vital role for the current generation in extracting valid decisions about aspects such as movie ratings, educational institutes, or even political ratings. Sentiment Analysis (SA), or opinion mining, has become a scientifically attractive research domain. The key area is sentiment classification of semi-structured or unstructured data in different languages, which has become a major research topic. User-Generated Content (UGC) from different sources has increased significantly with the rapid growth of the web. The huge volume of user-generated data on social media provides substantial value for discovering hidden knowledge, correlations, patterns, and trends, or for extracting sentiment about a specific entity. SA is a computational analysis that determines the opinion about an entity as expressed in text. It is also described as the computation of the emotional polarity expressed on social media as natural text in various languages. Usually, the best automatic sentiment classifier model depends on both the feature selection and the classification algorithms.

Methods: The proposed work uses a support vector machine as the classification technique and particle swarm optimization for feature selection. In this methodology, various permutations and combinations of parameters are tuned to obtain the desired results, with and without kernel techniques, for sentiment classification on three datasets (airline, global warming, and weather sentiment) that are freely hosted for research purposes.

Results: The proposed method outperformed other machine learning techniques, with 99.2% average accuracy in classifying sentiment on the different datasets. The high accuracy attained in classifying sentiment or opinion in review text indicates superior effectiveness over existing sentiment classifiers. The experimental results were obtained by training and testing with 10-fold cross-validation (FCV), and a confusion matrix was used to assess sentiment classifier accuracy.

Conclusion: The objective of the research, improving sentiment classifier accuracy, was achieved with the help of a kernel-based Support Vector Machine (SVM) with parameter optimization. The optimal feature selection for classifying sentiment or opinion in review documents was determined with a particle swarm optimization approach. The proposed method used three freely available datasets to simulate the results: airline sentiment data, weather sentiment data, and global warming data.
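
The evaluation protocol named in this abstract (10-fold cross-validation, with precision, recall/sensitivity, and specificity read off a confusion matrix) can be sketched in a few lines. This is only an illustration under stated assumptions: the helper names are hypothetical, the folds are contiguous (so the data is assumed to be shuffled or interleaved by class beforehand), and `nearest_mean_fit_predict` is a deliberately trivial stand-in classifier, not the PSO-tuned SVM the authors describe.

```python
def k_fold_indices(n_samples, k=10):
    """Split range(n_samples) into k contiguous folds whose sizes differ by at most one."""
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for f in range(k):
        size = base + (1 if f < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, tn, fp, fn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return tp, tn, fp, fn


def metrics(tp, tn, fp, fn):
    """Standard confusion-matrix measures; recall is the same quantity as sensitivity."""
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
    }


def cross_validate(X, y, fit_predict, k=10):
    """Pool the held-out predictions from all k folds and score them once."""
    y_pred = [None] * len(y)
    for test_idx in k_fold_indices(len(y), k):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        preds = fit_predict([X[i] for i in train_idx],
                            [y[i] for i in train_idx],
                            [X[i] for i in test_idx])
        for i, p in zip(test_idx, preds):
            y_pred[i] = p
    return metrics(*confusion_counts(y, y_pred))


def nearest_mean_fit_predict(X_train, y_train, X_test):
    """Hypothetical stand-in classifier (NOT an SVM): pick the nearest class mean on feature 0."""
    means = {}
    for label in set(y_train):
        vals = [x[0] for x, t in zip(X_train, y_train) if t == label]
        means[label] = sum(vals) / len(vals)
    return [min(means, key=lambda c: abs(x[0] - means[c])) for x in X_test]


# Toy data with classes interleaved so every training split contains both labels.
X = [[(i % 2) * 10.0 + (i % 3) * 0.1] for i in range(20)]
y = [i % 2 for i in range(20)]
print(cross_validate(X, y, nearest_mean_fit_predict, k=10))
```

Any classifier can be passed as `fit_predict` as long as it takes training features, training labels, and test features and returns one predicted label per test row.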

Ensemble based filter feature selection with harmonize particle swarm optimization and support vector machine for optimal cancer classification

Machine Learning with Applications ◽

10.1016/j.mlwa.2021.100054 ◽

2021 ◽

pp. 100054

Author(s):

Tengku Mazlin Tengku Ab Hamid ◽

Roselina Sallehuddin ◽

Zuriahati Mohd Yunos ◽

Aida Ali

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Particle Swarm Optimization ◽

Particle Swarm ◽

Cancer Classification ◽

Support Vector ◽

Swarm Optimization

Novel self-adjusted particle swarm optimization algorithm for feature selection

Computing ◽

10.1007/s00607-020-00891-w ◽

2021 ◽

Author(s):

Bo Wei ◽

Xuan Wang ◽

Xuewen Xia ◽

Mingfeng Jiang ◽

Zuohua Ding ◽

...

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Optimization Algorithm ◽

Particle Swarm Optimization Algorithm ◽

Particle Swarm ◽

Swarm Optimization

A Novel Feature Selection Approach based on Binary Particle Swarm Optimization and Ensemble Learning for Heterogeneous Defect Prediction

2021 3rd Asia Pacific Information Technology Conference ◽

10.1145/3449365.3449384 ◽

2021 ◽

Author(s):

Ruchika Malhotra ◽

Anmol Budhiraja ◽

Abhinav Kumar Singh ◽

Ishani Ghoshal

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Ensemble Learning ◽

Particle Swarm ◽

Defect Prediction ◽

Binary Particle Swarm Optimization ◽

Swarm Optimization ◽

Selection Approach ◽

Feature Selection Approach ◽

Heterogeneous Defect Prediction

Chunking and cooperation in particle swarm optimization for feature selection

Annals of Mathematics and Artificial Intelligence ◽

10.1007/s10472-021-09752-4 ◽

2021 ◽

Author(s):

Malek Sarhani ◽

Stefan Voß

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Cooperative Learning ◽

Optimization Problems ◽

Practical Importance ◽

Particle Swarm ◽

Large Datasets ◽

Social Phenomena ◽

Swarm Optimization ◽

Complex Optimization

Abstract: Bio-inspired optimization aims at adapting observed natural behavioral patterns and social phenomena towards efficiently solving complex optimization problems, and is nowadays gaining much attention. However, researchers recently highlighted an inconsistency between the need in the field and the actual trend. Indeed, while nowadays it is important to design innovative contributions, an actual trend in bio-inspired optimization is to re-iterate the existing knowledge in a different form. The aim of this paper is to fill this gap. More precisely, we start first by highlighting new examples for this problem by considering and describing the concepts of chunking and cooperative learning. Second, by considering particle swarm optimization (PSO), we present a novel bridge between these two notions adapted to the problem of feature selection. In the experiments, we investigate the practical importance of our approach while exploring both its strengths and limitations. The results indicate that the approach is mainly suitable for large datasets, and that further research is needed to improve the computational efficiency of the approach and to ensure the independence of the sub-problems defined using chunking.

Feature selection based on niching particle swarm optimization for omics data classification

2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI) ◽

10.1109/ichci51889.2020.00036 ◽

2020 ◽

Author(s):

Zhao Xu ◽

Junshan Yang

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Particle Swarm ◽

Data Classification ◽

Omics Data ◽

Swarm Optimization
