An integrated PSO for parameter determination and feature selection of ELM and its application in classification of power system disturbances

2015 ◽  
Vol 32 ◽  
pp. 23-37 ◽  
Author(s):  
R. Ahila ◽  
V. Sadasivam ◽  
K. Manimala
Author(s):  
Р.И. Кузьмич ◽  
А.А. Ступина ◽  
М.И. Цепкова ◽  
С.Н. Ежеманская

Предлагается подход для отбора важных признаков при классификации наблюдений. Реализация подхода основана на построении логических правил на базе метода логического анализа данных и учете частоты использования признаков при их формировании для конкретной задачи классификации. An approach is proposed for the selection of important features in the classification of observations. The implementation of the approach is based on the construction of patterns based on the method of logical analysis of data and taking into account the frequency of using features when forming them for a specific classification task.


2016 ◽  
Vol 3 (2) ◽  
pp. 139-148
Author(s):  
M Rizky Wijaya ◽  
Ristu Saptono ◽  
Afrizal Doewes

Diabetes can lead to mortality and disability, so patients should be inpatient again to undergo treatment again to be saved. On previous research about feature selection with greedy stepwise forward fail to predict classification ratio inpatient of patient with the result of recall and precision 0 on data training 60%, 75%, 80%, and 90% and there is suggestion to handle unbalanced class data problem by comparison of data readmitted 6293 and the otherwise 64141. The research purposed to know the effect of choosing the best model using best first instead of greedy stepwise forward and data sampling with spreadsubsample to resolve unbalanced class data problem. The data used was patient data from 130 American Hospital in 1999 until 2008 with 70434 data. The method that used was best first search and spreadsubsample. The result of this research are precision found 0.4 and 0.333 on training dataset 75% and 90% with best first method, while spreadsubsample method found that value of precision and recall is more significantly increased. Spreadsubsample has more effect with the result of precision and recall rather than using best first method.


2014 ◽  
Vol 543-547 ◽  
pp. 3614-3620
Author(s):  
Zhi Qiang Li ◽  
De Quan Yang ◽  
Yuan Tan ◽  
Yuan Ping Zou

For the attribute-weighted based naive Bayesian classification algorithms, the selection of the weight directly affects the classification results. Based on this, the drawbacks of the TFIDF feature selection approaches in sentiment classification for the microblogs are analyzed, and an improved algorithm named TF-D(t)-CHI is proposed, which applies statistical calculation to obtain the correlation degree between the feature words and the classes. It presents the distribution of the feature items by variance in classes, which solves the problem that the short-texts contain few feature words while the high frequency feature words have too high weight. Experimental result indicate that TF-D(T)-CHI based naive Bayesian classification for feature selection and weight calculation has better classification results in sentiment classification for microblogs.


Sign in / Sign up

Export Citation Format

Share Document