Comparing the Performance of FCBF, Chi-Square and Relief-F Filter Feature Selection Algorithms in Educational Data Mining

Educational Data Mining (EDM) is an emerging research area help the educational institutions to improve the performance of their students. Feature Selection (FS) algorithms remove irrelevant data from the educational dataset and hence increases the performance of classifiers used in EDM techniques. This paper present an analysis of the performance of feature selection algorithms on student data set. .In this papers the different problems that are defined in problem formulation. All these problems are resolved in future. Furthermore the paper is an attempt of playing a positive role in the improvement of education quality, as well as guides new researchers in making academic intervention.

Download Full-text

Feature Selection Algorithms for Data Mining Classification: A Survey

Indian Journal of Science and Technology ◽

10.17485/ijst/2019/v12i6/139581 ◽

2019 ◽

Vol 12 (6) ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

N. Krishnaveni ◽

V. Radha ◽

◽

Keyword(s):

Data Mining ◽

Feature Selection ◽

Selection Algorithms

Download Full-text

A NOVEL FEATURE SELECTION ALGORITHM WITH SUPERVISED MUTUAL INFORMATION FOR CLASSIFICATION

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213013500279 ◽

2013 ◽

Vol 22 (04) ◽

pp. 1350027

Author(s):

JAGANATHAN PALANICHAMY ◽

KUPPUCHAMY RAMASAMY

Keyword(s):

Machine Learning ◽

Data Mining ◽

Feature Selection ◽

Mutual Information ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Class A ◽

Selection Algorithms ◽

The Relationship ◽

Class Variable

Feature selection is essential in data mining and pattern recognition, especially for database classification. During past years, several feature selection algorithms have been proposed to measure the relevance of various features to each class. A suitable feature selection algorithm normally maximizes the relevancy and minimizes the redundancy of the selected features. The mutual information measure can successfully estimate the dependency of features on the entire sampling space, but it cannot exactly represent the redundancies among features. In this paper, a novel feature selection algorithm is proposed based on maximum relevance and minimum redundancy criterion. The mutual information is used to measure the relevancy of each feature with class variable and calculate the redundancy by utilizing the relationship between candidate features, selected features and class variables. The effectiveness is tested with ten benchmarked datasets available in UCI Machine Learning Repository. The experimental results show better performance when compared with some existing algorithms.

Download Full-text

Developed third iterative dichotomizer based on feature decisive values for educational data mining

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i1.pp209-217 ◽

2020 ◽

Vol 18 (1) ◽

pp. 209

Author(s):

Saja Taha Ahmed ◽

Rafah Al-Hamdani ◽

Muayad Sadik Croock

Keyword(s):

Data Mining ◽

Feature Selection ◽

Decision Tree ◽

Predictive Analytics ◽

Educational Data Mining ◽

Target Class ◽

Id3 Algorithm ◽

Feature Weight ◽

Holdout Validation ◽

Fold Cross Validation

Recently, the decision trees have been adopted among the preeminent utilized classification models. They acquire their fame from their efficiency in predictive analytics, easy to interpret and implicitly perform feature selection. This latter perspective is one of essential significance in Educational Data Mining (EDM), in which selecting the most relevant features has a major impact on classification accuracy enhancement. The main contribution is to build a new multi-objective decision tree, which can be used for feature selection and classification. The proposed Decisive Decision Tree (DDT) is introduced and constructed based on a decisive feature value as a feature weight related to the target class label. The traditional Iterative Dichotomizer 3 (ID3) algorithm and the proposed DDT are compared using three datasets in terms of some ID3 issues, including logarithmic calculation complexity and multi-values featuresselection. The results indicated that the proposed DDT outperforms the ID3 in the developing time. The accuracy of the classification is improved on the basis of 10-fold cross-validation for all datasets with the highest accuracy achieved by the proposed method is 92% for the student.por dataset and holdout validation for two datasets, i.e. Iraqi and Student-Math. The experiment also shows that the proposed DDT tends to select attributes that are important rather than multi-value.

Download Full-text

Application of Feature Selection Methods in Educational Data Mining

International Journal of Computer Applications ◽

10.5120/18048-8951 ◽

2014 ◽

Vol 103 (2) ◽

pp. 34-38 ◽

Cited By ~ 5

Author(s):

Anal Acharya ◽

Devadatta Sinha

Keyword(s):

Data Mining ◽

Feature Selection ◽

Educational Data Mining ◽

Selection Methods

Download Full-text

Role of FCBF Feature Selection in Educational Data Mining

Mehran University Research Journal of Engineering and Technology ◽

10.22581/muet1982.2004.09 ◽

2020 ◽

Vol 39 (4) ◽

pp. 772-778

Author(s):

Maryam Zaffar ◽

Manzoor Ahmad Hashmani ◽

K.S. Savita ◽

Syed Sajjad Hussain Rizvi ◽

Mubashar Rehman

Keyword(s):

Data Mining ◽

Feature Selection ◽

Prediction Model ◽

Student Performance ◽

Performance Prediction ◽

Prediction Models ◽

Educational Data Mining ◽

Action Plans ◽

Factors Affecting ◽

Academic Organization

The Educational Data Mining (EDM) is a very vigorous area of Data Mining (DM), and it is helpful in predicting the performance of students. Student performance prediction is not only important for the student but also helpful for academic organization to detect the causes of success and failures of students. Furthermore, the features selected through the students’ performance prediction models helps in developing action plans for academic welfare. Feature selection can increase the prediction accuracy of the prediction model. In student performance prediction model, where every feature is very important, as a neglection of any important feature can cause the wrong development of academic action plans. Moreover, the feature selection is a very important step in the development of student performance prediction models. There are different types of feature selection algorithms. In this paper, Fast Correlation-Based Filter (FCBF) is selected as a feature selection algorithm. This paper is a step on the way to identifying the factors affecting the academic performance of the students. In this paper performance of FCBF is being evaluated on three different student’s datasets. The performance of FCBF is detected well on a student dataset with greater no of features.

Download Full-text

A novel academic performance estimation model using two stage feature selection

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v19.i3.pp1610-1619 ◽

2020 ◽

Vol 19 (3) ◽

pp. 1610

Author(s):

Pamela Chaudhury ◽

Hrudaya Kumar Tripathy

Keyword(s):

Data Mining ◽

Feature Selection ◽

Academic Performance ◽

Educational Data Mining ◽

Performance Estimation ◽

Estimation Model ◽

Proposed Model ◽

Differential Evolutionary Algorithm ◽

Performance Estimation Model ◽

Using Data

Educational data mining has gained tremendous interest from researchers across the globe. Using data mining techniques in the field of education several significant findings have been made. Accurate academic performance estimation is a challenging task. In this study we have developed a novel model to estimate the academic performance of students. Techniques like conversion of categorical attributes into dummy variables, classification, two staged feature selection and an improved differential evolutionary algorithm were used. Our proposed model outperformed existing models of students’ academic performance determination and gave a new direction to it. The proposed model can help not only to reduce the number of academic failures but also help to comprehend the factors contributing to a student’s academic performance (poor, average or outstanding).Computer

Download Full-text

Performance analysis of feature selection algorithm for educational data mining

2017 IEEE Conference on Big Data and Analytics (ICBDA) ◽

10.1109/icbdaa.2017.8284099 ◽

2017 ◽

Cited By ~ 18

Author(s):

Maryam Zaffar ◽

Manzoor Ahmed Hashmani ◽

K. S. Savita

Keyword(s):

Data Mining ◽

Feature Selection ◽

Performance Analysis ◽

Educational Data Mining ◽

Selection Algorithm ◽

Feature Selection Algorithm

Download Full-text

The Impact of Feature Selection Methods for Classifying Arabic Textual Data

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.d7163.118419 ◽

2019 ◽

Vol 8 (4) ◽

pp. 1333-1338

Keyword(s):

Feature Selection ◽

Text Classification ◽

Information Gain ◽

Feature Space ◽

Support Vector ◽

Selection Methods ◽

K Nearest Neighbors ◽

Chi Square ◽

Selection Algorithms ◽

The Impact

Text classification is a vital process due to the large volume of electronic articles. One of the drawbacks of text classification is the high dimensionality of feature space. Scholars developed several algorithms to choose relevant features from article text such as Chi-square (x2 ), Information Gain (IG), and Correlation (CFS). These algorithms have been investigated widely for English text, while studies for Arabic text are still limited. In this paper, we investigated four well-known algorithms: Support Vector Machines (SVMs), Naïve Bayes (NB), K-Nearest Neighbors (KNN), and Decision Tree against benchmark Arabic textual datasets, called Saudi Press Agency (SPA) to evaluate the impact of feature selection methods. Using the WEKA tool, we have experimented the application of the four mentioned classification algorithms with and without feature selection algorithms. The results provided clear evidence that the three feature selection methods often improves classification accuracy by eliminating irrelevant features.

Download Full-text

Comparing the Performance of FCBF, Chi-Square and Relief-F Filter Feature Selection Algorithms in Educational Data Mining

Performance Evaluation of Feature Selection Algorithms in Educational Data Mining

An Enhancement of Feature Selection Algorithm for EDM: A Review

Feature Selection Algorithms for Data Mining Classification: A Survey

A NOVEL FEATURE SELECTION ALGORITHM WITH SUPERVISED MUTUAL INFORMATION FOR CLASSIFICATION

Developed third iterative dichotomizer based on feature decisive values for educational data mining

Application of Feature Selection Methods in Educational Data Mining

Role of FCBF Feature Selection in Educational Data Mining

A novel academic performance estimation model using two stage feature selection

Performance analysis of feature selection algorithm for educational data mining

The Impact of Feature Selection Methods for Classifying Arabic Textual Data

Export Citation Format