Abstract
Students’ performance prediction systems play a vital role in enhancing the educational performance inside universities, schools, and training centers. Big data can come from different resources such as exam centers, virtual courses, registration departments, e-learning systems, and so on. Extracting meaningful knowledge from educational data is a complex task, so, reducing the data dimensionality is needed. In this paper, we proposed an enhanced binary genetic algorithm (EBGA) as a wrapper feature selection algorithm. Novel hybrid selection mechanism based on a k-means algorithm and Electromagnetic-like mechanism (EM) method is proposed. K-means will cluster the population into a set of clusters, while EM will determine a value called a total force (TF) for each solution. Each cluster has an accumulated total force (ATF) (i.e., adding all TFs together). Selection process will select two solutions with the highest TF from the cluster, which has the highest ATF. We employed a hybrid machine learning approach between the proposed EBGA and five different classifiers (i.e., k-Nearest Neighbors (k-NN), Decision Trees (DT), Naive Bayes (NB), Support Vector Machine (SVM), and Linear Discriminant Analysis (LDA)). Two real case studies obtained from UCI Machine Learning Repository are used in this paper. Obtained results showed the ability of the proposed approach to enhance the performance of the binary genetic algorithm. Moreover, the performances of all classifiers are improved between 1% to 11%.