scholarly journals FINDING THE BEST ALGORITHMS AND EFFECTIVE FACTORS IN CLASSIFICATION OF TURKISH SCIENCE STUDENT SUCCESS

2019 ◽  
Vol 18 (2) ◽  
pp. 239-253 ◽  
Author(s):  
Enes Filiz ◽  
Ersoy Öz

Educational Data Mining (EDM) is an important tool in the field of classification of educational data that helps researchers and education planners analyse and model available educational data for specific needs such as developing educational strategies. Trends International Mathematics and Science Study (TIMSS) which is a notable study in educational area was used in this research. EDM methodology was applied to the results of TIMSS 2015 that presents data culled from eighth grade students from Turkey. The main purposes are to find the algorithms that are most appropriate for classifying the successes of students, especially in science subjects, and ascertaining the factors that lead to this success. It was found that logistic regression and support vector machines – poly kernel are the most suitable algorithms. A diverse set of features obtained by feature selection methods are “Computer Tablet Shared”, “Extra Lessons Last 12 Month”, “Extra Lessons How Many Month”, “How Far in Education Do You Expect to Go”, “Home Educational Resources”, and “Student Confident in Science” and these features are the most effective features in science success. Keywords: classification algorithms, educational data mining, eighth grade, science success, TIMSS 2015.

2020 ◽  
pp. 1248-1271
Author(s):  
Seda Tolun ◽  
Halit Alper Tayalı

This chapter focuses on available data analysis and data mining techniques to find the optimal location of the Multicriteria Single Facility Location Problem (MSFLP) at diverse business settings. Solving for the optimal of an MSFLP, there exists numerous multicriteria decision analysis techniques. Mainstream models are mentioned in this chapter, while presenting a general classification of the MSFLP and its framework. Besides, topics from machine learning with respect to decision analysis are covered: Unsupervised Principal Components Analysis ranking (PCA-rank) and supervised Support Vector Machines ranking (SVM-rank). This chapter proposes a data mining perspective for the multicriteria single facility location problem and proposes a new approach to the facility location problem with the combination of the PCA-rank and ranking SVMs.


Author(s):  
Adeel Ahmed ◽  
Kamlesh Kumar ◽  
Mansoor A. Khuhro ◽  
Asif A. Wagan ◽  
Imtiaz A. Halepoto ◽  
...  

Nowadays, educational data mining is being employed as assessing tool for study and analysis of hidden patterns in academic databases which can be used to predict student’s academic performance. This paper implements various machine learning classification techniques on students’ academic records for results predication. For this purpose, data of MS(CS) students were collected from a public university of Pakistan through their assignments, quizzes, and sessional marks. The WEKA data mining tool has been used for performing all experiments namely, data pre-processing, classification, and visualization. For performance measure, classifier models were trained with 3- and 10-fold cross validation methods to evaluate classifiers' accuracy. The results show that bagging classifier combined with support vector machines outperform other classifiers in terms of accuracy, precision, recall, and F-measure score. The obtained outcomes confirm that our research provides significant contribution in prediction of students’ academic performance which can ultimately be used to assists faculty members to focus low grades students in improving their academic records.


Author(s):  
Seda Tolun ◽  
Halit Alper Tayalı

This chapter focuses on available data analysis and data mining techniques to find the optimal location of the Multicriteria Single Facility Location Problem (MSFLP) at diverse business settings. Solving for the optimal of an MSFLP, there exists numerous multicriteria decision analysis techniques. Mainstream models are mentioned in this chapter, while presenting a general classification of the MSFLP and its framework. Besides, topics from machine learning with respect to decision analysis are covered: Unsupervised Principal Components Analysis ranking (PCA-rank) and supervised Support Vector Machines ranking (SVM-rank). This chapter proposes a data mining perspective for the multicriteria single facility location problem and proposes a new approach to the facility location problem with the combination of the PCA-rank and ranking SVMs.


Author(s):  
Marianne Maktabi ◽  
Hannes Köhler ◽  
Magarita Ivanova ◽  
Thomas Neumuth ◽  
Nada Rayes ◽  
...  

Author(s):  
VLADIMIR NIKULIN ◽  
TIAN-HSIANG HUANG ◽  
GEOFFREY J. MCLACHLAN

The method presented in this paper is novel as a natural combination of two mutually dependent steps. Feature selection is a key element (first step) in our classification system, which was employed during the 2010 International RSCTC data mining (bioinformatics) Challenge. The second step may be implemented using any suitable classifier such as linear regression, support vector machine or neural networks. We conducted leave-one-out (LOO) experiments with several feature selection techniques and classifiers. Based on the LOO evaluations, we decided to use feature selection with the separation type Wilcoxon-based criterion for all final submissions. The method presented in this paper was tested successfully during the RSCTC data mining Challenge, where we achieved the top score in the Basic track.


2011 ◽  
Vol 61 (9) ◽  
pp. 2874-2878 ◽  
Author(s):  
L. Gonzalez-Abril ◽  
F. Velasco ◽  
J.A. Ortega ◽  
L. Franco

Author(s):  
Rakesh Kumar ◽  
Avinash M. Jade ◽  
Valadi K. Jayaraman ◽  
Bhaskar D. Kulkarni

A hybrid strategy of using (i) locally linear embedding for nonlinear dimensionality reduction of high dimensional data and (ii) support vector machines for classification of the resultant features is proposed as a robust methodology for process monitoring. Illustrative examples substantiate the methodology vis-à-vis current practice.


2016 ◽  
Vol 51 (20) ◽  
pp. 2853-2862 ◽  
Author(s):  
Serkan Ballı

The aim of this study is to diagnose and classify the failure modes for two serial fastened sandwich composite plates using data mining techniques. The composite material used in the study was manufactured using glass fiber reinforced layer and aluminum sheets. Obtained results of previous experimental study for sandwich composite plates, which were mechanically fastened with two serial pins or bolts were used for classification of failure modes. Furthermore, experimental data from previous study consists of different geometrical parameters for various applied preload moments as 0 (pinned), 2, 3, 4, and 5 Nm (bolted). In this study, data mining methods were applied by using these geometrical parameters and pinned/bolted joint configurations. Therefore, three geometrical parameters and 100 test data were used for classification by utilizing support vector machine, Naive Bayes, K-Nearest Neighbors, Logistic Regression, and Random Forest methods. According to experiments, Random Forest method achieved better results than others and it was appropriate for diagnosing and classification of the failure modes. Performances of all data mining methods used were discussed in terms of accuracy and error ratios.


Sign in / Sign up

Export Citation Format

Share Document