Analysing students’ college english test-band 4(CET-4) scores based on educational data mining

There is a large amount of data accumulated in management systems in universities and information hidden in these large amounts of data is not fully tapped and applied. Using data mining techniques in education research can help in the extraction of many valuable patterns from the data. The purpose of this article is to analyze the linear relation between the students’ total score of College English Test-Band 4 (CET-4) and scores in each part of the test, using Support vector machin.e (SVM) to classify students' CET-4 performance in each part. The research collected data with questionnaires and analysis was carried out with the help of SPSS. The result showed that the main factors that affect the CET-4 results are previous experience of learning English, attitudes toward the test and time spent on learning English per day. Keywords: CET-4 score; Educational data mining; SVM classification; Influence factors

Download Full-text

Analisis Kinerja Support Vector Machine dalam Mengidentifikasi Komentar Perundungan pada Jejaring Sosial

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i2.2923 ◽

2021 ◽

Vol 5 (2) ◽

pp. 475

Author(s):

Ade Clinton Sitepu ◽

Wanayumini Wanayumini ◽

Zakarias Situmorang

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Confusion Matrix ◽

Process Research ◽

Training Data ◽

Support Vector ◽

Media Technology ◽

Svm Classification ◽

Svm Algorithm ◽

Using Data

Cyberbullying is the same as bullying but it is done through media technology. Bullying has often occurred along with the development of social media technology in society. Some technique are needed to filter out bully comments because it will indirectly affect the psychological condition of the reader, morover it is aimed at the person concerned. By using data mining techniques, the system is expected to be able to classify information circulating in the community. This research uses the Support Vector Machine (SVM) classification because the algorithm is good at performing the classification process. Research using about 1000 dataset comments. Data are grouped manually first into the labels "bully" and "not bully" then the data divide into training data and test data. To test the system capability, data is analyzed using confusion matrix. The results showed that the SVM Algorithm was able to classify with an level of accuracy 87.75%, 89% precision and 91% Recal. The SVM algorithm is able to formulate training data with level of accuracy 98.3%

Download Full-text

Early Detection of Red Palm Weevil, Rhynchophorus ferrugineus (Olivier), Infestation Using Data Mining

Plants ◽

10.3390/plants10010095 ◽

2021 ◽

Vol 10 (1) ◽

pp. 95

Author(s):

Heba Kurdi ◽

Amal Al-Aldawsari ◽

Isra Al-Turaiki ◽

Abdulrahman S. Aldawood

Keyword(s):

Data Mining ◽

Plant Size ◽

Support Vector ◽

Classification Algorithms ◽

Palm Tree ◽

Rhynchophorus Ferrugineus ◽

Red Palm Weevil ◽

Palm Weevil ◽

Using Data ◽

F Measure

In the past 30 years, the red palm weevil (RPW), Rhynchophorus ferrugineus (Olivier), a pest that is highly destructive to all types of palms, has rapidly spread worldwide. However, detecting infestation with the RPW is highly challenging because symptoms are not visible until the death of the palm tree is inevitable. In addition, the use of automated RPW weevil identification tools to predict infestation is complicated by a lack of RPW datasets. In this study, we assessed the capability of 10 state-of-the-art data mining classification algorithms, Naive Bayes (NB), KSTAR, AdaBoost, bagging, PART, J48 Decision tree, multilayer perceptron (MLP), support vector machine (SVM), random forest, and logistic regression, to use plant-size and temperature measurements collected from individual trees to predict RPW infestation in its early stages before significant damage is caused to the tree. The performance of the classification algorithms was evaluated in terms of accuracy, precision, recall, and F-measure using a real RPW dataset. The experimental results showed that infestations with RPW can be predicted with an accuracy up to 93%, precision above 87%, recall equals 100%, and F-measure greater than 93% using data mining. Additionally, we found that temperature and circumference are the most important features for predicting RPW infestation. However, we strongly call for collecting and aggregating more RPW datasets to run more experiments to validate these results and provide more conclusive findings.

Download Full-text

Osteoporosis Risk Prediction Using Data Mining Algorithms

Journal of Community Health Research ◽

10.18502/jchr.v9i2.3401 ◽

2020 ◽

Author(s):

Efat Jabarpour ◽

Amin Abedini ◽

Abbasali Keshtkar

Keyword(s):

Data Mining ◽

Personal Information ◽

Disease Diagnosis ◽

Support Vector ◽

Data Mining Algorithms ◽

Industry Standard ◽

Disease Information ◽

Increased Risk ◽

Using Data ◽

Mining Algorithms

Introduction: Osteoporosis is a disease that reduces bone density and loses the quality of bone microstructure leading to an increased risk of fractures. It is one of the major causes of inability and death in elderly people. The current study aims at determining the factors influencing the incidence of osteoporosis and providing a predictive model for the disease diagnosis to increase the diagnostic speed and reduce diagnostic costs. Methods: An Individual's data including personal information, lifestyle, and disease information were reviewed. A new model has been presented based on the Cross-Industry Standard Process CRISP methodology. Besides, Support Vector Machine (SVM) and Bayes methods (Tree Augmented Naïve Bayes (TAN)) and Clementine12 have been used as data mining tools. Results: Some features have been detected to affect this disease. The rules have been extracted that can be used as a pattern for the prediction of the patients' status. Classification precision was calculated to be 88.39% for SVM, and 91.29% for (TAN) when the precision of TAN is higher comparing to other methods. Conclusion: The most effective factors concerning osteoporosis are detected and can be used for a new sample with defined characteristics to predict the possibility of osteoporosis in a person.

Download Full-text

Data-Driven Modelling of Smart Building Ventilation Subsystem

Journal of Sensors ◽

10.1155/2019/3572019 ◽

2019 ◽

Vol 2019 ◽

pp. 1-14 ◽

Cited By ~ 5

Author(s):

Grigore Stamatescu ◽

Iulia Stamatescu ◽

Nicoleta Arghira ◽

Ioana Fagarasan

Keyword(s):

Data Mining ◽

Data Streams ◽

Data Driven ◽

Support Vector ◽

Commercial Building ◽

Monitoring And Control ◽

Smart Building ◽

Building Ventilation ◽

Using Data ◽

Rich Data

Considering the advances in building monitoring and control through networks of interconnected devices, effective handling of the associated rich data streams is becoming an important challenge. In many situations, the application of conventional system identification or approximate grey-box models, partly theoretic and partly data driven, is either unfeasible or unsuitable. The paper discusses and illustrates an application of black-box modelling achieved using data mining techniques with the purpose of smart building ventilation subsystem control. We present the implementation and evaluation of a data mining methodology on collected data from over one year of operation. The case study is carried out on four air handling units of a modern campus building for preliminary decision support for facility managers. The data processing and learning framework is based on two steps: raw data streams are compressed using the Symbolic Aggregate Approximation method, followed by the resulting segments being input into a Support Vector Machine algorithm. The results are useful for deriving the behaviour of each equipment in various modi of operation and can be built upon for fault detection or energy efficiency applications. Challenges related to online operation within a commercial Building Management System are also discussed as the approach shows promise for deployment.

Download Full-text

Email Worm Detection Using Data Mining

Techniques and Applications for Advanced Information Privacy and Security ◽

10.4018/978-1-60566-210-7.ch002 ◽

2011 ◽

pp. 20-34

Author(s):

Mohammad M. Masud ◽

Latifur Khan ◽

Bhavani Thuraisingham

Keyword(s):

Data Mining ◽

Feature Selection ◽

Principal Component ◽

Classification Model ◽

Support Vector ◽

Two Phase ◽

Feature Selection Technique ◽

Worm Detection ◽

Phase Selection ◽

Using Data

This chapter applies data mining techniques to detect email worms. Email messages contain a number of different features such as the total number of words in message body/subject, presence/absence of binary attachments, type of attachments, and so on. The goal is to obtain an efficient classification model based on these features. The solution consists of several steps. First, the number of features is reduced using two different approaches: feature-selection and dimension-reduction. This step is necessary to reduce noise and redundancy from the data. The feature-selection technique is called Two-phase Selection (TPS), which is a novel combination of decision tree and greedy selection algorithm. The dimensionreduction is performed by Principal Component Analysis. Second, the reduced data is used to train a classifier. Different classification techniques have been used, such as Support Vector Machine (SVM), Naïve Bayes and their combination. Finally, the trained classifiers are tested on a dataset containing both known and unknown types of worms. These results have been compared with published results. It is found that the proposed TPS selection along with SVM classification achieves the best accuracy in detecting both known and unknown types of worms.

Download Full-text

Application of EDM to Understand the Online Students' Behavioral Pattern

Journal of Information Technology Research ◽

10.4018/jitr.2019070109 ◽

2019 ◽

Vol 12 (3) ◽

pp. 154-168 ◽

Cited By ~ 1

Author(s):

Luis Naito Mendes Bezerra ◽

Márcia Terra da Silva

Keyword(s):

Data Mining ◽

Educational Data Mining ◽

Learning Management Systems ◽

Behavioral Pattern ◽

Management Systems ◽

Online Course ◽

Educational Context ◽

Behavioral Patterns ◽

Limited Capacity ◽

Online Access

In distance learning, the professor cannot see that the students are having trouble with a subject, and can fail to perceive the problem in time to intervene. However, in learning management systems (LMS's) a large volume of data regarding online access, participation and progress can be registered and collected allowing analysis based on students' behavioral patterns. As traditional methods have a limited capacity to extract knowledge from big volumes of data, educational data mining (EDM) arises as a tool to help teachers interpreting the behavior of students. The objective of the present article is to describe the application of educational data mining technics aiming to obtain relevant knowledge of students' behavioral patterns in an LMS for an online course, with 1,113 students enrolled. This paper applies two algorithms on educational context, decision tree and clustering, unveiling unknown relevant aspects to professors and managers, such as the most important examinations that contribute to students' approval as well as the most significant attributes to their success.

Download Full-text

Prognosis of Diabetes Using Data mining Approach-Fuzzy C Means Clustering and Support Vector Machine

International Journal of Computer Trends and Technology ◽

10.14445/22312803/ijctt-v11p120 ◽

2014 ◽

Vol 11 (2) ◽

pp. 94-98 ◽

Cited By ~ 14

Author(s):

Ravi Sanakal ◽

◽

Smt. T Jayakumari

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Support Vector ◽

Fuzzy C Means ◽

Data Mining Approach ◽

Fuzzy C Means Clustering ◽

Using Data

Download Full-text

Student Performance Predictions Using Knowledge Discovery Database and Data Mining, DPU Students Records as Sample

Academic Journal of Nawroz University ◽

10.25007/ajnu.v10n3a875 ◽

2021 ◽

Vol 10 (3) ◽

pp. 121-127

Author(s):

Bareen Haval ◽

Karwan Jameel Abdulrahman ◽

Araz Rajab

Keyword(s):

Data Mining ◽

Decision Tree ◽

Student Performance ◽

Educational Data Mining ◽

Data Sets ◽

Decision Tree Classifier ◽

Data Mining Techniques ◽

Academic History ◽

Tree Classifier ◽

Using Data

This article presents the results of connecting an educational data mining techniques to the academic performance of students. Three classification models (Decision Tree, Random Forest and Deep Learning) have been developed to analyze data sets and predict the performance of students. The projected submission of the three classificatory was calculated and matched. The academic history and data of the students from the Office of the Registrar were used to train the models. Our analysis aims to evaluate the results of students using various variables such as the student's grade. Data from (221) students with (9) different attributes were used. The results of this study are very important, provide a better understanding of student success assessments and stress the importance of data mining in education. The main purpose of this study is to show the student successful forecast using data mining techniques to improve academic programs. The results of this research indicate that the Decision Tree classifier overtakes two other classifiers by achieving a total prediction accuracy of 97%.

Download Full-text

An Effectual Ga Based Association Rule Generation and Fuzzy Svm Classification Algorithm for Predicting Students Performance

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f8805.088619 ◽

2019 ◽

Vol 8 (6) ◽

pp. 2915-2920

Keyword(s):

Data Mining ◽

Academic Performance ◽

Association Rule ◽

Educational Data Mining ◽

Rule Generation ◽

Academic Status ◽

Student Records ◽

Performance Simulation ◽

Svm Classification ◽

Student’S Performance

This investigation provides outcome of utilizing educational data mining [EDM] to design academic performance of students from real time and online dataset collected from colleges. Data mining is determined to examine non-academic and academic data; this model utilizes a classification approach termed as Fuzzy SVM classification with Genetic algorithm to attain effectual understanding of association rule in enrolment and to evaluate data quality for classification, which is identified as prediction task of performance and academic status based on low academic performance. This model attempts to predict student’s performance in grading system. Academic and student records attained from process were considered to train models estimated using cross-validation and formerly records from complete academic performance. Simulation was performed in MATLAB environment and show that academic status prediction is enhanced while hybrid dataset are added. The accuracy was compared with the existing models and shows better trade off than those methods.

Download Full-text

Using Data Mining In Learning Management Systems Amidst Covid-19

Aksara: Jurnal Ilmu Pendidikan Nonformal ◽

10.37905/aksara.6.3.213-216.2020 ◽

2020 ◽

Vol 6 (3) ◽

pp. 213

Author(s):

Froilan D Mobo

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Learning Management Systems ◽

The Philippines ◽

Management Systems ◽

Official Gazette ◽

Learning Management ◽

The Republic ◽

Using Data ◽

The One

The Second Semester of Academic Year 2019-2020 was temporarily suspended due to the widespread COVID-19 last March 16, 2020, forcing the President of the Republic of the Philippines, Hon. Rodrigo Roa Duterte imposed an Enhanced Community Quarantine in Luzon which is known as a lockdown closing all the border points of each town and provinces. One of the major problem encountered during the lockdown is the suspension of classes because as per IATF guidelines you need to stay home, the said Memorandum Order was posted in the official gazette, (Medialdea, 2020)The dataset on the features of the Learning Management Systems using Moodle is that Professors will be the one who will set the topics, quizzes, and exercises for his class even the assessment methods on the system. To prevent from slowing down the network, the Team of Seaversity the developer of the learning management systems headed by C/E Ephrem Dela Cernan conducts a ZOOM Training to all Faculty to be familiarized more on the Learning Management Systems of the Philippine Merchant Marine Academy. The Moodle Learning Management Systems is a user-friendly environment because of its features and users can easily adjust from the traditional face to face teaching going to e-Learning approach because of it’s all capabilities as a data mining methods such as statistics, association rule mining, pattern mining visualization, categorization, clustering, and text mining., (AlAjmi & Shakir, 2013)

Download Full-text