data mining algorithms
Recently Published Documents


TOTAL DOCUMENTS

887
(FIVE YEARS 343)

H-INDEX

26
(FIVE YEARS 7)

Author(s):  
Özerk Yavuz

Epidemic diseases can be extremely dangerous with its hazarding influences. They may have negative effects on economies, businesses, environment, humans, and workforce. In this paper, some of the factors that are interrelated with COVID-19 pandemic have been examined using data mining methodologies and approaches. As a result of the analysis some rules and insights have been discovered and performances of the data mining algorithms have been evaluated. According to the analysis results, JRip algorithmic technique had the most correct classification rate and the lowest root mean squared error (RMSE). Considering classification rate and RMSE measure, JRip can be considered as an effective method in understanding factors that are related with corona virus caused deaths.


2022 ◽  
Vol 2022 ◽  
pp. 1-9
Author(s):  
Jiawen Du ◽  
Yong Pi

With the advent of the era of big data, people’s lives have undergone earth-shaking changes, not only getting rid of the cumbersome traditional data collection but also collecting and sorting information directly from people’s footprints on social networks. This paper explores and analyzes the privacy issues in current social networks and puts forward the protection strategies of users’ privacy data based on data mining algorithms so as to truly ensure that users’ privacy in social networks will not be illegally infringed in the era of big data. The data mining algorithm proposed in this paper can protect the user’s identity from being identified and the user’s private information from being leaked. Using differential privacy protection methods in social networks can effectively protect users’ privacy information in data publishing and data mining. Therefore, it is of great significance to study data publishing, data mining methods based on differential privacy protection, and their application in social networks.


Learning data analytics improves the learning field in higher education using educational data for extracting useful patterns and making better decision. Identifying potential at-risk students may help instructors and academic guidance to improve the students’ performance and the achievement of learning outcomes. The aim of this research study is to predict at early phases the student’s failure in a particular course using the standards-based grading. Several machines learning techniques were implemented to predict the student failure based on Support Vector Machine, Multilayer Perceptron, Naïve Bayes, and decision tree. The results on each technique shows the ability of machine learning algorithms to predict the student failure accurately after the third week and before the course dropout week. This study provides a strong knowledge for student performance in all courses. It also provides faculty members the ability to help student at-risk by focusing on them and providing necessary support to improve their performance and avoid failure.


2022 ◽  
pp. 1054-1070
Author(s):  
Andrew Stranieri ◽  
Venki Balasubramanian

Remote patient monitoring involves the collection of data from wearable sensors that typically requires analysis in real time. The real-time analysis of data streaming continuously to a server challenges data mining algorithms that have mostly been developed for static data residing in central repositories. Remote patient monitoring also generates huge data sets that present storage and management problems. Although virtual records of every health event throughout an individual's lifespan known as the electronic health record are rapidly emerging, few electronic records accommodate data from continuous remote patient monitoring. These factors combine to make data analytics with continuous patient data very challenging. In this chapter, benefits for data analytics inherent in the use of standards for clinical concepts for remote patient monitoring is presented. The openEHR standard that describes the way in which concepts are used in clinical practice is well suited to be adopted as the standard required to record meta-data about remote monitoring. The claim is advanced that this is likely to facilitate meaningful real time analyses with big remote patient monitoring data. The point is made by drawing on a case study involving the transmission of patient vital sign data collected from wearable sensors in an Indian hospital.


2022 ◽  
pp. 163-187
Author(s):  
Gökçe Karahan Adalı

This study aims to measure the effect of the preventive policies on public during the COVID-19 pandemic as well as measuring the public's trust in the government. The study examines the determinants of public trust in governments and the associations between the preventive measures. It is also aimed to determine the protective measures that governments prefer to implement together by using association rules of data mining algorithms. By this means, double and triple action packages are presented. This study finds that basic characteristics such as education, health, and age are among the most basic determinants of trust in governments during the pandemic. The trust in government and opinions that measures taken are sufficient decreased as the education level increased. Considering the age criteria, this situation is the opposite. It is observed that women followed the preventative policies more strictly than men. It is also observed that public trust in governments is directly proportional to the development levels of countries.


Author(s):  
Hanein Omar Mohamed, Basma.F.Idris Hanein Omar Mohamed, Basma.F.Idris

Asthma is a chronic disease that is caused by inflammation of airways. Diagnosis, predication and classification of asthmatic are one of the major attractive areas of research for decades by using different and recent techniques, however the main problem of asthma is misdiagnosis. This paper simplifies and compare between different Artificial Neural Network techniques used to solve this problem by using different algorithms to getting a high level of accuracyin diagnosis, prediction, and classification of asthma like: (data mining algorithms, machine learning algorithms, deep machine learning algorithms), depending and passing through three stages: data acquisition, feature extracting, data classification. According to the comparison of different techniques the high accuracy achieved by ANN was (98.85%), and the low accuracy of it was (80%), despite of the accuracy achieved by Support Vector Machine (SVM) was (86%) when used Mel Frequency Cepstral Coefficient MFCC for feature extraction, while the accuracy was (99.34%) when used Relief for extracting feature. Based in our comparison we recommend that if the researchers used the same techniques they should to return to previous studies it to get high accuracy.


Author(s):  
Владимир Арнольдович Биллиг ◽  
Николай Васильевич Звягинцев

В настоящее время накоплено значительное количество экспериментальных данных, фиксирующих процесс протекания химических реакций. Анализ этих данных комплексом алгоритмов Data Mining дает важную практическую информацию для поиска эффективных условий проведения реакций, при которых получается максимальное количество целевого продукта при минимальных затратах. В данной работе на примере работы с базой, содержащей данные о протекании реакции карбонилирования различных олефинов, показано, как разработанный нами программный комплекс позволяет извлечь полезные знания, способствующие повышению эффективности химических реакций. At present, a significant amount of experimental data has been accumulated, recording the process of the occurrence of chemical reactions. Analysis of these data by a set of Data Mining algorithms provides important practical information for finding effective conditions for carrying out reactions, at which the maximum amount of the target product is obtained at minimal cost. In this paper, using the example of working with a database containing data on the course of the carbonylation reaction of various olefins, it is shown how the software package developed by us allows us to extract useful knowledge that contributes to an increase in the efficiency of chemical reactions.


Author(s):  
Ansar Abbas ◽  
Muhammad Aman Ullah ◽  
Abdul Waheed

This study is conducted to predict the body weight (BW) for Thalli sheep of southern Punjab from different body measurements. In the BW prediction, several body measurements viz., withers height, body length, head length, head width, ear length, ear width, neck length, neck width, heart girth, rump length, rump width, tail length, barrel depth and sacral pelvic width are used as predictors. The data mining algorithms such as Chi-square Automatic Interaction Detector (CHAID), Exhaustive CHAID, Classification and Regression Tree (CART) and Artificial Neural Network (ANN) are used to predict the BW for a total of 85 female Thalli sheep. The data set is partitioned into training (80 %) and test (20 %) sets before the algorithms are used. The minimum number of parent (4) and child nodes (2) are set in order to ensure their predictive ability. The R2 % and RMSE values for CHAID, Exhaustive CHAID, ANN and CART algorithms are 67.38(1.003), 64.37(1.049), 61.45(1.093) and 59.02(1.125), respectively. The mostsignificant predictor is BL in the BW prediction of Thalli sheep. The heaviest BW average of 9.596 kg is obtained from the subgroup of those having BL > 25.000 inches. On behalf of the several goodness of fit criteria, we conclude that the CHAID algorithm performance is better in order to predict the BW of Thalli sheep and more suitable decision tree diagram visually. Also, the obtained CHAID results may help to determine body measurements positively associated with BW for developing better selection strategies with the scope of indirect selection criteria.


Sign in / Sign up

Export Citation Format

Share Document