mining algorithms Latest Research Papers

Epidemic diseases can be extremely dangerous with its hazarding influences. They may have negative effects on economies, businesses, environment, humans, and workforce. In this paper, some of the factors that are interrelated with COVID-19 pandemic have been examined using data mining methodologies and approaches. As a result of the analysis some rules and insights have been discovered and performances of the data mining algorithms have been evaluated. According to the analysis results, JRip algorithmic technique had the most correct classification rate and the lowest root mean squared error (RMSE). Considering classification rate and RMSE measure, JRip can be considered as an effective method in understanding factors that are related with corona virus caused deaths.

Download Full-text

Data Mining Algorithms for Water Main Condition Prediction—Comparative Analysis

Journal of Water Resources Planning and Management ◽

10.1061/(asce)wr.1943-5452.0001512 ◽

2022 ◽

Vol 148 (2) ◽

Author(s):

Ahmed Assad ◽

Ahmed Bouferguene

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Main Condition ◽

Data Mining Algorithms ◽

Water Main ◽

Mining Algorithms

Download Full-text

Mining Simple Path Traversal Patterns in Knowledge Graph

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2128 ◽

2022 ◽

Author(s):

Feng Xiong ◽

Hongzhi Wang

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Divide And Conquer ◽

Simple Path ◽

Knowledge Graph ◽

Sequence Mining ◽

High Coverage ◽

List Structure ◽

Life Force ◽

Mining Algorithms

The data mining has remained a subject of unfailing charm for research. The knowledge graph is rising and showing infinite life force and strong developing potential in recent years, where it is observed that acyclic knowledge graph has capacity for enhancing usability. Though the development of knowledge graphs has provided an ample scope for appearing the abilities of data mining, related researches are still insufficient. In this paper, we introduce path traversal patterns mining to knowledge graph. We design a novel simple path traversal pattern mining framework for improving the representativeness of result. A divide-and-conquer approach of combining each path is proposed to discover the most frequent traversal patterns in knowledge graph. To support the algorithm, we design a linked list structure indexed by the length of sequences with handy operations. The correctness of algorithm is proven. Experiments show that our algorithm reaches a high coverage with low output amounts compared to existing frequent sequence mining algorithms.

Download Full-text

DDMF: A Method for Mining Relatively Important Nodes Based on Distance Distribution and Multi-Index Fusion

Applied Sciences ◽

10.3390/app12010522 ◽

2022 ◽

Vol 12 (1) ◽

pp. 522

Author(s):

Na Zhao ◽

Qian Liu ◽

Ming Jing ◽

Jie Li ◽

Zhidan Zhao ◽

...

Keyword(s):

Complex Networks ◽

Euclidean Distance ◽

Distance Distribution ◽

Entropy Weight ◽

Multi Index ◽

Entropy Weight Method ◽

Weight Method ◽

Auc Value ◽

Mining Algorithms ◽

Important Nodes

In research on complex networks, mining relatively important nodes is a challenging and practical work. However, little research has been done on mining relatively important nodes in complex networks, and the existing relatively important node mining algorithms cannot take into account the indicators of both precision and applicability. Aiming at the scarcity of relatively important node mining algorithms and the limitations of existing algorithms, this paper proposes a relatively important node mining method based on distance distribution and multi-index fusion (DDMF). First, the distance distribution of each node is generated according to the shortest path between nodes in the network; then, the cosine similarity, Euclidean distance and relative entropy are fused, and the entropy weight method is used to calculate the weights of different indexes; Finally, by calculating the relative importance score of nodes in the network, the relatively important nodes are mined. Through verification and analysis on real network datasets in different fields, the results show that the DDMF method outperforms other relatively important node mining algorithms in precision, recall, and AUC value.

Download Full-text

Developing and Comparing Data Mining algorithms that Work Best for Predicting Student’s Performance

International Journal of Information and Communication Technology Education ◽

10.4018/ijicte.293235 ◽

2022 ◽

Vol 18 (1) ◽

pp. 0-0

Keyword(s):

At Risk ◽

Student Performance ◽

At Risk Students ◽

Machine Learning Algorithms ◽

Support Vector ◽

Standards Based Grading ◽

Data Mining Algorithms ◽

Student Failure ◽

Student’S Performance ◽

Mining Algorithms

Learning data analytics improves the learning field in higher education using educational data for extracting useful patterns and making better decision. Identifying potential at-risk students may help instructors and academic guidance to improve the students’ performance and the achievement of learning outcomes. The aim of this research study is to predict at early phases the student’s failure in a particular course using the standards-based grading. Several machines learning techniques were implemented to predict the student failure based on Support Vector Machine, Multilayer Perceptron, Naïve Bayes, and decision tree. The results on each technique shows the ability of machine learning algorithms to predict the student failure accurately after the third week and before the course dropout week. This study provides a strong knowledge for student performance in all courses. It also provides faculty members the ability to help student at-risk by focusing on them and providing necessary support to improve their performance and avoid failure.

Download Full-text

Measuring the Attitudes of Governmental Policies and the Public Towards the COVID-19 Pandemic

10.4018/978-1-7998-8674-7.ch009 ◽

2022 ◽

pp. 163-187

Author(s):

Gökçe Karahan Adalı

Keyword(s):

Data Mining ◽

Public Trust ◽

Education Level ◽

Protective Measures ◽

The Public ◽

Data Mining Algorithms ◽

The Government ◽

Development Levels ◽

Basic Characteristics ◽

Mining Algorithms

This study aims to measure the effect of the preventive policies on public during the COVID-19 pandemic as well as measuring the public's trust in the government. The study examines the determinants of public trust in governments and the associations between the preventive measures. It is also aimed to determine the protective measures that governments prefer to implement together by using association rules of data mining algorithms. By this means, double and triple action packages are presented. This study finds that basic characteristics such as education, health, and age are among the most basic determinants of trust in governments during the pandemic. The trust in government and opinions that measures taken are sufficient decreased as the education level increased. Considering the age criteria, this situation is the opposite. It is observed that women followed the preventative policies more strictly than men. It is also observed that public trust in governments is directly proportional to the development levels of countries.

Download Full-text

FABRIC AND PRODUCTION DEFECT DETECTION IN THE APPAREL INDUSTRY USING DATA MINING ALGORITHMS

International Journal of 3D Printing Technologies and Digital Industry ◽

10.46519/ij3dptdi.1030676 ◽

2021 ◽

Author(s):

Taner ERSÖZ ◽

Hamza ZAHOOR ◽

Filiz ERSÖZ

Keyword(s):

Data Mining ◽

Defect Detection ◽

Apparel Industry ◽

Data Mining Algorithms ◽

Using Data ◽

Mining Algorithms

Download Full-text

Data Mining algorithms in search of effective conditions for conducting chemical reactions

Herald of Tver State University Series Applied Mathematics ◽

10.26456/vtpmk625 ◽

2021 ◽

pp. 29-42

Author(s):

Владимир Арнольдович Биллиг ◽

Николай Васильевич Звягинцев

Keyword(s):

Data Mining ◽

Chemical Reactions ◽

Software Package ◽

Maximum Amount ◽

Minimal Cost ◽

Useful Knowledge ◽

Data Mining Algorithms ◽

Practical Information ◽

Mining Algorithms ◽

Effective Conditions

В настоящее время накоплено значительное количество экспериментальных данных, фиксирующих процесс протекания химических реакций. Анализ этих данных комплексом алгоритмов Data Mining дает важную практическую информацию для поиска эффективных условий проведения реакций, при которых получается максимальное количество целевого продукта при минимальных затратах. В данной работе на примере работы с базой, содержащей данные о протекании реакции карбонилирования различных олефинов, показано, как разработанный нами программный комплекс позволяет извлечь полезные знания, способствующие повышению эффективности химических реакций. At present, a significant amount of experimental data has been accumulated, recording the process of the occurrence of chemical reactions. Analysis of these data by a set of Data Mining algorithms provides important practical information for finding effective conditions for carrying out reactions, at which the maximum amount of the target product is obtained at minimal cost. In this paper, using the example of working with a database containing data on the course of the carbonylation reaction of various olefins, it is shown how the software package developed by us allows us to extract useful knowledge that contributes to an increase in the efficiency of chemical reactions.

Download Full-text

Body Weight Prediction of Thalli Sheep Reared in Southern Punjab Using Different Data Mining Algorithms

Proceedings of the Pakistan Academy of Sciences: A. Physical and Computational Sciences ◽

10.53560/ppasa(58-2)603 ◽

2021 ◽

Vol 58 (2) ◽

pp. 29-38

Author(s):

Ansar Abbas ◽

Muhammad Aman Ullah ◽

Abdul Waheed

Keyword(s):

Data Mining ◽

Body Weight ◽

Goodness Of Fit ◽

The Body ◽

Classification And Regression Tree ◽

Body Measurements ◽

Data Set ◽

Data Mining Algorithms ◽

Exhaustive Chaid ◽

Mining Algorithms

This study is conducted to predict the body weight (BW) for Thalli sheep of southern Punjab from different body measurements. In the BW prediction, several body measurements viz., withers height, body length, head length, head width, ear length, ear width, neck length, neck width, heart girth, rump length, rump width, tail length, barrel depth and sacral pelvic width are used as predictors. The data mining algorithms such as Chi-square Automatic Interaction Detector (CHAID), Exhaustive CHAID, Classification and Regression Tree (CART) and Artificial Neural Network (ANN) are used to predict the BW for a total of 85 female Thalli sheep. The data set is partitioned into training (80 %) and test (20 %) sets before the algorithms are used. The minimum number of parent (4) and child nodes (2) are set in order to ensure their predictive ability. The R2 % and RMSE values for CHAID, Exhaustive CHAID, ANN and CART algorithms are 67.38(1.003), 64.37(1.049), 61.45(1.093) and 59.02(1.125), respectively. The mostsignificant predictor is BL in the BW prediction of Thalli sheep. The heaviest BW average of 9.596 kg is obtained from the subgroup of those having BL > 25.000 inches. On behalf of the several goodness of fit criteria, we conclude that the CHAID algorithm performance is better in order to predict the BW of Thalli sheep and more suitable decision tree diagram visually. Also, the obtained CHAID results may help to determine body measurements positively associated with BW for developing better selection strategies with the scope of indirect selection criteria.

Download Full-text

Digital Mining Algorithm of English Translation Course Information Based on Digital Twin Technology

Wireless Communications and Mobile Computing ◽

10.1155/2021/9741948 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Juan Yang

Keyword(s):

Execution Time ◽

English Translation ◽

Accuracy Rate ◽

Language Mapping ◽

Digital Twin ◽

Mining Algorithm ◽

Language Communication ◽

Text Information ◽

Cross Language ◽

Mining Algorithms

Cross-language communication puts forward higher requirements for information mining in English translation course. Aiming at the problem that the frequent patterns in the current digital mining algorithms produce a large number of patterns and rules, with a long execution time, this paper proposes a digital mining algorithm for English translation course information based on digital twin technology. According to the results of word segmentation and tagging, the feature words of English translation text are extracted, and the cross-language mapping of text is established by using digital twin technology. The estimated probability of text translation is maximized by corresponding relationship. The text information is transformed into text vector, the semantic similarity of text is calculated, and the degree of translation matching is judged. Based on this data dimension, the frequent sequence is constructed by transforming suffix sequence into prefix sequence, and the digital mining algorithm is designed. The results of example analysis show that the execution time of digital mining algorithm based on digital twin technology is significantly shorter than that based on Apriori and Map Reduce, and the mining accuracy rate reached more than 80%, which has good performance in processing massive data.

Download Full-text

mining algorithms
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

A data mining analysis of COVID-19 cases in states of United States of America

Data Mining Algorithms for Water Main Condition Prediction—Comparative Analysis

Mining Simple Path Traversal Patterns in Knowledge Graph

DDMF: A Method for Mining Relatively Important Nodes Based on Distance Distribution and Multi-Index Fusion

Developing and Comparing Data Mining algorithms that Work Best for Predicting Student’s Performance

Measuring the Attitudes of Governmental Policies and the Public Towards the COVID-19 Pandemic

FABRIC AND PRODUCTION DEFECT DETECTION IN THE APPAREL INDUSTRY USING DATA MINING ALGORITHMS

Data Mining algorithms in search of effective conditions for conducting chemical reactions

Body Weight Prediction of Thalli Sheep Reared in Southern Punjab Using Different Data Mining Algorithms

Digital Mining Algorithm of English Translation Course Information Based on Digital Twin Technology

Export Citation Format

mining algorithmsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

A data mining analysis of COVID-19 cases in states of United States of America

Data Mining Algorithms for Water Main Condition Prediction—Comparative Analysis

Mining Simple Path Traversal Patterns in Knowledge Graph

DDMF: A Method for Mining Relatively Important Nodes Based on Distance Distribution and Multi-Index Fusion

Developing and Comparing Data Mining algorithms that Work Best for Predicting Student’s Performance

Measuring the Attitudes of Governmental Policies and the Public Towards the COVID-19 Pandemic

FABRIC AND PRODUCTION DEFECT DETECTION IN THE APPAREL INDUSTRY USING DATA MINING ALGORITHMS

Data Mining algorithms in search of effective conditions for conducting chemical reactions

Body Weight Prediction of Thalli Sheep Reared in Southern Punjab Using Different Data Mining Algorithms

Digital Mining Algorithm of English Translation Course Information Based on Digital Twin Technology

mining algorithms
Recently Published Documents