Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications

Frontiers in Bioinformatics ◽

10.3389/fbinf.2021.794547 ◽

2022 ◽

Vol 1 ◽

Author(s):

Agostinetto Giulia ◽

Sandionigi Anna ◽

Bruno Antonia ◽

Pescini Dario ◽

Casiraghi Maurizio

Keyword(s):

Machine Learning ◽

16S rRNA ◽

Association Rule ◽

Association Rule Mining ◽

Pattern Mining ◽

Microbial Community Composition ◽

Frequent Itemset ◽

Supervised Machine Learning ◽

Rule Mining ◽

Microbiome Data

Boosted by the exponential growth of microbiome-based studies, the analysis of microbiome patterns is now a hot topic with many fields of application. In particular, the use of machine learning techniques is increasing in microbiome studies, providing deep insights into microbial community composition. In this context, in order to investigate microbial patterns from 16S rRNA metabarcoding data, we explored the effectiveness of the Association Rule Mining (ARM) technique, a supervised machine learning procedure, to extract patterns (in this work, intended as groups of species or taxa) from microbiome data. ARM can generate huge amounts of data, which makes removing spurious information and visualizing results challenging. Our work sheds light on the strengths and weaknesses of the pattern mining strategy in the study of microbial patterns, in particular from 16S rRNA microbiome datasets, applying ARM to real case studies and providing guidelines for future usage. Our results highlighted issues related to the type of input and the use of metadata in microbial pattern extraction, identifying the key steps that must be considered to apply ARM consciously to 16S rRNA microbiome data. To promote the use of ARM and the visualization of microbiome patterns, we developed microFIM (microbial Frequent Itemset Mining), a versatile Python tool that facilitates the use of ARM by integrating common microbiome outputs, such as taxa tables. microFIM implements interest measures to remove spurious information and merges the results of ARM analysis with common microbiome outputs, providing strategies similar to those already used in microbiome analysis, which help scientists integrate ARM into microbiome applications. With this work, we aimed to create a bridge between microbial ecology researchers and the ARM technique, making researchers aware of the strengths and weaknesses of the association rule mining approach.
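
As a rough illustration of the idea in this abstract, the sketch below turns a small taxa table into presence/absence "transactions" and lists the groups of taxa that co-occur in enough samples. It does not use microFIM or reproduce its interface; the table, the taxon names, the `min_count` cutoff and both function names are assumptions made up for the example, and the brute-force enumeration is only practical for very small inputs.

```python
from itertools import combinations

# Hypothetical taxa table: one entry per sample, mapping taxon -> read count.
taxa_table = {
    "sample_1": {"Bacteroides": 120, "Prevotella": 0, "Faecalibacterium": 45},
    "sample_2": {"Bacteroides": 80, "Prevotella": 10, "Faecalibacterium": 30},
    "sample_3": {"Bacteroides": 0, "Prevotella": 95, "Faecalibacterium": 0},
    "sample_4": {"Bacteroides": 60, "Prevotella": 0, "Faecalibacterium": 25},
}


def to_transactions(table, min_count=1):
    """Turn each sample into the set of taxa detected with at least min_count reads."""
    return [
        frozenset(taxon for taxon, count in counts.items() if count >= min_count)
        for counts in table.values()
    ]


def frequent_itemsets(transactions, min_support, max_size=3):
    """Brute-force search for taxa sets present in at least min_support of the samples."""
    items = sorted(set().union(*transactions))
    n = len(transactions)
    result = {}
    for size in range(1, max_size + 1):
        for candidate in combinations(items, size):
            # Support = fraction of samples that contain every taxon in the candidate.
            support = sum(1 for t in transactions if set(candidate) <= t) / n
            if support >= min_support:
                result[candidate] = support
    return result


transactions = to_transactions(taxa_table)
patterns = frequent_itemsets(transactions, min_support=0.75)
for itemset, support in patterns.items():
    print(itemset, support)
```

The choice of `min_count` decides what counts as "present", which is one concrete form of the input-type issue the abstract mentions: a different cutoff yields different transactions and therefore different patterns.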

Automatic Item Weight Generation for Pattern Mining and its Application

Developments in Data Extraction, Management, and Analysis ◽

10.4018/978-1-4666-2148-0.ch009 ◽

2013 ◽

pp. 187-207 ◽

Cited By ~ 1

Author(s):

Yun Sing Koh ◽

Russel Pears ◽

Gillian Dobbie

Keyword(s):

Uniform Distribution ◽

Real World ◽

Association Rule ◽

Association Rule Mining ◽

Natural Variation ◽

Pattern Mining ◽

Weighting Scheme ◽

Rule Mining ◽

Real World Application

Association rule mining discovers relationships among items in a transactional database. Most approaches assume that all items within a dataset have a uniform distribution with respect to support. However, this is not always the case, and weighted association rule mining (WARM) was introduced to assign importance to individual items. Previous approaches to the weighted association rule mining problem require users to assign weights to items, and in certain cases it is difficult to provide weights for all items within a dataset. In this paper, the authors propose a method based on a novel Valency model that automatically infers item weights from the interactions between items. The authors' experiments show that the weighting scheme produces rules that better capture the natural variation in a dataset than a miner that does not employ a weighting scheme. The authors applied the model in a real-world application to mine text from a given collection of documents. Item weighting enabled the authors to attach more importance to distinctive terms. The results demonstrate that keyword discrimination via item weighting leads to informative rules.
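
To make the role of item weights concrete, here is a minimal sketch of weighted support with user-assigned weights, the setting the abstract describes as the earlier approach. It is not the Valency model, whose details are not given here. The formula (plain support multiplied by the mean weight of the items) is one formulation assumed for illustration, and the documents, weights and function name are hypothetical.

```python
def weighted_support(transactions, itemset, weights):
    """Plain support of itemset scaled by the mean weight of its items (assumed formula)."""
    itemset = set(itemset)
    plain = sum(1 for t in transactions if itemset <= t) / len(transactions)
    mean_weight = sum(weights[item] for item in itemset) / len(itemset)
    return mean_weight * plain


# Hypothetical documents reduced to sets of terms.
docs = [
    {"the", "mining", "rule"},
    {"the", "mining"},
    {"the", "rule"},
    {"the", "weather"},
]

# Hypothetical hand-assigned weights: common words low, distinctive words high.
weights = {"the": 0.1, "mining": 0.9, "rule": 0.8, "weather": 0.7}

common = weighted_support(docs, {"the"}, weights)
distinctive = weighted_support(docs, {"mining", "rule"}, weights)
print(common, distinctive)
```

In this toy data the ubiquitous term "the" appears in every document, yet its low weight ranks it below the rarer pair of distinctive terms, which is the effect item weighting is meant to achieve.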

Machine learning and data mining frameworks for predicting drug response in cancer: An overview and a novel in silico screening process based on association rule mining

Pharmacology & Therapeutics ◽

10.1016/j.pharmthera.2019.107395 ◽

2019 ◽

Vol 203 ◽

pp. 107395 ◽

Cited By ~ 7

Author(s):

Konstantinos Vougas ◽

Theodore Sakellaropoulos ◽

Athanassios Kotsinas ◽

George-Romanos P. Foukas ◽

Andreas Ntargaras ◽

...

Keyword(s):

Machine Learning ◽

Data Mining ◽

Association Rule ◽

In Silico ◽

Association Rule Mining ◽

Drug Response ◽

Rule Mining ◽

In Silico Screening ◽

Screening Process

Minimum Threshold Determination Method based on Dataset Characteristics in Association Rule Mining

10.21203/rs.3.rs-728509/v1 ◽

2021 ◽

Author(s):

Erna Hikmawati ◽

Nur Ulfa Maulidevi ◽

Kridanto Surendro

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Threshold Value ◽

Extraction Process ◽

Frequent Itemset ◽

Rule Mining ◽

Minimum Threshold ◽

Minimum Support ◽

Associative Behavior

The process of extracting data to obtain useful information is known as data mining. One of the promising and widely used techniques for this extraction process is association rule mining. This technique is used to identify interesting relationships between sets of items in a dataset and to predict associative behavior for new data. The first step in association rule mining is to determine the frequent itemsets that will be involved in the rule formation process. In this step, a threshold known as the minimum support is used to eliminate items that do not belong to a frequent itemset. The threshold also plays an important role in determining the number of rules generated, and setting it wrongly can cause association rule mining to return no rules at all. Currently, the minimum support value is determined by the user, which is especially challenging for a user who does not know the characteristics of the dataset. In this study, a method was proposed to determine the minimum support value based on the characteristics of the dataset. This required certain criteria to be used as thresholds, which led to rules that adapt better to the needs of the user. The results showed that, for 6 of 8 datasets, a rule with lift ratio > 1 was obtained using the minimum threshold value determined through this method.
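
The quantities named in this abstract (support, the minimum support threshold and the lift ratio) can be computed directly from their standard definitions. The sketch below does so for a single rule; it does not implement the paper's threshold-selection method, and the basket data and function names are made up for the example.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def rule_metrics(transactions, antecedent, consequent):
    """Support, confidence and lift of the rule antecedent -> consequent."""
    s_a = support(transactions, antecedent)
    s_c = support(transactions, consequent)
    s_ac = support(transactions, set(antecedent) | set(consequent))
    confidence = s_ac / s_a if s_a else 0.0
    # Lift above 1 means the two sides co-occur more often than if independent.
    lift = confidence / s_c if s_c else 0.0
    return {"support": s_ac, "confidence": confidence, "lift": lift}


# Hypothetical market-basket data.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

metrics = rule_metrics(baskets, {"butter"}, {"bread"})
print(metrics)

# The same rule survives or disappears depending on the minimum support chosen.
for min_support in (0.3, 0.5):
    print(min_support, metrics["support"] >= min_support)
```

Here the rule butter -> bread has support 0.4, so it is kept at a minimum support of 0.3 and discarded at 0.5, which is why the choice of threshold directly controls how many rules are produced.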

Machine Learning Based Quantitative Association Rule Mining Method for Evaluating Cellular Network Performance

IEEE Access ◽

10.1109/access.2019.2953943 ◽

2019 ◽

Vol 7 ◽

pp. 166815-166822

Author(s):

Guanghui Fan ◽

Wenjuan Shi ◽

Liang Guo ◽

Jun Zeng ◽

Kaixuan Zhang ◽

...

Keyword(s):

Machine Learning ◽

Association Rule ◽

Association Rule Mining ◽

Cellular Network ◽

Network Performance ◽

Mining Method ◽

Rule Mining ◽

Quantitative Association Rule

Sequential Association Rule Mining Revisited: A Study Directed at Relational Pattern Mining for Multi-morbidity

10.1007/978-3-030-91100-3_20 ◽

2021 ◽

pp. 241-253

Author(s):

Alexandar Vincent-Paulraj ◽

Girvan Burnside ◽

Frans Coenen ◽

Munir Pirmohamed ◽

Lauren Walker

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Pattern Mining ◽

Rule Mining ◽

Sequential Association Rule Mining

Review on Association Rule Mining Techniques with Machine Learning Approach for Cancer Treatment

Asian Journal of Research in Social Sciences and Humanities ◽

10.5958/2249-7315.2016.00627.4 ◽

2016 ◽

Vol 6 (8) ◽

pp. 482

Author(s):

T. Kiruthiga ◽

M. Vijaya Kumar

Keyword(s):

Machine Learning ◽

Cancer Treatment ◽

Association Rule ◽

Association Rule Mining ◽

Learning Approach ◽

Rule Mining ◽

Machine Learning Approach

Development of Product Recommendation Engine By Collaborative Filtering and Association Rule Mining Using Machine Learning Algorithms

2020 Fourth International Conference on Inventive Systems and Control (ICISC) ◽

10.1109/icisc47916.2020.9171210 ◽

2020 ◽

Author(s):

Abhiraj Biswas ◽

Kaza Sai Vineeth ◽

Ayush Jain ◽

Mohana

Keyword(s):

Machine Learning ◽

Collaborative Filtering ◽

Association Rule ◽

Association Rule Mining ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Rule Mining ◽

Product Recommendation

Cellular Network Performance using Machine Learning based Quantitative Association Rule Mining Method

2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring) ◽

10.1109/vtc2020-spring48590.2020.9128988 ◽

2020 ◽

Author(s):

Guanghui Fan ◽

Juan Wang ◽

Kaixuan Zhang ◽

Jun Zeng ◽

Guan Gui

Keyword(s):

Machine Learning ◽

Association Rule ◽

Association Rule Mining ◽

Cellular Network ◽

Network Performance ◽

Mining Method ◽

Rule Mining ◽

Quantitative Association Rule

Cyclic Repeated Patterns in Sequential Pattern Mining Based on the Fuzzy C-Means Clustering and Association Rule Mining Technique

International Journal of Intelligent Engineering and Systems ◽

10.22266/ijies2017.0228.19 ◽

2017 ◽

Vol 10 (1) ◽

pp. 176-185

Author(s):

Ramani Selvanambi ◽

Jaisankar Natarajan

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Rule Mining ◽

Fuzzy C Means ◽

Mining Technique ◽

Fuzzy C Means Clustering

An Enhanced Approach to Mine Maximal Frequent Itemset using Maximal Frequent Itemset Prima Algorithm (MFIPA)

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s2.2035 ◽

2019 ◽

Vol 8 (S2) ◽

pp. 9-12

Author(s):

R. Smeeta Mary ◽

K. Perumal

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Decision Makers ◽

New Method ◽

Rule Mining

In data mining, finding frequent itemsets is an essential task. Data mining helps different decision makers identify the most useful knowledge. Frequent itemset generation is the precondition for association rule mining and its most time-consuming step. In this paper we suggest a new algorithm for frequent itemset detection that works with datasets in a distributed manner. The proposed algorithm introduces a new method for finding frequent itemsets without the need to create candidate itemsets. The approach can be implemented using a horizontal representation of the transaction datasets and by allocating prime values. It explores all the frequent itemsets present in the input and identifies the maximal frequent itemset according to the support. It was applied to different transaction databases and compared with two well-known algorithms, FP-Growth and Parallel Apriori, at different support levels. The experiments showed that the proposed algorithm attains a major time improvement over both algorithms.
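
The abstract does not spell out how prime values are allocated, so the following is only a sketch of one common prime-number encoding, not the MFIPA algorithm itself. The assumption is that each item is mapped to a distinct prime and each transaction is stored as the product of its items' primes, so that an itemset is contained in a transaction exactly when its product divides the transaction's product. All names and data are hypothetical.

```python
from math import prod


def first_primes(n):
    """Return the first n primes by trial division (fine for small item alphabets)."""
    primes = []
    candidate = 2
    while len(primes) < n:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def encode(transactions):
    """Map each item to a prime and each transaction to the product of its primes."""
    items = sorted(set().union(*transactions))
    prime_of = dict(zip(items, first_primes(len(items))))
    codes = [prod(prime_of[item] for item in t) for t in transactions]
    return prime_of, codes


def support_count(itemset, prime_of, codes):
    """Count the transactions whose code is divisible by the itemset's code."""
    key = prod(prime_of[item] for item in itemset)
    return sum(1 for code in codes if code % key == 0)


# Hypothetical transactions in horizontal layout (one set of items per row).
transactions = [{"a", "b", "c"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
prime_of, codes = encode(transactions)

print(prime_of, codes)
print(support_count({"a", "c"}, prime_of, codes))
print(support_count({"a", "b", "c"}, prime_of, codes))
```

With this encoding a containment test becomes a single modulo operation on integers. The sketch covers only the encoding and support counting; how the paper avoids candidate generation and distributes the work is not reproduced here.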
