Exact Maximum Clique Algorithm for Different Graph Types Using Machine Learning

Abstract Background Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients' disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning (ML) models and high dimensional data sources (electronic health records, MRI scans, cardiotocograms, etc). These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. Methods In this work, we focus on AdaBoost, a black box ML model that has been widely adopted in the CAD literature. We address the challenge -- to explain AdaBoost classification -- with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost's adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees (DT) of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model's decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting .

Download Full-text

Ada-WHIPS: Explaining AdaBoost Classification with Applications in the Health Sciences

10.21203/rs.2.19113/v3 ◽

2020 ◽

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R.M. Atif Azad

Keyword(s):

Machine Learning ◽

Statistical Tests ◽

Black Box ◽

Decision Makers ◽

Medical Practitioners ◽

Research Areas ◽

Novel Approach ◽

Mri Scans ◽

Practice Methods ◽

Logical Rules

Abstract Background Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients' disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning (ML) models and high dimensional data sources (electronic health records, MRI scans, cardiotocograms, etc). These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. Methods In this work, we focus on AdaBoost, a black box ML model that has been widely adopted in the CAD literature. We address the challenge -- to explain AdaBoost classification -- with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost's adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees (DT) of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model's decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting .

Download Full-text

Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-020-01201-2 ◽

2020 ◽

Vol 20 (1) ◽

Cited By ~ 1

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R. Muhammad Atif Azad

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Statistical Tests ◽

Black Box ◽

The State ◽

Related Data ◽

Research Areas ◽

Novel Approach ◽

Tightly Coupled ◽

Novel Algorithm

Abstract Background Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients’ disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning models and high dimensional data sources such as electronic health records, magnetic resonance imaging scans, cardiotocograms, etc. These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. Methods In this work, we focus on AdaBoost, a black box model that has been widely adopted in the CAD literature. We address the challenge – to explain AdaBoost classification – with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost’s adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model’s decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting. Results Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for specificity (mean precision 80%-99%). A very small trade-off in specificity is shown to guard against over-fitting which is a known problem in the state of the art methods. Conclusions The experimental results demonstrate the benefits of using our novel algorithm for explaining CAD AdaBoost classifiers widely found in the literature. Our tightly coupled, AdaBoost-specific approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models.

Download Full-text

Automated Software Design Reusability using a Unique Machine Learning Technique

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.e3010.039520 ◽

2020 ◽

Vol 9 (5) ◽

pp. 1825-1828

Keyword(s):

Machine Learning ◽

Software Development ◽

Traditional Approach ◽

Development Lifecycle ◽

Novel Approach ◽

Long Run ◽

Software Development Lifecycle ◽

Speed Up ◽

Automated Software ◽

Traditional Approaches

The era of machine learning (ML) has brought significant advancement into the traditional approaches of software development and services. Software reusability and design automation is a key requirement that can be handled through the integration of artificial intelligence (AI) capabilities with the traditional approach of software development lifecycle (SDLC) practices. The study introduces a novel approach of ML, which can assist inappropriate selection of reusable software components, which in the long run, can optimize the operational cost in the context of development practices and also speed up the service delivery performance of software engineering activities. The proposed model is validated through a numerical analysis that shows the effectiveness of the system in terms of both classification accuracy and computational efficiency.

Download Full-text

Ada-WHIPS: Explaining AdaBoost Classification with Applications in the Health Sciences

10.21203/rs.2.19113/v5 ◽

2020 ◽

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R.M. Atif Azad

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Statistical Tests ◽

Black Box ◽

The State ◽

Related Data ◽

Research Areas ◽

Novel Approach ◽

Tightly Coupled ◽

Novel Algorithm

Abstract Background Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients' disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning (ML) models and high dimensional data sources (electronic health records, MRI scans, cardiotocograms, etc). These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. Methods In this work, we focus on AdaBoost, a black box ML model that has been widely adopted in the CAD literature. We address the challenge -- to explain AdaBoost classification -- with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost's adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees (DT) of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model's decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting.Results Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for speciﬁcity (mean precision 80%-99%). A very small trade-oﬀ in speciﬁcity is shown to guard againstover-ﬁtting which is a known problem in the state of the art methods.Conclusions The experimental results demonstrate the beneﬁts of using our novel algorithm for explaining CAD AdaBoost classiﬁers widely found in the literature. Our tightly coupled, AdaBoost-speciﬁc approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models.

Download Full-text

A Novel Approach for Prediction of Warts Disease Treatment Methods: Machine Learning Techniques

SSRN Electronic Journal ◽

10.2139/ssrn.3463673 ◽

2019 ◽

Author(s):

Lavanya P. ◽

Priyanka Prakash ◽

Manasa M. ◽

R. G. Babukarthik ◽

Bonduvenkat B.

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Disease Treatment ◽

Treatment Methods ◽

Novel Approach ◽

Learning Techniques

Download Full-text

A Survey for Predicting Enzyme Family Classes Using Machine Learning Methods

Current Drug Targets ◽

10.2174/1389450119666181002143355 ◽

2019 ◽

Vol 20 (5) ◽

pp. 540-550 ◽

Cited By ~ 11

Author(s):

Jiu-Xin Tan ◽

Hao Lv ◽

Fang Wang ◽

Fu-Ying Dao ◽

Wei Chen ◽

...

Keyword(s):

Machine Learning ◽

Catalytic Mechanism ◽

Biological Function ◽

Learning Methods ◽

Biochemical Processes ◽

Machine Learning Methods ◽

Enzyme Family ◽

The Family ◽

Speed Up ◽

Family Classification

Enzymes are proteins that act as biological catalysts to speed up cellular biochemical processes. According to their main Enzyme Commission (EC) numbers, enzymes are divided into six categories: EC-1: oxidoreductase; EC-2: transferase; EC-3: hydrolase; EC-4: lyase; EC-5: isomerase and EC-6: synthetase. Different enzymes have different biological functions and acting objects. Therefore, knowing which family an enzyme belongs to can help infer its catalytic mechanism and provide information about the relevant biological function. With the large amount of protein sequences influxing into databanks in the post-genomics age, the annotation of the family for an enzyme is very important. Since the experimental methods are cost ineffective, bioinformatics tool will be a great help for accurately classifying the family of the enzymes. In this review, we summarized the application of machine learning methods in the prediction of enzyme family from different aspects. We hope that this review will provide insights and inspirations for the researches on enzyme family classification.

Download Full-text

Kernel methods in system identification, machine learning and function estimation: A survey

Automatica ◽

10.1016/j.automatica.2014.01.001 ◽

2014 ◽

Vol 50 (3) ◽

pp. 657-682 ◽

Cited By ~ 294

Author(s):

Gianluigi Pillonetto ◽

Francesco Dinuzzo ◽

Tianshi Chen ◽

Giuseppe De Nicolao ◽

Lennart Ljung

Keyword(s):

Machine Learning ◽

System Identification ◽

Kernel Methods ◽

Function Estimation ◽

And Function

Download Full-text

A Novel Approach for the Early Recognition of Diabetic Retinopathy using Machine Learning

2021 International Conference on Computer Communication and Informatics (ICCCI) ◽

10.1109/iccci50826.2021.9402566 ◽

2021 ◽

Author(s):

Akanksha Soni ◽

Avinash Rai

Keyword(s):

Machine Learning ◽

Diabetic Retinopathy ◽

Early Recognition ◽

Novel Approach

Download Full-text

Using Machine Learning for Quantum Annealing Accuracy Prediction

Algorithms ◽

10.3390/a14060187 ◽

2021 ◽

Vol 14 (6) ◽

pp. 187

Author(s):

Aaron Barbosa ◽

Elijah Pelofske ◽

Georg Hahn ◽

Hristo N. Djidjev

Keyword(s):

Machine Learning ◽

Maximum Clique ◽

Classification Model ◽

Maximum Clique Problem ◽

Problem Instance ◽

Np Hard ◽

Machine Learning Classification ◽

Hard Problems ◽

Problem Instances ◽

D Wave

Quantum annealers, such as the device built by D-Wave Systems, Inc., offer a way to compute solutions of NP-hard problems that can be expressed in Ising or quadratic unconstrained binary optimization (QUBO) form. Although such solutions are typically of very high quality, problem instances are usually not solved to optimality due to imperfections of the current generations quantum annealers. In this contribution, we aim to understand some of the factors contributing to the hardness of a problem instance, and to use machine learning models to predict the accuracy of the D-Wave 2000Q annealer for solving specific problems. We focus on the maximum clique problem, a classic NP-hard problem with important applications in network analysis, bioinformatics, and computational chemistry. By training a machine learning classification model on basic problem characteristics such as the number of edges in the graph, or annealing parameters, such as the D-Wave’s chain strength, we are able to rank certain features in the order of their contribution to the solution hardness, and present a simple decision tree which allows to predict whether a problem will be solvable to optimality with the D-Wave 2000Q. We extend these results by training a machine learning regression model that predicts the clique size found by D-Wave.

Download Full-text