Opening the Black Box: Interpretable Machine Learning for Geneticists

Author(s):  
Christina B. Azodi ◽  
Jiliang Tang ◽  
Shin-Han Shiu

Machine learning (ML) has emerged as a critical tool for making sense of the growing amount of genetic and genomic data because of its ability to find complex patterns in high-dimensional and heterogeneous data. While the complexity of ML models is what makes them powerful, it also makes them difficult to interpret. Fortunately, recent efforts to develop approaches that make the inner workings of ML models understandable to humans have improved our ability to derive novel biological insights using ML. Here we discuss the importance of interpretable ML, different strategies for interpreting ML models, and examples of how these strategies have been applied. Finally, we identify challenges and promising future directions for interpretable ML in genetics and genomics.
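
To make the idea of an interpretation strategy concrete, the following is a minimal, hypothetical sketch of one common model-agnostic approach (permutation feature importance) applied to synthetic genotype-like data; the mock SNP features, labels, and model choice are illustrative assumptions and are not taken from the review.

```python
# Minimal sketch: permutation feature importance on mock genotype data.
# All data and names (e.g. "SNP_i") are synthetic placeholders.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(500, 100)).astype(float)   # 500 samples x 100 mock SNPs (0/1/2)
y = (X[:, 0] + X[:, 5] > 2).astype(int)                  # phenotype driven by two loci

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr, y_tr)

# Shuffle each feature and measure the drop in held-out accuracy:
# large drops flag features the model relies on.
result = permutation_importance(model, X_te, y_te, n_repeats=10, random_state=0)
top = np.argsort(result.importances_mean)[::-1][:5]
for i in top:
    print(f"SNP_{i}: importance = {result.importances_mean[i]:.3f}")
```

Features whose shuffling causes the largest drop in held-out performance are the ones the model relies on most, which is the kind of signal that can then be followed up biologically.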

Author(s):  
Kacper Sokol ◽  
Peter Flach

Understanding data, models and predictions is important for machine learning applications. Due to the limitations of our spatial perception and intuition, analysing high-dimensional data is inherently difficult. Furthermore, black-box models achieving high predictive accuracy are widely used, yet the logic behind their predictions is often opaque. Textualisation -- a natural-language narrative of selected phenomena -- can tackle these shortcomings. When extended with argumentation theory, we can envisage machine learning models and predictions that argue persuasively for their choices.
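
As a rough illustration of what a textualised explanation might look like, below is a hypothetical sketch that narrates the top per-feature contributions behind a single prediction of a linear model; the dataset, wording template, and `textualise` helper are illustrative assumptions, not the authors' system.

```python
# Hypothetical sketch of "textualisation": turn a linear model's per-feature
# contributions for one prediction into a short natural-language narrative.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

data = load_breast_cancer()
X, y = data.data, data.target
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)).fit(X, y)

def textualise(x, top_k=3):
    """Narrate the top feature contributions behind a single prediction."""
    scaler = pipe.named_steps["standardscaler"]
    clf = pipe.named_steps["logisticregression"]
    z = scaler.transform(x.reshape(1, -1))[0]
    contrib = clf.coef_[0] * z                      # per-feature contribution to the logit
    order = np.argsort(np.abs(contrib))[::-1][:top_k]
    label = data.target_names[pipe.predict(x.reshape(1, -1))[0]]
    parts = [f"{data.feature_names[i]} pushed the score {'up' if contrib[i] > 0 else 'down'}"
             for i in order]
    return f"The model predicted '{label}' mainly because " + "; ".join(parts) + "."

print(textualise(X[0]))
```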


Author(s):  
Christopher J. Hansen ◽  
Dominic DiCostanzo ◽  
Randall J. Mumaw ◽  
Emily S. Patterson

The fields of healthcare and aviation can learn from one another about alerts and their potential for effective application through predictive analytics. We conducted a series of interactive discussions between an expert in alerts in aviation cockpits and graduate students specializing in the application of machine learning in healthcare, and particularly with respect to image analysis. We present our findings regarding insights for healthcare on alerts and for aviation on machine learning. Our findings suggest that ‘opening up the black box’ is important for highly skilled pilots to be able to process recommendations from complex algorithms in aviation, and that considering whether an alert or alarm is ‘actionable’ is important when directing the attention of nurses caring for more than one patient at a time in a hospital environment.


Author(s):  
Qianfan Wu ◽  
Adel Boueiz ◽  
Alican Bozkurt ◽  
Arya Masoomi ◽  
Allan Wang ◽  
...  

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality leads to unsatisfactory performance for many state-of-the-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked, hierarchical learning process. Deep learning has shown breakthrough performance in several areas, including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status from genomic datasets is still not well studied. In this article, we review four relevant articles identified through a thorough literature search. All four articles used auto-encoders to project high-dimensional genomic data into a low-dimensional space and then applied state-of-the-art machine learning algorithms to predict disease status from the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as those based on probe-wise screening or on principal component analysis. We also discuss the limitations of the current deep learning approach and possible improvements.
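
The pipeline described in the review (an auto-encoder for dimensionality reduction, followed by a conventional classifier on the learned codes) can be sketched roughly as follows; the layer sizes, mock expression data, and hyperparameters are illustrative placeholders rather than the settings used in the reviewed articles.

```python
# Sketch: auto-encoder compresses high-dimensional (mock) expression data,
# then a standard classifier predicts disease status from the learned codes.
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(600, 2000)).astype("float32")   # 600 samples x 2000 mock probes
y = (X[:, :10].sum(axis=1) > 0).astype(int)          # synthetic disease label

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Auto-encoder: 2000 -> 64 -> 2000, trained to reconstruct its own input.
inputs = tf.keras.Input(shape=(2000,))
code = tf.keras.layers.Dense(64, activation="relu")(inputs)
recon = tf.keras.layers.Dense(2000)(code)
autoencoder = tf.keras.Model(inputs, recon)
autoencoder.compile(optimizer="adam", loss="mse")
autoencoder.fit(X_tr, X_tr, epochs=20, batch_size=32, verbose=0)

# Encode both splits and train a conventional classifier on the 64-d codes.
encoder = tf.keras.Model(inputs, code)
clf = SVC().fit(encoder.predict(X_tr), y_tr)
print("test accuracy:", clf.score(encoder.predict(X_te), y_te))
```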


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 218936-218953
Author(s):  
Jose Tapia-Galisteo ◽  
Jose M. Iniesta ◽  
Carmen Perez-Gandia ◽  
Gema Garcia-Saez ◽  
Diego Urgeles Puertolas ◽  
...  

2018 ◽  
Vol 66 (4) ◽  
pp. 283-290 ◽  
Author(s):  
Johannes Brinkrolf ◽  
Barbara Hammer

Classification by means of machine learning models constitutes one relevant technology in process automation and predictive maintenance. However, common techniques such as deep networks or random forests suffer from their black-box characteristics and susceptibility to adversarial examples. In this contribution, we give an overview of a popular alternative technology from machine learning, namely modern variants of learning vector quantization (LVQ), which, due to their combined discriminative and generative nature, offer interpretability and the possibility of explicit reject options for irregular samples. We give an explicit bound on the minimum change required to alter the classification of an LVQ network with a reject option, and we demonstrate the efficiency of reject options in two examples.
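
To illustrate the flavour of an LVQ classifier with a reject option, here is a toy, hypothetical sketch: prototypes are adapted with LVQ1-style updates, and a sample is rejected when the relative distance margin between the two nearest prototypes falls below a threshold. The update rule, threshold, and data are illustrative and do not reproduce the bound derived in the paper.

```python
# Toy sketch: LVQ1-style classifier with a distance-margin reject option.
import numpy as np

class LVQWithReject:
    def __init__(self, lr=0.05, epochs=30, reject_threshold=0.1, seed=0):
        self.lr, self.epochs, self.tau = lr, epochs, reject_threshold
        self.rng = np.random.default_rng(seed)

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        # one prototype per class, initialised at the class mean
        self.w_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        for _ in range(self.epochs):
            for i in self.rng.permutation(len(X)):
                d = np.linalg.norm(self.w_ - X[i], axis=1)
                j = d.argmin()
                sign = 1.0 if self.classes_[j] == y[i] else -1.0   # LVQ1 update
                self.w_[j] += sign * self.lr * (X[i] - self.w_[j])
        return self

    def predict(self, X):
        out = []
        for x in X:
            d = np.linalg.norm(self.w_ - x, axis=1)
            d1, d2 = np.sort(d)[:2]
            margin = (d2 - d1) / (d2 + d1)          # relative distance margin
            out.append(None if margin < self.tau else self.classes_[d.argmin()])
        return out

# tiny demonstration: two Gaussian blobs plus an ambiguous point in between
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-2, 1, (50, 2)), rng.normal(2, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)
model = LVQWithReject().fit(X, y)
print(model.predict(np.array([[-2.0, 0.0], [0.0, 0.0], [2.0, 0.0]])))  # None marks a reject
```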


2021 ◽  
Author(s):  
Marcelo Cajias ◽  
Willwersch Jonas ◽  
Lorenz Felix ◽  
Franz Fuerst


Electronics ◽  
2021 ◽  
Vol 10 (15) ◽  
pp. 1861
Author(s):  
João Brito ◽  
Hugo Proença

Interpretability has made significant strides in recent years, enabling formerly black-box models to reach new levels of transparency. Such models can be particularly useful for broadening the applicability of machine learning-based systems to domains where, apart from the predictions, appropriate justifications are also required (e.g., forensics and medical image analysis). In this context, techniques that focus on visual explanations are of particular interest, due to their ability to directly portray the reasons that support a given prediction. Therefore, in this document, we present the core principles of interpretability and describe the main methods that deliver visual cues, including one that we designed specifically for periocular recognition. Based on these intuitions, our experiments show explanations that attempt to highlight the periocular components most important to a non-match decision. Finally, some particularly challenging scenarios are presented to support our conclusions and thoughts regarding future directions.
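
As a rough illustration of a visual-explanation technique of the kind discussed, the sketch below computes a simple gradient-based saliency map for an untrained toy CNN; the architecture, random input, and "match score" output are placeholders, not the models or data used in the paper.

```python
# Sketch: gradient-based saliency map for a toy, untrained (peri)ocular matcher.
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(64, 64, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),   # stand-in match / non-match score
])

image = tf.convert_to_tensor(np.random.rand(1, 64, 64, 1).astype("float32"))

with tf.GradientTape() as tape:
    tape.watch(image)
    score = model(image)[0, 0]          # predicted (non-)match probability

# |d score / d pixel|: large values mark pixels that most influence the decision.
saliency = tf.abs(tape.gradient(score, image))[0, :, :, 0].numpy()
print("most influential pixel:", np.unravel_index(saliency.argmax(), saliency.shape))
```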

