Applying Mondrian Cross-Conformal Prediction to Estimate Prediction Confidence on Large Imbalanced Bioactivity Datasets

Mapping Intimacies ◽

10.1101/116764 ◽

2017 ◽

Author(s):

Jiangming Sun ◽

Lars Carlsson ◽

Ernst Ahlberg ◽

Ulf Norinder ◽

Ola Engkvist ◽

...

Keyword(s):

Machine Learning ◽

Application Domain ◽

Minority Class ◽

Conformal Prediction ◽

Machine Learning Methods ◽

Prediction Confidence ◽

Qsar Modelling ◽

Calibration Sets ◽

Prediction Region ◽

Prediction Regions

ABSTRACTConformal prediction has been proposed as a more rigorous way to define prediction confidence compared to other application domain concepts that have earlier been used for QSAR modelling. One main advantage of such a method is that it provides a prediction region potentially with multiple predicted labels, which contrasts to the single valued (regression) or single label (classification) output predictions by standard QSAR modelling algorithms. Standard conformal prediction might not be suitable for imbalanced datasets. Therefore, Mondrian cross-conformal prediction (MCCP) which combines the Mondrian inductive conformal prediction with cross-fold calibration sets has been introduced. In this study, the MCCP method was applied to 18 publicly available datasets that have various imbalance levels varying from 1:10 to 1:1000 (ratio of active/inactive compounds). Our results show that MCCP in general performed well on cheminformatics datasets with various imbalance levels. More importantly, the method not only provides confidence of prediction and prediction regions compared to standard machine learning methods, but also produces valid predictions for the minority class. In addition, a compound similarity based nonconformity measure was investigated. Our results demonstrate that although it gives valid predictions, its efficiency is much worse than nonconformity measures obtained from supervised learning.

Editorial: Machine Learning Methods in QSAR Modelling

QSAR & Combinatorial Science ◽

10.1002/qsar.200390046 ◽

2003 ◽

Vol 22 (5) ◽

pp. 485-486 ◽

Cited By ~ 4

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods ◽

Qsar Modelling

Advanced Interpretable Machine Learning Methods for Clinical NGS Big Data of Complex Hereditary Diseases

10.3389/978-2-88966-274-6 ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Big Data ◽

Hereditary Diseases ◽

Learning Methods ◽

Machine Learning Methods ◽

Interpretable Machine Learning

Application of machine learning methods for automatic interpretation of open hole logging data

Neftyanoe khozyaystvo - Oil Industry ◽

10.24887/0028-2448-2020-11-44-47 ◽

2020 ◽

pp. 44-47

Author(s):

M.A. Basyrov ◽

◽

A.V. Akinshin ◽

I.R. Makhmutov ◽

Yu.D. Kantemirov ◽

...

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods ◽

Automatic Interpretation ◽

Open Hole

TESTING PREDICTION ACCURACY OF HDU ADMISSION FOLLOWING HIGH GRADE SEROUS ADVANCED OVARIAN CANCER CYTOREDUCTIVE SURGERY USING MACHINE LEARNING METHODS.

10.26226/morressier.5fa3ee5d55b1fd4cc4dd93d7 ◽

2020 ◽

Author(s):

Alexandros Laios ◽

Angelika Kaufmann ◽

Mohamed Otify ◽

Diederick De Jong ◽

Tim Broadhead ◽

...

Keyword(s):

Machine Learning ◽

Ovarian Cancer ◽

Cytoreductive Surgery ◽

Prediction Accuracy ◽

Advanced Ovarian Cancer ◽

High Grade ◽

Learning Methods ◽

Machine Learning Methods

Evolution of Machine Learning Methods for Memography Classification

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i3.499502 ◽

2018 ◽

Vol 6 (3) ◽

pp. 499-502

Author(s):

R. Swathi ◽

◽

R. Seshadri ◽

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods

FORECASTING THREATS AND CHOOSING THE OPTIMAL STRATEGY FOR ENSURING ECONOMIC SECURITY USING MACHINE LEARNING METHODS

Scientific Review Series 1 Economics and Law ◽

10.26653/2076-4650-2019-6-09 ◽

2019 ◽

pp. 115-123

Author(s):

Evgeniy A. Voronin ◽

◽

Igor V. Yushin ◽

Keyword(s):

Machine Learning ◽

Optimal Strategy ◽

Economic Security ◽

Learning Methods ◽

Machine Learning Methods

Utilizing Blockchain Technology in Social Media Bot Identification

10.36227/techrxiv.12049374 ◽

2020 ◽

Author(s):

Shreya Reddy ◽

Lisa Ewen ◽

Pankti Patel ◽

Prerak Patel ◽

Ankit Kundal ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Gold Standard ◽

The Internet ◽

Learning Models ◽

Current Time ◽

Machine Learning Methods ◽

Blockchain Technology ◽

Modern Age ◽

Machine Learning Models

<p>As bots become more prevalent and smarter in the modern age of the internet, it becomes ever more important that they be identified and removed. Recent research has dictated that machine learning methods are accurate and the gold standard of bot identification on social media. Unfortunately, machine learning models do not come without their negative aspects such as lengthy training times, difficult feature selection, and overwhelming pre-processing tasks. To overcome these difficulties, we are proposing a blockchain framework for bot identification. At the current time, it is unknown how this method will perform, but it serves to prove the existence of an overwhelming gap of research under this area.<i></i></p>

A Generalized Approach to Soil Strength Prediction With Machine Learning Methods

10.21236/ada464726 ◽

2006 ◽

Author(s):

Peter M. Semen

Keyword(s):

Machine Learning ◽

Soil Strength ◽

Strength Prediction ◽

Learning Methods ◽

Machine Learning Methods

Machine Learning Methods for Financial Forecasting: Application to the S&P 500

SSRN Electronic Journal ◽

10.2139/ssrn.2554146 ◽

2006 ◽

Cited By ~ 1

Author(s):

Babak Mahdavi Damghani

Keyword(s):

Machine Learning ◽

Financial Forecasting ◽

Learning Methods ◽

Machine Learning Methods

Can Machine-Learning Methods Predict the Outcome of an NBA Game?

SSRN Electronic Journal ◽

10.2139/ssrn.3208101 ◽

2018 ◽

Cited By ~ 1

Author(s):

Nachi Lieder

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods