Feature Constrained Multi-Task Learning Models for Spatiotemporal Event Forecasting

2017 ◽  
Vol 29 (5) ◽  
pp. 1059-1072 ◽  
Author(s):  
Liang Zhao ◽  
Qian Sun ◽  
Jieping Ye ◽  
Feng Chen ◽  
Chang-Tien Lu ◽  
...  
2022 ◽  
Author(s):  
Maede Maftouni ◽  
Bo Shen ◽  
Andrew Chung Chee Law ◽  
Niloofar Ayoobi Yazdi ◽  
Zhenyu Kong

The global extent of COVID-19 mutations and the consequent depletion of hospital resources highlighted the necessity of effective computer-assisted medical diagnosis. COVID-19 detection mediated by deep learning models can help diagnose this highly contagious disease and lower infectivity and mortality rates. Computed tomography (CT) is the preferred imaging modality for building automatic COVID-19 screening and diagnosis models. It is well known that the training set size significantly impacts the performance and generalization of deep learning models. However, accessing a large dataset of CT scan images for an emerging disease like COVID-19 is challenging. Therefore, data efficiency becomes a significant factor in choosing a learning model. To this end, we present a multi-task learning approach, namely a mask-guided attention (MGA) classifier, to improve the generalization and data efficiency of COVID-19 classification on lung CT scan images.

The novelty of this method lies in compensating for the scarcity of data by adding supervision from lesion masks, which increases the model's sensitivity to COVID-19 manifestations and helps both generalization and classification performance. Our proposed model achieves better overall performance than the single-task baseline and state-of-the-art models, as measured by various popular metrics. In our experiments with different percentages of data from our curated dataset, the classification performance gain from this multi-task learning approach is more significant for smaller training sizes. Furthermore, experimental results demonstrate that our method enhances the focus on the lesions, as witnessed by both attention and attribution maps, resulting in a more interpretable model.
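The multi-task objective the abstract describes, a classification loss plus auxiliary supervision from lesion masks, can be sketched in plain numpy. This is an illustrative version, not the authors' implementation: the sigmoid attention head, the single-weight classifier, and the fixed loss weight `lam` are assumptions made for the sketch.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mga_multitask_loss(features, lesion_mask, label, w_cls, lam=0.5):
    """Combined multi-task loss: classification plus mask supervision.

    features:    (H, W) feature map from a CNN backbone (assumed given)
    lesion_mask: (H, W) binary ground-truth lesion mask
    label:       scalar 0/1 COVID-19 label
    w_cls:       classifier weight applied to the pooled, attention-gated features
    lam:         weight of the auxiliary mask loss
    """
    eps = 1e-8
    # Predicted attention map: squash features into (0, 1)
    attn = sigmoid(features)
    # Auxiliary task: make attention match the lesion mask (binary cross-entropy)
    mask_loss = -np.mean(lesion_mask * np.log(attn + eps)
                         + (1 - lesion_mask) * np.log(1 - attn + eps))
    # Main task: classify from attention-gated, pooled features
    pooled = np.mean(attn * features)
    prob = sigmoid(w_cls * pooled)
    cls_loss = -(label * np.log(prob + eps) + (1 - label) * np.log(1 - prob + eps))
    return cls_loss + lam * mask_loss
```

With `lam=0` the model degenerates to the single-task baseline; the extra mask term is what supplies the additional supervision when labeled data is scarce.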


Circulation ◽  
2020 ◽  
Vol 142 (Suppl_4) ◽  
Author(s):  
ChienYu Chi ◽  
Yen-Pin Chen ◽  
Adrian Winkler ◽  
Kuan-Chun Fu ◽  
Fie Xu ◽  
...  

Introduction: Predicting rare catastrophic events is challenging due to a lack of targets. Here we employed a multi-task learning method and demonstrated that substantial gains in accuracy and generalizability were achieved by sharing representations between related tasks. Methods: Starting from the Taiwan National Health Insurance Research Database, we selected adults (>20 years) who experienced in-hospital cardiac arrest, but not out-of-hospital cardiac arrest, over 8 years (2003-2010), and built a dataset using de-identified claims from Emergency Department (ED) visits and hospitalizations. The final dataset had 169,287 patients, randomly split into three sections: train 70%, validation 15%, and test 15%. Two outcomes, 30-day readmission and 30-day mortality, were chosen. We constructed the deep learning system in two steps. We first used a taxonomy mapping system, Text2Node, to generate a distributed representation for each concept. We then applied a multilevel hierarchical model based on the long short-term memory (LSTM) architecture. Multi-task models used gradient similarity to prioritize the desired task over auxiliary tasks. Single-task models were trained for each desired task. All models share the same architecture and are trained with the same input data. Results: Each model was optimized to maximize AUROC on the validation set, with the final metrics calculated on the held-out test set. We demonstrated that multi-task deep learning models outperform single-task deep learning models on both tasks. While readmission had roughly 30% positives and showed minuscule improvements, the mortality task saw more improvement between models. We hypothesize that this is a result of the data imbalance: mortality had roughly 5% positives, and the auxiliary tasks help the model interpret the data and generalize better. Conclusion: Multi-task deep learning models outperform single-task deep learning models in predicting 30-day readmission and mortality in in-hospital cardiac arrest patients.
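The gradient-similarity mechanism mentioned in Methods, prioritizing the desired task over auxiliary tasks, can be sketched as follows. This is one plausible reading, not the authors' exact rule: auxiliary gradients are scaled by their cosine similarity with the main task's gradient and dropped when the directions conflict.

```python
import numpy as np

def weight_auxiliary_gradient(g_main, g_aux):
    """Scale an auxiliary task's gradient by its cosine similarity with the
    main task's gradient; drop it entirely when the two directions conflict."""
    cos = np.dot(g_main, g_aux) / (
        np.linalg.norm(g_main) * np.linalg.norm(g_aux) + 1e-12)
    return max(cos, 0.0) * g_aux

def combined_update(g_main, aux_grads, lr=0.1):
    """One gradient step that prioritizes the main task: auxiliary gradients
    only contribute in directions that agree with the main task's gradient."""
    total = g_main.copy()
    for g in aux_grads:
        total = total + weight_auxiliary_gradient(g_main, g)
    return -lr * total
```

An auxiliary gradient pointing opposite to the main task's gradient contributes nothing, so the auxiliary tasks can only reinforce, never override, the desired task.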


Author(s):  
Xu Chu ◽  
Yang Lin ◽  
Yasha Wang ◽  
Leye Wang ◽  
Jiangtao Wang ◽  
...  

Drug-drug interactions (DDIs) are a major cause of preventable hospitalizations and deaths. Recently, researchers in the AI community have tried to improve DDI prediction in two directions: incorporating multiple drug features to better model the pharmacodynamics, and adopting multi-task learning to exploit associations among DDI types. However, these two directions are difficult to reconcile due to the sparse nature of DDI labels, which inflates the risk of overfitting in multi-task learning models when multiple drug features are incorporated. In this paper, we propose MLRDA, a multi-task semi-supervised learning framework for DDI prediction. MLRDA effectively exploits information beneficial for DDI prediction in unlabeled drug data by leveraging a novel unsupervised disentangling loss, CuXCov. The CuXCov loss cooperates with the classification loss to disentangle the DDI-prediction-relevant part of a representation learnt by an autoencoder from the irrelevant part, which eases the difficulty of mining useful information for DDI prediction in both labeled and unlabeled drug data. Moreover, MLRDA adopts a multi-task learning framework to exploit associations among DDI types. Experimental results on real-world datasets demonstrate that MLRDA significantly outperforms state-of-the-art DDI prediction methods by up to 10.3% in AUPR.
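The abstract does not spell out the CuXCov loss, but disentangling losses of this family typically penalize the cross-covariance between the two halves of a latent code. The numpy sketch below shows a generic cross-covariance penalty of that kind, as an assumption about the general shape of such a loss, not the paper's exact formulation.

```python
import numpy as np

def xcov_loss(z_rel, z_irr):
    """Cross-covariance penalty between the prediction-relevant code z_rel
    and the prediction-irrelevant code z_irr of an autoencoder.

    z_rel: (N, d1) batch of relevant codes; z_irr: (N, d2) irrelevant codes.
    Driving the cross-covariance to zero decorrelates the two parts, so the
    classifier can rely on z_rel alone.
    """
    zr = z_rel - z_rel.mean(axis=0)
    zi = z_irr - z_irr.mean(axis=0)
    c = zr.T @ zi / z_rel.shape[0]      # (d1, d2) cross-covariance matrix
    return 0.5 * np.sum(c ** 2)
```

The loss is zero exactly when every relevant dimension is uncorrelated with every irrelevant dimension over the batch.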


Author(s):  
Shengchao Liu ◽  
Yingyu Liang ◽  
Anthony Gitter

In settings with related prediction tasks, integrated multi-task learning models can often improve performance relative to independent single-task models. However, even when the average task performance improves, individual tasks may experience negative transfer, in which the multi-task model’s predictions are worse than the single-task model’s. We show the prevalence of negative transfer in a computational chemistry case study with 128 tasks and introduce a framework that provides a foundation for reducing negative transfer in multi-task models. Our Loss-Balanced Task Weighting approach dynamically updates task weights during model training to control the influence of individual tasks.
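A common way to realize dynamic loss-based task weighting, consistent with the idea described here though the exact formula is an assumption of this sketch, is to weight each task by the ratio of its current loss to its initial loss, raised to a balancing exponent:

```python
def lbtw_weights(initial_losses, current_losses, alpha=0.5):
    """Loss-balanced task weighting (sketch): each task's weight is the ratio
    of its current loss to its loss at the start of training, raised to alpha.
    Tasks that have already improved a lot are down-weighted, limiting their
    ability to dominate the shared representation and hurt the other tasks."""
    return [(cur / init) ** alpha
            for init, cur in zip(initial_losses, current_losses)]
```

A task whose loss has not moved keeps weight 1.0, while a task whose loss has dropped to a quarter of its starting value is down-weighted to 0.5 (with `alpha=0.5`), so slower-learning tasks retain influence on the shared parameters.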


Author(s):  
Gaku Morio ◽  
Katsuhide Fujita

This paper focuses on fundamental research combining syntactic knowledge with neural methods, using syntactic information in argument component identification and classification (AC-I/C) tasks in argument mining (AM). Our paper makes the following contributions: 1) We propose a way of incorporating a syntactic GCN into multi-task learning models for AC-I/C tasks. 2) We demonstrate the effectiveness of the proposed syntactic GCN in controlled experiments on several datasets. We also found that syntactic GCNs are promising for lexically independent scenarios. Our experiment code is available for reproducibility.
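For readers unfamiliar with graph convolutional networks, a single GCN propagation step over a syntactic graph has the standard form H' = ReLU(Â H W), where Â is the adjacency matrix with self-loops, symmetrically normalized. The numpy sketch below shows that generic layer; the specific syntactic-GCN variant in the paper may differ.

```python
import numpy as np

def gcn_layer(A, H, W):
    """One generic graph-convolution step: H' = relu(A_norm @ H @ W).

    A: (n, n) adjacency matrix of the syntactic graph (e.g. dependency arcs)
    H: (n, d) node features (e.g. token embeddings)
    W: (d, d') learnable projection
    """
    A_hat = A + np.eye(A.shape[0])            # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    A_norm = D_inv_sqrt @ A_hat @ D_inv_sqrt  # symmetric normalization
    return np.maximum(A_norm @ H @ W, 0.0)    # ReLU activation
```

Each step mixes every token's representation with those of its syntactic neighbors, which is what lets the model exploit structure beyond the raw word sequence.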

