Multi-Task Deep Learning Model with an Attention Mechanism for Ship Accident Sentence Prediction

The number of ship accidents occurring in the Korean ocean has been steadily increasing year by year. The Korea Maritime Safety Tribunal (KMST) has published verdicts to ensure that the relevant personnel can share judgment on these accidents. As of 2020, there have been 3156 ship accidents; thus, it is difficult for the relevant personnel to study these various accidents by only reading the verdicts. Therefore, in this study, we propose a multi-task deep learning model with an attention mechanism for predicting the sentencing of ship accidents. The tasks are accident types, applied articles, and the sentencing of ship accidents. The proposed model was tested under verdicts published by the KMST between 2010 and 2019. Through experiments, we show that the proposed model can improve the performance of sentence prediction and can assist the relevant personnel to study these accidents.

Download Full-text

Interpretable deep learning model for building energy consumption prediction based on attention mechanism

Energy and Buildings ◽

10.1016/j.enbuild.2021.111379 ◽

2021 ◽

pp. 111379

Author(s):

Yuan Gao ◽

Yingjun Ruan

Keyword(s):

Deep Learning ◽

Energy Consumption ◽

Building Energy ◽

Learning Model ◽

Attention Mechanism ◽

Building Energy Consumption ◽

Energy Consumption Prediction ◽

Deep Learning Model ◽

Consumption Prediction

Download Full-text

Cross-Age Face Recognition Using Deep Learning Model Based on Dual Attention Mechanism

Lecture Notes in Electrical Engineering - Communications, Signal Processing, and Systems ◽

10.1007/978-981-15-8411-4_251 ◽

2021 ◽

pp. 1911-1919

Author(s):

Jialve Wang ◽

Shenghong Li ◽

Fucai Luo

Keyword(s):

Deep Learning ◽

Face Recognition ◽

Learning Model ◽

Attention Mechanism ◽

Model Based ◽

Deep Learning Model

Download Full-text

Content Noise Detection Model Using Deep Learning in Web Forums

Sustainability ◽

10.3390/su12125074 ◽

2020 ◽

Vol 12 (12) ◽

pp. 5074

Author(s):

Jiyoung Woo ◽

Jaeseok Yun

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Model ◽

Detection Model ◽

Proposed Model ◽

Web Forum ◽

Web Forums ◽

Conventional Machine ◽

Text Features ◽

Deep Learning Model

Spam posts in web forum discussions cause user inconvenience and lower the value of the web forum as an open source of user opinion. In this regard, as the importance of a web post is evaluated in terms of the number of involved authors, noise distorts the analysis results by adding unnecessary data to the opinion analysis. Here, in this work, an automatic detection model for spam posts in web forums using both conventional machine learning and deep learning is proposed. To automatically differentiate between normal posts and spam, evaluators were asked to recognize spam posts in advance. To construct the machine learning-based model, text features from posted content using text mining techniques from the perspective of linguistics were extracted, and supervised learning was performed to distinguish content noise from normal posts. For the deep learning model, raw text including and excluding special characters was utilized. A comparison analysis on deep neural networks using the two different recurrent neural network (RNN) models of the simple RNN and long short-term memory (LSTM) network was also performed. Furthermore, the proposed model was applied to two web forums. The experimental results indicate that the deep learning model affords significant improvements over the accuracy of conventional machine learning associated with text features. The accuracy of the proposed model using LSTM reaches 98.56%, and the precision and recall of the noise class reach 99% and 99.53%, respectively.

Download Full-text

ECOVNet: a highly effective ensemble based deep learning model for detecting COVID-19

PeerJ Computer Science ◽

10.7717/peerj-cs.551 ◽

2021 ◽

Vol 7 ◽

pp. e551

Author(s):

Nihad Karim Chowdhury ◽

Muhammad Ashad Kabir ◽

Md. Muhtadir Rahman ◽

Noortaz Rezoana

Keyword(s):

Deep Learning ◽

Detection System ◽

Learning Model ◽

Classification Performance ◽

Data Sets ◽

X Rays ◽

Proposed Model ◽

Highly Effective ◽

Chest X Ray ◽

Deep Learning Model

The goal of this research is to develop and implement a highly effective deep learning model for detecting COVID-19. To achieve this goal, in this paper, we propose an ensemble of Convolutional Neural Network (CNN) based on EfficientNet, named ECOVNet, to detect COVID-19 from chest X-rays. To make the proposed model more robust, we have used one of the largest open-access chest X-ray data sets named COVIDx containing three classes—COVID-19, normal, and pneumonia. For feature extraction, we have applied an effective CNN structure, namely EfficientNet, with ImageNet pre-training weights. The generated features are transferred into custom fine-tuned top layers followed by a set of model snapshots. The predictions of the model snapshots (which are created during a single training) are consolidated through two ensemble strategies, i.e., hard ensemble and soft ensemble, to enhance classification performance. In addition, a visualization technique is incorporated to highlight areas that distinguish classes, thereby enhancing the understanding of primal components related to COVID-19. The results of our empirical evaluations show that the proposed ECOVNet model outperforms the state-of-the-art approaches and significantly improves detection performance with 100% recall for COVID-19 and overall accuracy of 96.07%. We believe that ECOVNet can enhance the detection of COVID-19 disease, and thus, underpin a fully automated and efficacious COVID-19 detection system.

Download Full-text

Which Attributes Matter the Most for Loan Origination? A Neural Attention Approach

10.20944/preprints202002.0180.v1 ◽

2020 ◽

Author(s):

Antonios Alexos ◽

Sotirios Chatzis

Keyword(s):

Deep Learning ◽

Real World ◽

Individual Case ◽

Learning Model ◽

Attention Mechanism ◽

Novel Approach ◽

Specific Individual ◽

Real World Datasets ◽

Deep Learning Model

In this paper we address the understanding of the problem, of why a deep learning model decides that an individual is eligible for a loan or not. Here we propose a novel approach for inferring, which attributes matter the most, for making a decision in each specific individual case. Specifically we leverage concepts from neural attention to devise a novel feature wise attention mechanism. As we show, using real world datasets, our approach offers unique insights into the importance of various features, by producing a decision explanation for each specific loan case. At the same time, we observe that our novel mechanism, generates decisions which are much closer to the decisions generated by human experts, compared to the existent competitors.

Download Full-text

Hybrid deep learning model using recurrent neural network and gated recurrent unit for heart disease prediction

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i6.pp5467-5476 ◽

2021 ◽

Vol 11 (6) ◽

pp. 5467

Author(s):

Surenthiran Krishnan ◽

Pritheega Magalingam ◽

Roslina Ibrahim

Keyword(s):

Neural Network ◽

Heart Disease ◽

Deep Learning ◽

Recurrent Neural Network ◽

Short Term Memory ◽

Learning Model ◽

Disease Prediction ◽

The Neural Network ◽

Proposed Model ◽

Deep Learning Model

<span>This paper proposes a new hybrid deep learning model for heart disease prediction using recurrent neural network (RNN) with the combination of multiple gated recurrent units (GRU), long short-term memory (LSTM) and Adam optimizer. This proposed model resulted in an outstanding accuracy of 98.6876% which is the highest in the existing model of RNN. The model was developed in Python 3.7 by integrating RNN in multiple GRU that operates in Keras and Tensorflow as the backend for deep learning process, supported by various Python libraries. The recent existing models using RNN have reached an accuracy of 98.23% and deep neural network (DNN) has reached 98.5%. The common drawbacks of the existing models are low accuracy due to the complex build-up of the neural network, high number of neurons with redundancy in the neural network model and imbalance datasets of Cleveland. Experiments were conducted with various customized model, where results showed that the proposed model using RNN and multiple GRU with synthetic minority oversampling technique (SMOTe) has reached the best performance level. This is the highest accuracy result for RNN using Cleveland datasets and much promising for making an early heart disease prediction for the patients.</span>

Download Full-text

Multi-Task Model for Esophageal Lesion Analysis Using Endoscopic Images: Classification with Image Retrieval and Segmentation with Attention

Sensors ◽

10.3390/s22010283 ◽

2021 ◽

Vol 22 (1) ◽

pp. 283

Author(s):

Xiaoyuan Yu ◽

Suigu Tang ◽

Chak Fong Cheang ◽

Hon Ho Yu ◽

I Cheong Choi

Keyword(s):

Deep Learning ◽

Image Retrieval ◽

Learning Model ◽

Endoscopic Images ◽

Proposed Model ◽

Esophageal Lesion ◽

Lesion Analysis ◽

Diagnosis Accuracy ◽

Segmentation Task ◽

Deep Learning Model

The automatic analysis of endoscopic images to assist endoscopists in accurately identifying the types and locations of esophageal lesions remains a challenge. In this paper, we propose a novel multi-task deep learning model for automatic diagnosis, which does not simply replace the role of endoscopists in decision making, because endoscopists are expected to correct the false results predicted by the diagnosis system if more supporting information is provided. In order to help endoscopists improve the diagnosis accuracy in identifying the types of lesions, an image retrieval module is added in the classification task to provide an additional confidence level of the predicted types of esophageal lesions. In addition, a mutual attention module is added in the segmentation task to improve its performance in determining the locations of esophageal lesions. The proposed model is evaluated and compared with other deep learning models using a dataset of 1003 endoscopic images, including 290 esophageal cancer, 473 esophagitis, and 240 normal. The experimental results show the promising performance of our model with a high accuracy of 96.76% for the classification and a Dice coefficient of 82.47% for the segmentation. Consequently, the proposed multi-task deep learning model can be an effective tool to help endoscopists in judging esophageal lesions.

Download Full-text

An Improved Deep Learning Model for Traffic Crash Prediction

Journal of Advanced Transportation ◽

10.1155/2018/3869106 ◽

2018 ◽

Vol 2018 ◽

pp. 1-13 ◽

Cited By ~ 14

Author(s):

Chunjiao Dong ◽

Chunfu Shao ◽

Juan Li ◽

Zhihua Xiong

Keyword(s):

Deep Learning ◽

Feature Learning ◽

Learning Model ◽

Fine Tuning ◽

Crash Prediction ◽

Traffic Crash ◽

Explanatory Variables ◽

Feature Representations ◽

Proposed Model ◽

Deep Learning Model

Machine-learning technology powers many aspects of modern society. Compared to the conventional machine learning techniques that were limited in processing natural data in the raw form, deep learning allows computational models to learn representations of data with multiple levels of abstraction. In this study, an improved deep learning model is proposed to explore the complex interactions among roadways, traffic, environmental elements, and traffic crashes. The proposed model includes two modules, an unsupervised feature learning module to identify functional network between the explanatory variables and the feature representations and a supervised fine tuning module to perform traffic crash prediction. To address the unobserved heterogeneity issues in the traffic crash prediction, a multivariate negative binomial (MVNB) model is embedding into the supervised fine tuning module as a regression layer. The proposed model was applied to the dataset that was collected from Knox County in Tennessee to validate the performances. The results indicate that the feature learning module identifies relational information between the explanatory variables and the feature representations, which reduces the dimensionality of the input and preserves the original information. The proposed model that includes the MVNB regression layer in the supervised fine tuning module can better account for differential distribution patterns in traffic crashes across injury severities and provides superior traffic crash predictions. The findings suggest that the proposed model is a superior alternative for traffic crash predictions and the average accuracy of the prediction that was measured by RMSD can be improved by 84.58% and 158.27% compared to the deep learning model without the regression layer and the SVM model, respectively.

Download Full-text

Novel deep learning-based transcriptome data analysis for drug-drug interaction prediction with an application in diabetes

BMC Bioinformatics ◽

10.1186/s12859-021-04241-1 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Qichao Luo ◽

Shenglong Mo ◽

Yunfei Xue ◽

Xiangzhou Zhang ◽

Yuliang Gu ◽

...

Keyword(s):

Deep Learning ◽

Drug Interaction ◽

Learning Model ◽

Superior Performance ◽

Public Health Issue ◽

Transcriptome Data ◽

Proposed Model ◽

Drug Drug Interaction ◽

Deep Learning Model ◽

Drugbank Database

Abstract Background Drug-drug interaction (DDI) is a serious public health issue. The L1000 database of the LINCS project has collected millions of genome-wide expressions induced by 20,000 small molecular compounds on 72 cell lines. Whether this unified and comprehensive transcriptome data resource can be used to build a better DDI prediction model is still unclear. Therefore, we developed and validated a novel deep learning model for predicting DDI using 89,970 known DDIs extracted from the DrugBank database (version 5.1.4). Results The proposed model consists of a graph convolutional autoencoder network (GCAN) for embedding drug-induced transcriptome data from the L1000 database of the LINCS project; and a long short-term memory (LSTM) for DDI prediction. Comparative evaluation of various machine learning methods demonstrated the superior performance of our proposed model for DDI prediction. Many of our predicted DDIs were revealed in the latest DrugBank database (version 5.1.7). In the case study, we predicted drugs interacting with sulfonylureas to cause hypoglycemia and drugs interacting with metformin to cause lactic acidosis, and showed both to induce effects on the proteins involved in the metabolic mechanism in vivo. Conclusions The proposed deep learning model can accelerate the discovery of new DDIs. It can support future clinical research for safer and more effective drug co-prescription.

Download Full-text

An Interpretable Deep Learning Model for Automatic Sound Classification

Electronics ◽

10.3390/electronics10070850 ◽

2021 ◽

Vol 10 (7) ◽

pp. 850

Author(s):

Pablo Zinemanas ◽

Martín Rocamora ◽

Marius Miron ◽

Frederic Font ◽

Xavier Serra

Keyword(s):

Deep Learning ◽

Web Application ◽

Domain Knowledge ◽

Learning Model ◽

Unintended Effects ◽

Learning Models ◽

Time Frequency ◽

Sound Classification ◽

Proposed Model ◽

Deep Learning Model

Deep learning models have improved cutting-edge technologies in many research areas, but their black-box structure makes it difficult to understand their inner workings and the rationale behind their predictions. This may lead to unintended effects, such as being susceptible to adversarial attacks or the reinforcement of biases. There is still a lack of research in the audio domain, despite the increasing interest in developing deep learning models that provide explanations of their decisions. To reduce this gap, we propose a novel interpretable deep learning model for automatic sound classification, which explains its predictions based on the similarity of the input to a set of learned prototypes in a latent space. We leverage domain knowledge by designing a frequency-dependent similarity measure and by considering different time-frequency resolutions in the feature space. The proposed model achieves results that are comparable to that of the state-of-the-art methods in three different sound classification tasks involving speech, music, and environmental audio. In addition, we present two automatic methods to prune the proposed model that exploit its interpretability. Our system is open source and it is accompanied by a web application for the manual editing of the model, which allows for a human-in-the-loop debugging approach.

Download Full-text