Drug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Loss

Drug-drug interactions (DDIs) may bring huge health risks and dangerous effects to a patient’s body when taking two or more drugs at the same time or within a certain period of time. Therefore, the automatic extraction of unknown DDIs has great potential for the development of pharmaceutical agents and the safety of drug use. In this article, we propose a novel recurrent hybrid convolutional neural network (RHCNN) for DDI extraction from biomedical literature. In the embedding layer, the texts mentioning two entities are represented as a sequence of semantic embeddings and position embeddings. In particular, the complete semantic embedding is obtained by the information fusion between a word embedding and its contextual information which is learnt by recurrent structure. After that, the hybrid convolutional neural network is employed to learn the sentence-level features which consist of the local context features from consecutive words and the dependency features between separated words for DDI extraction. Lastly but most significantly, in order to make up for the defects of the traditional cross-entropy loss function when dealing with class imbalanced data, we apply an improved focal loss function to mitigate against this problem when using the DDIExtraction 2013 dataset. In our experiments, we achieve DDI automatic extraction with a micro F-score of 75.48% on the DDIExtraction 2013 dataset, outperforming the state-of-the-art approach by 2.49%.

Download Full-text

Chemical-protein interaction extraction from biomedical literature: a hierarchical recurrent convolutional neural network method

International Journal of Data Mining and Bioinformatics ◽

10.1504/ijdmb.2019.099725 ◽

2019 ◽

Vol 22 (2) ◽

pp. 113 ◽

Cited By ~ 1

Author(s):

Cong Sun ◽

Zhihao Yang ◽

Lei Wang ◽

Yin Zhang ◽

Hongfei Lin ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Protein Interaction ◽

Biomedical Literature ◽

Neural Network Method ◽

Network Method ◽

Interaction Extraction

Download Full-text

Chemical-protein interaction extraction from biomedical literature: a hierarchical recurrent convolutional neural network method

International Journal of Data Mining and Bioinformatics ◽

10.1504/ijdmb.2019.10021458 ◽

2019 ◽

Vol 22 (2) ◽

pp. 113

Author(s):

Yijia Zhang ◽

Kan Xu ◽

Liang Yang ◽

Jian Wang ◽

Hongfei Lin ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Protein Interaction ◽

Biomedical Literature ◽

Neural Network Method ◽

Network Method ◽

Interaction Extraction

Download Full-text

Drug drug interaction extraction from biomedical literature using syntax convolutional neural network

Bioinformatics ◽

10.1093/bioinformatics/btw486 ◽

2016 ◽

pp. btw486 ◽

Cited By ~ 34

Author(s):

Zhehuan Zhao ◽

Zhihao Yang ◽

Ling Luo ◽

Hongfei Lin ◽

Jian Wang

Keyword(s):

Neural Network ◽

Drug Interaction ◽

Convolutional Neural Network ◽

Biomedical Literature ◽

Interaction Extraction ◽

Drug Drug Interaction

Download Full-text

Attention-Based Multi-Scale Convolutional Neural Network (A+MCNN) for Multi-Class Classification in Road Images

Sensors ◽

10.3390/s21155137 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5137

Author(s):

Elham Eslami ◽

Hae-Bum Yun

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Contextual Information ◽

Automated Classification ◽

Automated Recognition ◽

Pavement Distress ◽

Multi Scale ◽

Pavement Distresses ◽

Multi Class Classification ◽

Transportation Applications

Automated pavement distress recognition is a key step in smart infrastructure assessment. Advances in deep learning and computer vision have improved the automated recognition of pavement distresses in road surface images. This task remains challenging due to the high variation of defects in shapes and sizes, demanding a better incorporation of contextual information into deep networks. In this paper, we show that an attention-based multi-scale convolutional neural network (A+MCNN) improves the automated classification of common distress and non-distress objects in pavement images by (i) encoding contextual information through multi-scale input tiles and (ii) employing a mid-fusion approach with an attention module for heterogeneous image contexts from different input scales. A+MCNN is trained and tested with four distress classes (crack, crack seal, patch, pothole), five non-distress classes (joint, marker, manhole cover, curbing, shoulder), and two pavement classes (asphalt, concrete). A+MCNN is compared with four deep classifiers that are widely used in transportation applications and a generic CNN classifier (as the control model). The results show that A+MCNN consistently outperforms the baselines by 1∼26% on average in terms of the F-score. A comprehensive discussion is also presented regarding how these classifiers perform differently on different road objects, which has been rarely addressed in the existing literature.

Download Full-text

Low‐dose CT denoising via convolutional neural network with an observer loss function

Medical Physics ◽

10.1002/mp.15161 ◽

2021 ◽

Author(s):

Minah Han ◽

Hyunjung Shim ◽

Jongduk Baek

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Loss Function ◽

Low Dose ◽

Low Dose Ct

Download Full-text

Forecasting the power consumption of a rotor spinning machine by using an adaptive squeeze and excitation convolutional neural network with imbalanced data

Journal of Cleaner Production ◽

10.1016/j.jclepro.2020.122864 ◽

2020 ◽

Vol 275 ◽

pp. 122864

Author(s):

Chuqiao Xu ◽

Junliang Wang ◽

Jie Zhang

Keyword(s):

Neural Network ◽

Power Consumption ◽

Convolutional Neural Network ◽

Imbalanced Data ◽

Rotor Spinning

Download Full-text

Speckle Noise Removal in Ultrasound Images Using a Deep Convolutional Neural Network and a Specially Designed Loss Function

Multiscale Multimodal Medical Imaging - Lecture Notes in Computer Science ◽

10.1007/978-3-030-37969-8_11 ◽

2019 ◽

pp. 85-92

Author(s):

Danlei Feng ◽

Weichen Wu ◽

Hongfeng Li ◽

Quanzheng Li

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Loss Function ◽

Speckle Noise ◽

Noise Removal ◽

Deep Convolutional Neural Network ◽

Ultrasound Images

Download Full-text

Attention Enhanced Serial Unet++ Network for Removing Unevenly Distributed Haze

Electronics ◽

10.3390/electronics10222868 ◽

2021 ◽

Vol 10 (22) ◽

pp. 2868

Author(s):

Wenxuan Zhao ◽

Yaqin Zhao ◽

Liqi Feng ◽

Jiaxi Tang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real World ◽

Large Scale ◽

Learning Strategy ◽

Contextual Information ◽

Small Scale ◽

Image Dehazing ◽

Atmospheric Scattering ◽

Real World Datasets

The purpose of image dehazing is the reduction of the image degradation caused by suspended particles for supporting high-level visual tasks. Besides the atmospheric scattering model, convolutional neural network (CNN) has been used for image dehazing. However, the existing image dehazing algorithms are limited in face of unevenly distributed haze and dense haze in real-world scenes. In this paper, we propose a novel end-to-end convolutional neural network called attention enhanced serial Unet++ dehazing network (AESUnet) for single image dehazing. We attempt to build a serial Unet++ structure that adopts a serial strategy of two pruned Unet++ blocks based on residual connection. Compared with the simple Encoder–Decoder structure, the serial Unet++ module can better use the features extracted by encoders and promote contextual information fusion in different resolutions. In addition, we take some improvement measures to the Unet++ module, such as pruning, introducing the convolutional module with ResNet structure, and a residual learning strategy. Thus, the serial Unet++ module can generate more realistic images with less color distortion. Furthermore, following the serial Unet++ blocks, an attention mechanism is introduced to pay different attention to haze regions with different concentrations by learning weights in the spatial domain and channel domain. Experiments are conducted on two representative datasets: the large-scale synthetic dataset RESIDE and the small-scale real-world datasets I-HAZY and O-HAZY. The experimental results show that the proposed dehazing network is not only comparable to state-of-the-art methods for the RESIDE synthetic datasets, but also surpasses them by a very large margin for the I-HAZY and O-HAZY real-world dataset.

Download Full-text

Textual Deblurring using Convolutional Neural Network

10.36227/techrxiv.16760632 ◽

2021 ◽

Author(s):

Muhammad Shahroz Nadeem ◽

Sibt Hussain ◽

Fatih Kurugollu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Loss Function ◽

State Of The Art ◽

Empirical Evaluation ◽

The State

This paper solves the textual deblurring problem, In this paper we propose a new loss function, we provide empirical evaluation of the design choices based on which a memory friendly CNN model is proposed, that performs better then the state of the art CNN method.

Download Full-text

Oversampling Based on Data Augmentation in Convolutional Neural Network for Silicon Wafer Defect Classification

Knowledge Innovation Through Intelligent Software Methodologies, Tools and Techniques - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200547 ◽

2020 ◽

Author(s):

Uzma Batool ◽

Mohd Ibrahim Shapiai ◽

Nordinah Ismail ◽

Hilman Fauzi ◽

Syahrizal Salleh

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Silicon Wafer ◽

Data Augmentation ◽

Imbalanced Data ◽

Training Data ◽

Defect Classification ◽

Learning Method ◽

Test Set

Silicon wafer defect data collected from fabrication facilities is intrinsically imbalanced because of the variable frequencies of defect types. Frequently occurring types will have more influence on the classification predictions if a model gets trained on such skewed data. A fair classifier for such imbalanced data requires a mechanism to deal with type imbalance in order to avoid biased results. This study has proposed a convolutional neural network for wafer map defect classification, employing oversampling as an imbalance addressing technique. To have an equal participation of all classes in the classifier’s training, data augmentation has been employed, generating more samples in minor classes. The proposed deep learning method has been evaluated on a real wafer map defect dataset and its classification results on the test set returned a 97.91% accuracy. The results were compared with another deep learning based auto-encoder model demonstrating the proposed method, a potential approach for silicon wafer defect classification that needs to be investigated further for its robustness.

Download Full-text