NetTCR: sequence-based prediction of TCR binding to peptide-MHC complexes using convolutional neural networks

2018 ◽  
Author(s):  
Vanessa Isabell Jurtz ◽  
Leon Eyrich Jessen ◽  
Amalie Kai Bentzen ◽  
Martin Closter Jespersen ◽  
Swapnil Mahajan ◽  
...  

Predicting epitopes recognized by cytotoxic T cells has been a long-standing challenge in immuno- and bioinformatics. While reliable predictions of peptide binding are available for most Major Histocompatibility Complex class I (MHCI) alleles, prediction models of T cell receptor (TCR) interactions with MHC class I-peptide complexes remain poor owing to the limited amount of available training data. However, recent next-generation sequencing projects have generated a considerable amount of data relating TCR sequences to their cognate HLA-peptide complex targets. Here, we use such data to train a sequence-based predictor of the interaction between TCRs and peptides presented by the most common human MHCI allele, HLA-A*02:01. Our model is based on convolutional neural networks, which are especially suited to handling the large length variations of TCRs. We show that such a sequence-based model can identify TCRs binding a given cognate peptide-MHC target out of a large pool of non-binding TCRs.
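A minimal sketch (not the published NetTCR code) of the kind of two-branch 1D-CNN the abstract describes: padded, one-hot encoded CDR3β and peptide sequences pass through parallel convolutions of several kernel sizes with global max pooling, and a sigmoid head scores binding. Padding lengths, filter counts and layer sizes are assumptions.

```python
# Sketch of a sequence-based TCR/peptide binding scorer; all sizes are assumptions.
import tensorflow as tf
from tensorflow.keras import layers

MAX_CDR3, MAX_PEP, N_AA = 30, 9, 20   # assumed padding lengths and amino-acid alphabet

def conv_branch(x):
    # Parallel convolutions with several kernel sizes handle the length variation
    # of CDR3 loops; global max pooling makes the output length-invariant.
    pooled = []
    for k in (1, 3, 5, 7, 9):
        h = layers.Conv1D(16, k, padding="same", activation="relu")(x)
        pooled.append(layers.GlobalMaxPooling1D()(h))
    return layers.Concatenate()(pooled)

tcr_in = layers.Input(shape=(MAX_CDR3, N_AA), name="cdr3b")
pep_in = layers.Input(shape=(MAX_PEP, N_AA), name="peptide")

merged = layers.Concatenate()([conv_branch(tcr_in), conv_branch(pep_in)])
hidden = layers.Dense(32, activation="relu")(merged)
output = layers.Dense(1, activation="sigmoid", name="binding")(hidden)  # P(TCR binds pMHC)

model = tf.keras.Model([tcr_in, pep_in], output)
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=[tf.keras.metrics.AUC()])
model.summary()
```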

Author(s):  
Rony Dahan ◽  
Yoram Reiter

Tumour and virus-infected cells are recognised by CD8+ cytotoxic T cells which, in response, are activated to eliminate these cells. To be activated, the clonotypic T-cell receptor (TCR) must encounter a specific peptide antigen presented by a major histocompatibility complex (MHC) molecule on the cell surface. Cells that have undergone malignant transformation or viral infection present peptides derived from tumour-associated antigens or viral proteins on their MHC class I molecules. Therefore, disease-specific MHC–peptide complexes are desirable targets for immunotherapeutic approaches. One such approach converts the fine specificity but low intrinsic affinity of TCRs for MHC–peptide complexes into high-affinity soluble antibody molecules endowed with TCR-like specificity towards tumour or viral epitopes. These antibodies, termed TCR-like antibodies, are being developed as a new class of immunotherapeutics that can target tumour and virus-infected cells and mediate their specific killing. In addition to their therapeutic capabilities, TCR-like antibodies are being developed as diagnostic reagents for cancer and infectious diseases, and serve as valuable research tools for studying MHC class I antigen presentation.


Author(s):  
Wael H. Awad ◽  
Bruce N. Janson

Three different modeling approaches were applied to explain truck accidents at interchanges in Washington State over a 27-month period. Three models were developed for each ramp type: linear regression, neural networks, and a hybrid system combining fuzzy logic and neural networks. The study showed that linear regression was able to predict accident frequencies that fell within one standard deviation of the overall mean of the dependent variable; however, the coefficient of determination was very low in all cases. The two artificial intelligence (AI) approaches performed well at identifying different accident patterns in the training data and provided a better fit than the regression model. However, these AI models generalized poorly to test data that were not included in the training process.
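An illustrative sketch of the comparison the abstract describes: a linear regression baseline versus a small neural network, evaluated on held-out data. The feature names and synthetic data are hypothetical, and the fuzzy-neural hybrid is not reproduced here.

```python
# Illustrative only: linear regression vs. a small neural network on synthetic
# "accident frequency" data; the study's real data and fuzzy hybrid are not used.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

rng = np.random.default_rng(0)
n = 500
# Hypothetical ramp-level predictors: traffic volume, ramp length, grade.
X = rng.uniform([1000, 100, 0.0], [20000, 800, 6.0], size=(n, 3))
# Synthetic nonlinear accident-frequency response with Poisson-like noise.
rate = 0.0002 * X[:, 0] + 0.004 * X[:, 1] * (X[:, 2] / 6.0) ** 2
y = rng.poisson(rate).astype(float)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

lin = LinearRegression().fit(X_tr, y_tr)
mlp = make_pipeline(StandardScaler(),
                    MLPRegressor(hidden_layer_sizes=(16, 8), max_iter=2000,
                                 random_state=0)).fit(X_tr, y_tr)

# A model can fit the training data well (high train R^2) yet generalize poorly,
# which is the pattern the study reports for its AI models.
for name, m in [("linear", lin), ("mlp", mlp)]:
    print(name,
          "train R2 =", round(r2_score(y_tr, m.predict(X_tr)), 3),
          "test R2 =", round(r2_score(y_te, m.predict(X_te)), 3))
```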


Foods ◽  
2021 ◽  
Vol 10 (4) ◽  
pp. 879
Author(s):  
Clíssia Barboza da Silva ◽  
Alysson Alexander Naves Silva ◽  
Geovanny Barroso ◽  
Pedro Takao Yamamoto ◽  
Valter Arthur ◽  
...  

The application of artificial intelligence (AI) techniques such as deep learning to the quality control of grains has the potential to assist analysts in decision making and to improve procedures. Advanced technologies based on X-ray imaging provide markedly easier ways to control insect infestation of stored products, regardless of whether the quality features are visible on the surface of the grains. Here, we applied contrast enhancement algorithms based on peripheral equalization and calcification emphasis to X-ray images to improve the detection of Sitophilus zeamais in maize grains. In addition, we proposed an approach based on convolutional neural networks (CNNs) to identify non-infested and infested classes using three different architectures: (i) Inception-ResNet-v2, (ii) Xception and (iii) MobileNetV2. In general, the prediction models based on the MobileNetV2 and Xception architectures achieved higher accuracy (≥0.88) in distinguishing non-infested grains from grains infested by the maize weevil, with correct classification rates from 0.78 to 1.00 on the validation and test sets. Hence, the proposed approach using enhanced radiographs has the potential to provide precise detection of Sitophilus zeamais, supporting the safe human consumption of maize grains. The proposed method can automatically recognize food contaminated with hidden storage pests without manual feature engineering, which makes it more reliable for grain inspection.
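A sketch under assumptions of a binary infested / non-infested classifier built on one of the named backbones (MobileNetV2 with ImageNet weights). It assumes the contrast-enhanced radiographs have already been exported as 224×224 RGB files under class-named folders; the directory layout is hypothetical and this is not the authors' training pipeline.

```python
# Hedged sketch: MobileNetV2 backbone + binary head for infested vs. non-infested grains.
import tensorflow as tf
from tensorflow.keras import layers

IMG_SIZE = (224, 224)

# Hypothetical folder layout: xray_patches/{train,val}/{infested,non_infested}/*.png
train_ds = tf.keras.utils.image_dataset_from_directory(
    "xray_patches/train", image_size=IMG_SIZE, batch_size=32, label_mode="binary")
val_ds = tf.keras.utils.image_dataset_from_directory(
    "xray_patches/val", image_size=IMG_SIZE, batch_size=32, label_mode="binary")

backbone = tf.keras.applications.MobileNetV2(
    include_top=False, weights="imagenet", input_shape=IMG_SIZE + (3,))
backbone.trainable = False  # start by training only the new classification head

inputs = layers.Input(shape=IMG_SIZE + (3,))
x = tf.keras.applications.mobilenet_v2.preprocess_input(inputs)
x = backbone(x, training=False)
x = layers.GlobalAveragePooling2D()(x)
x = layers.Dropout(0.2)(x)
outputs = layers.Dense(1, activation="sigmoid")(x)  # P(grain is infested)

model = tf.keras.Model(inputs, outputs)
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(train_ds, validation_data=val_ds, epochs=10)
```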


2018 ◽  
Vol 8 (12) ◽  
pp. 2416 ◽  
Author(s):  
Ansi Zhang ◽  
Honglei Wang ◽  
Shaobo Li ◽  
Yuxin Cui ◽  
Zhonghao Liu ◽  
...  

Prognostics, such as remaining useful life (RUL) prediction, is a crucial task in condition-based maintenance. A major challenge in data-driven prognostics is the difficulty of obtaining a sufficient number of samples of failure progression, yet for both traditional machine learning methods and deep neural networks, sufficient training data is a prerequisite for good prediction models. In this work, we propose a transfer learning algorithm based on Bi-directional Long Short-Term Memory (BLSTM) recurrent neural networks for RUL estimation, in which models are first trained on different but related datasets and then fine-tuned on the target dataset. Extensive experimental results show that transfer learning generally improves prediction models on datasets with a small number of samples. The one exception is transferring from multiple operating conditions to a single operating condition, where transfer learning led to worse results.
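A hedged sketch of the transfer setup described above: a bidirectional LSTM regressor is pre-trained on a large "source" run-to-failure dataset and then fine-tuned on a small "target" dataset with the recurrent layers frozen. Window length, sensor count and variable names are illustrative, not the paper's configuration.

```python
# Sketch of BLSTM-based RUL transfer learning; shapes and data names are placeholders.
import tensorflow as tf
from tensorflow.keras import layers

WINDOW, N_SENSORS = 30, 14  # assumed sliding-window length and sensor count

def build_blstm():
    model = tf.keras.Sequential([
        layers.Input(shape=(WINDOW, N_SENSORS)),
        layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
        layers.Bidirectional(layers.LSTM(32)),
        layers.Dense(32, activation="relu"),
        layers.Dense(1),  # predicted remaining useful life (e.g. in cycles)
    ])
    model.compile(optimizer="adam", loss="mse", metrics=["mae"])
    return model

model = build_blstm()
# 1) Pre-train on the related source dataset (placeholders: x_source, y_source).
# model.fit(x_source, y_source, epochs=50, validation_split=0.1)

# 2) Freeze the recurrent feature extractor and fine-tune only the dense head
#    on the small target dataset (placeholders: x_target, y_target).
for layer in model.layers[:-2]:
    layer.trainable = False
model.compile(optimizer=tf.keras.optimizers.Adam(1e-4), loss="mse", metrics=["mae"])
# model.fit(x_target, y_target, epochs=20, validation_split=0.1)
```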


Author(s):  
Y. A. Lumban-Gaol ◽  
K. A. Ohori ◽  
R. Y. Peters

Abstract. Satellite-Derived Bathymetry (SDB) has been used in many applications related to coastal management. SDB can efficiently fill gaps in data obtained from traditional echo-sounding surveys. However, it still requires substantial training data, which are not available in many areas. Furthermore, accuracy problems arise because a linear model cannot capture the non-linear relationship between reflectance and depth caused by bottom variations and noise. Convolutional Neural Networks (CNNs) can capture both the connection between neighbouring pixels and this non-linear relationship, which makes them compelling for shallow-water depth extraction. We investigate the accuracy of different architectures using different window sizes and band combinations. We use Sentinel-2 Level 2A images to provide reflectance values, and Lidar and Multi Beam Echo Sounder (MBES) datasets as depth references to train and test the model. A set of Sentinel-2 and in-situ depth subimage pairs is extracted to perform CNN training. The model is compared to the linear transform and applied to two other study areas. The resulting accuracy ranges from 1.3 m to 1.94 m, and the coefficient of determination reaches 0.94. The SDB model generated using a window size of 9×9 agrees well with the reference depths, especially in areas deeper than 15 m. Adding the two short-wave infrared bands to the four visible bands in training improves the overall accuracy of SDB. Applying the pre-trained model to other study areas provides similar results, depending on the water conditions.
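A minimal sketch of the patch-based regression the abstract describes, assuming 9×9-pixel Sentinel-2 reflectance patches (four visible plus two SWIR bands) have already been extracted and paired with Lidar/MBES reference depths. The architecture is illustrative, not the paper's exact network.

```python
# Sketch: CNN regressor mapping a 9x9 multispectral patch to a depth value.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

PATCH, BANDS = 9, 6   # assumed window size and band count (4 visible + 2 SWIR)

model = tf.keras.Sequential([
    layers.Input(shape=(PATCH, PATCH, BANDS)),
    layers.Conv2D(32, 3, padding="same", activation="relu"),
    layers.Conv2D(64, 3, padding="same", activation="relu"),
    layers.GlobalAveragePooling2D(),
    layers.Dense(32, activation="relu"),
    layers.Dense(1),  # predicted depth (m), trained against Lidar/MBES references
])
model.compile(optimizer="adam", loss="mse",
              metrics=[tf.keras.metrics.RootMeanSquaredError()])

# Synthetic stand-in data just to show the expected tensor shapes.
x = np.random.rand(256, PATCH, PATCH, BANDS).astype("float32")
y = np.random.uniform(0, 30, size=(256, 1)).astype("float32")
model.fit(x, y, epochs=2, batch_size=32, verbose=0)
```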


Geophysics ◽  
2021 ◽  
pp. 1-45
Author(s):  
Runhai Feng ◽  
Dario Grana ◽  
Niels Balling

Segmentation of faults based on seismic images is an important step in reservoir characterization. With recent developments in deep-learning methods and the availability of massive computing power, automatic interpretation of seismic faults has become possible. The likelihood of occurrence of a fault can be quantified using a sigmoid function. Our goal is to quantify the fault model uncertainty that is generally not captured by deep-learning tools. We propose to use the dropout approach, a regularization technique designed to prevent overfitting and co-adaptation in hidden units, to approximate Bayesian inference and estimate a principled uncertainty over functions. In particular, the variance of the learned model is decomposed into aleatoric and epistemic parts. The proposed method is applied to a real dataset from the Netherlands F3 block with two different dropout ratios in convolutional neural networks. The aleatoric uncertainty is irreducible since it relates to the stochastic dependency within the input observations. As the number of Monte-Carlo realizations increases, the epistemic uncertainty asymptotically converges and the model standard deviation decreases, because the variability of the model parameters is better captured with a larger sample size. This analysis quantifies the confidence with which fault predictions can be used in regions of lower uncertainty, and it indicates where more training data are needed to reduce the uncertainty in low-confidence regions.
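A hedged sketch of Monte-Carlo dropout for a sigmoid fault-likelihood model: dropout stays active at inference time, and a standard decomposition splits predictive variance into an aleatoric term (mean of p(1-p) across passes) and an epistemic term (variance of p across passes). The tiny network and patch size below are illustrative, not the authors' architecture.

```python
# Sketch: MC-dropout uncertainty decomposition for per-pixel fault probabilities.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

DROPOUT = 0.3   # one of the dropout ratios to compare
model = tf.keras.Sequential([
    layers.Input(shape=(64, 64, 1)),            # assumed seismic image patch
    layers.Conv2D(16, 3, padding="same", activation="relu"),
    layers.SpatialDropout2D(DROPOUT),
    layers.Conv2D(32, 3, padding="same", activation="relu"),
    layers.SpatialDropout2D(DROPOUT),
    layers.Conv2D(1, 1, activation="sigmoid"),  # per-pixel fault likelihood
])

def mc_dropout_predict(model, x, n_samples=50):
    # Repeated stochastic forward passes with dropout switched on (training=True).
    probs = np.stack([model(x, training=True).numpy() for _ in range(n_samples)])
    mean = probs.mean(axis=0)
    aleatoric = (probs * (1.0 - probs)).mean(axis=0)  # data noise, irreducible
    epistemic = probs.var(axis=0)                     # model uncertainty, shrinks with more data
    return mean, aleatoric, epistemic

x = np.random.rand(1, 64, 64, 1).astype("float32")    # stand-in seismic patch
mean, aleatoric, epistemic = mc_dropout_predict(model, x)
print(mean.shape, aleatoric.mean(), epistemic.mean())
```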


2019 ◽  
Vol 11 (12) ◽  
pp. 1461 ◽  
Author(s):  
Husam A. H. Al-Najjar ◽  
Bahareh Kalantar ◽  
Biswajeet Pradhan ◽  
Vahideh Saeidi ◽  
Alfian Abdul Halin ◽  
...  

In recent years, remote sensing researchers have investigated the use of different modalities (or combinations of modalities) for classification tasks. Such modalities can be extracted via a diverse range of sensors and images. To date, few studies have attempted to increase land cover classification accuracy using unmanned aerial vehicle (UAV)–digital surface model (DSM) fused datasets. Therefore, this study looks at improving the accuracy of these datasets by exploiting convolutional neural networks (CNNs). In this work, we focus on the fusion of DSM and UAV images for land use/land cover mapping via classification into seven classes: bare land, buildings, dense vegetation/trees, grassland, paved roads, shadows, and water bodies. Specifically, we investigated the effectiveness of two datasets with the aim of inspecting whether the fused DSM yields notable gains for land cover classification. The datasets were: (i) orthomosaic image data only (Red, Green and Blue channels), and (ii) a fusion of the orthomosaic image and DSM data, with the final classification performed using a CNN. As a classification method, the CNN is promising due to its hierarchical learning structure, regularization and weight sharing with respect to training data, good generalization, optimization and parameter reduction, automatic feature extraction, and robust discriminative ability. The experimental results show that a CNN trained on the fused dataset obtains better results, with a Kappa index of ~0.98, an average accuracy of 0.97 and a final overall accuracy of 0.98. Comparing the CNN results with and without DSM revealed improvements in overall accuracy, average accuracy and Kappa index of 1.2%, 1.8% and 1.5%, respectively. Accordingly, adding the heights of features such as buildings and trees improved the differentiation of vegetation, particularly where plants were dense.
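An illustrative sketch of the fusion setup: the normalised DSM is stacked with the RGB orthomosaic as a fourth input channel, and a small CNN assigns each patch to one of the seven land cover classes. Patch size and layer sizes are assumptions, not the study's network.

```python
# Sketch: RGB + DSM channel fusion feeding a 7-class CNN classifier.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

PATCH, N_CLASSES = 32, 7   # 7 classes: bare land, buildings, trees, grassland,
                           # paved roads, shadows, water bodies

def fuse_rgb_dsm(rgb, dsm):
    # rgb: (H, W, 3) in [0, 1]; dsm: (H, W) heights -> min-max normalised extra channel.
    dsm = (dsm - dsm.min()) / (dsm.max() - dsm.min() + 1e-8)
    return np.concatenate([rgb, dsm[..., None]], axis=-1)   # (H, W, 4)

model = tf.keras.Sequential([
    layers.Input(shape=(PATCH, PATCH, 4)),      # R, G, B + DSM
    layers.Conv2D(32, 3, padding="same", activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, padding="same", activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(N_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Shape check with synthetic stand-ins for labelled orthomosaic/DSM patches.
rgb = np.random.rand(PATCH, PATCH, 3)
dsm = np.random.rand(PATCH, PATCH) * 20.0
patch = fuse_rgb_dsm(rgb, dsm)[None, ...].astype("float32")
print(model.predict(patch, verbose=0).shape)   # (1, 7) class probabilities
```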


2020 ◽  
Vol 12 (23) ◽  
pp. 3953
Author(s):  
Ashley N. Ellenson ◽  
Joshua A. Simmons ◽  
Greg W. Wilson ◽  
Tyler J. Hesser ◽  
Kristen D. Splinter

Nearshore morphology is a key driver of wave breaking and the resulting nearshore circulation, recreational safety, and nutrient dispersion. Morphology persists within the nearshore in specific shapes that can be classified into equilibrium states, which convey qualitative information about bathymetry and the relevant physical processes. While nearshore bathymetry is challenging to collect, much information about the underlying bathymetry can be gained from remote sensing of the surf zone. This study presents a new method to automatically classify beach state from Argus daytime time-exposure (timex) imagery using a machine learning technique called convolutional neural networks (CNNs). The CNN processed imagery from two locations: Narrabeen, New South Wales, Australia and Duck, North Carolina, USA. Three different CNN models are examined: one trained at Narrabeen, one at Duck, and one trained at both locations. Each model was tested at the location where it was trained in a self-test, and the single-beach models were tested at the location where they were not trained in a transfer test. For the self-tests, skill (as measured by the F-score) was comparable to expert agreement (CNN F-scores at Duck = 0.80 and Narrabeen = 0.59). For the transfer tests, CNN model skill was reduced by 24–48%, suggesting the algorithm requires additional local data to improve transferability. Transferability tests showed that F-scores comparable to the self-trained cases (within 10%) can be achieved at both locations when at least 25% of the training data come from each site. This suggests that, if applied to additional locations, a CNN model trained at one location may be skillful at new sites with only limited new imagery data. Finally, a CNN visualization technique (Guided Grad-CAM) confirmed that the CNN determined classifications using image regions (e.g., incised rip channels, terraces) that were consistent with beach state labelling rules.
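A sketch of plain Grad-CAM, a simpler relative of the Guided Grad-CAM used in the study, for checking which image regions drive a beach-state prediction. The trained classifier, the convolutional layer name and the usage line are hypothetical; nothing here is the authors' code.

```python
# Sketch: Grad-CAM heatmap for a beach-state CNN (model and layer name assumed).
import numpy as np
import tensorflow as tf

def grad_cam(model, image, conv_layer_name, class_index):
    # Map the input to both the chosen conv feature maps and the class scores.
    grad_model = tf.keras.Model(
        model.inputs, [model.get_layer(conv_layer_name).output, model.output])
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image[None, ...])
        class_score = preds[:, class_index]
    grads = tape.gradient(class_score, conv_out)        # d(score) / d(feature maps)
    weights = tf.reduce_mean(grads, axis=(1, 2))        # global-average-pool the gradients
    cam = tf.reduce_sum(conv_out * weights[:, None, None, :], axis=-1)
    cam = tf.nn.relu(cam)[0]                            # keep positive evidence only
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()  # heatmap normalised to [0, 1]

# Hypothetical usage with a trained beach-state classifier:
# heatmap = grad_cam(beach_state_cnn, timex_image, "last_conv", class_index=2)
```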

