Deep learning regression model for antimicrobial peptide design

2019 ◽  
Author(s):  
Jacob Witten ◽  
Zack Witten

Abstract. Antimicrobial peptides (AMPs) are naturally occurring or synthetic peptides that show promise for treating antibiotic-resistant pathogens. Machine learning techniques are increasingly used to identify naturally occurring AMPs, but there is a dearth of purely computational methods to design novel effective AMPs, which would speed AMP development. We collected a large database, Giant Repository of AMP Activities (GRAMPA), containing AMP sequences and associated minimum inhibitory concentrations (MICs). We designed a convolutional neural network to perform combined classification and regression on peptide sequences to quantitatively predict AMP activity against Escherichia coli. Our predictions outperformed the state of the art at AMP classification and were also effective at regression, for which there were no publicly available comparisons. We then used our model to design novel AMPs and experimentally demonstrated activity of these AMPs against the pathogens E. coli, Pseudomonas aeruginosa, and Staphylococcus aureus. Data, code, and neural network architecture and parameters are available at https://github.com/zswitten/Antimicrobial-Peptides.
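To make the combined classification-and-regression setup concrete, here is a minimal PyTorch sketch of a 1D convolutional network with a shared sequence encoder and two output heads, one for AMP/non-AMP probability and one for predicted log MIC. The layer sizes, kernel width, and maximum peptide length are illustrative assumptions, not the architecture published in the repository.

```python
# Minimal sketch of a combined classification/regression CNN for peptide
# sequences (illustrative only; layer sizes and max length are assumptions).
import torch
import torch.nn as nn

class AMPNet(nn.Module):
    def __init__(self, n_amino_acids=20, max_len=50):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv1d(n_amino_acids, 64, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.AdaptiveMaxPool1d(1),   # pool over sequence positions
        )
        self.classifier = nn.Linear(64, 1)   # P(peptide is an AMP)
        self.regressor = nn.Linear(64, 1)    # predicted log MIC vs. E. coli

    def forward(self, x):                    # x: (batch, 20, max_len), one-hot
        h = self.encoder(x).squeeze(-1)      # (batch, 64)
        return torch.sigmoid(self.classifier(h)), self.regressor(h)

model = AMPNet()
probs, log_mic = model(torch.zeros(8, 20, 50))  # dummy batch of 8 peptides
```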

2020 ◽  
Vol 638 ◽  
pp. A134
Author(s):  
José A. de Diego ◽  
Jakub Nadolny ◽  
Ángel Bongiovanni ◽  
Jordi Cepa ◽  
Mirjana Pović ◽  
...  

Context. The accurate classification of hundreds of thousands of galaxies observed in modern deep surveys is imperative if we want to understand the universe and its evolution. Aims. Here, we report the use of machine learning techniques to classify early- and late-type galaxies in the OTELO and COSMOS databases using optical and infrared photometry and available shape parameters: either the Sérsic index or the concentration index. Methods. We used three classification methods for the OTELO database: (1) u − r color separation, (2) linear discriminant analysis using the u − r color and a shape parameter, and (3) a deep neural network using the r magnitude, several colors, and a shape parameter. We analyzed the performance of each method by sample bootstrapping and tested the performance of our neural network architecture on COSMOS data. Results. The accuracy achieved by the deep neural network is greater than that of the other classification methods, and it can also operate with missing data. Our neural network architecture is able to classify both the OTELO and COSMOS datasets regardless of small differences in the photometric bands used in each catalog. Conclusions. In this study we show that the use of deep neural networks is a robust method for mining cataloged data.
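As an illustration of the kind of small photometric classifier described in the Methods, the sketch below builds a dense network over the r magnitude, several colors, and one shape parameter, and handles missing values by imputing them and concatenating an observed/missing mask. The feature count, layer widths, and masking strategy are assumptions for illustration; the paper's exact treatment of missing data may differ.

```python
# Illustrative sketch of a small dense classifier for early/late-type galaxies.
# Inputs: r magnitude, several colors, and one shape parameter (Sersic or
# concentration index). Missing values are imputed to 0 and flagged with a
# mask channel; this is one common strategy, not necessarily the paper's.
import torch
import torch.nn as nn

n_features = 8                       # assumed number of photometric/shape inputs

net = nn.Sequential(
    nn.Linear(2 * n_features, 32),   # features concatenated with missing-value mask
    nn.ReLU(),
    nn.Linear(32, 16),
    nn.ReLU(),
    nn.Linear(16, 2),                # logits for early- vs. late-type
)

x = torch.randn(64, n_features)      # dummy batch of 64 galaxies
mask = torch.ones_like(x)            # 1 = observed, 0 = missing
logits = net(torch.cat([x * mask, mask], dim=1))
```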


Author(s):  
Yanlin Han ◽  
Piotr Gmytrasiewicz

This paper introduces the IPOMDP-net, a neural network architecture for multi-agent planning under partial observability. It embeds an interactive partially observable Markov decision process (I-POMDP) model and a QMDP planning algorithm that solves the model in a neural network architecture. The IPOMDP-net is fully differentiable and allows for end-to-end training. In the learning phase, we train an IPOMDP-net on various fixed and randomly generated environments in a reinforcement learning setting, assuming observable reinforcements and unknown (randomly initialized) model functions. In the planning phase, we test the trained network on new, unseen variants of the environments under the planning setting, using the trained model to plan without reinforcements. Empirical results show that our model-based IPOMDP-net outperforms the other state-of-the-art model-free network and generalizes better to larger, unseen environments. Our approach provides a general neural computing architecture for multi-agent planning using I-POMDPs. It suggests that, in a multi-agent setting, having a model of other agents benefits our decision-making, resulting in a policy of higher quality and better generalizability.
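For readers unfamiliar with the QMDP approximation that the IPOMDP-net embeds in differentiable form, the toy NumPy sketch below value-iterates the underlying (fully observable) MDP and then weights the resulting Q-values by the current belief to pick an action. The state, action, and iteration counts are arbitrary toy values, and the interactive (I-POMDP) nesting over other agents' models is omitted.

```python
# Sketch of the plain QMDP approximation: value-iterate the underlying MDP,
# then weight the resulting Q-values by the current belief. Toy sizes only.
import numpy as np

n_states, n_actions, gamma = 10, 4, 0.95
T = np.random.dirichlet(np.ones(n_states), size=(n_states, n_actions))  # T[s, a] -> P(s')
R = np.random.rand(n_states, n_actions)                                 # reward model

V = np.zeros(n_states)
for _ in range(100):                        # value iteration on the fully observable MDP
    Q = R + gamma * np.einsum('sap,p->sa', T, V)
    V = Q.max(axis=1)

belief = np.full(n_states, 1.0 / n_states)  # current belief over states
action = int(np.argmax(belief @ Q))         # QMDP action selection
```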


Author(s):  
Raghuram Mandyam Annasamy ◽  
Katia Sycara

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical and empirical studies on understanding what these networks actually learn lag far behind. In this paper we propose an interpretable neural network architecture for Q-learning which provides a global explanation of the model's behavior using key-value memories, attention, and reconstructible embeddings. With a directed exploration strategy, our model can reach training rewards comparable to the state-of-the-art deep Q-learning models. However, results suggest that the features extracted by the neural network are extremely shallow, and subsequent testing on out-of-sample examples shows that the agent can easily overfit to trajectories seen during training.
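The sketch below illustrates one way a Q-network can read its action values from a key-value memory through attention, so that individual keys and values can be inspected for interpretation. The slot count, embedding size, and readout are assumptions for illustration rather than the architecture proposed in the paper.

```python
# Illustrative Q-network whose output is read from a key-value memory via
# attention, making each memorized key/value pair inspectable. Sizes assumed.
import torch
import torch.nn as nn
import torch.nn.functional as F

class KeyValueQNet(nn.Module):
    def __init__(self, state_dim=16, n_slots=32, n_actions=4):
        super().__init__()
        self.embed = nn.Linear(state_dim, 64)          # state embedding
        self.keys = nn.Parameter(torch.randn(n_slots, 64))
        self.values = nn.Parameter(torch.randn(n_slots, n_actions))

    def forward(self, state):
        h = self.embed(state)                          # (batch, 64)
        attn = F.softmax(h @ self.keys.t(), dim=-1)    # (batch, n_slots)
        return attn @ self.values                      # (batch, n_actions) Q-values

q_values = KeyValueQNet()(torch.randn(5, 16))          # dummy batch of 5 states
```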


2021 ◽  
Author(s):  
Alexei Belochitski ◽  
Vladimir Krasnopolsky

Abstract. The ability of machine-learning (ML) based model components to generalize to previously unseen inputs, and the resulting stability of the models that use these components, has been receiving considerable attention recently, especially when it comes to ML-based parameterizations. At the same time, ML-based emulators of existing parameterizations can be stable, accurate, and fast when used in the model they were specifically designed for. In this work we show that shallow-neural-network-based emulators of radiative transfer parameterizations, developed almost a decade ago for a state-of-the-art GCM, are robust with respect to substantial structural and parametric changes in the host model: when used in two seven-month-long experiments with the new model, they not only remain stable but also generate realistic output. Aspects of neural network architecture and training set design that potentially contribute to the stability of ML-based model components are discussed.
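The following sketch shows the general shape of a shallow-neural-network emulator of a radiative transfer parameterization: a single hidden layer mapping a vector of column inputs to a heating-rate profile. The input, hidden, and output dimensions are placeholders, not those of the operational emulators discussed here.

```python
# Minimal sketch of a shallow-neural-network emulator of a radiative transfer
# parameterization: one hidden layer mapping a vector of column inputs
# (temperature, humidity, cloud variables, ...) to a heating-rate profile.
# Input/output sizes are placeholders, not those of the operational emulator.
import torch
import torch.nn as nn

n_inputs, n_hidden, n_levels = 200, 150, 60

emulator = nn.Sequential(
    nn.Linear(n_inputs, n_hidden),
    nn.Tanh(),                      # single hidden layer, hence "shallow"
    nn.Linear(n_hidden, n_levels),  # heating rate at each model level
)

heating_rates = emulator(torch.randn(1, n_inputs))  # one dummy atmospheric column
```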


Materials ◽  
2019 ◽  
Vol 12 (8) ◽  
pp. 1256 ◽  
Author(s):  
Patryk Ziolkowski ◽  
Maciej Niedostatkiewicz

Concrete mix design is a complex and multistage process in which we try to find the best composition of ingredients to create well-performing concrete. In contemporary literature, as well as in state-of-the-art corporate practice, there are several methods of concrete mix design, of which the most popular are those derived from the Three Equation Method. One of the most important features of concrete is compressive strength, which determines the concrete class. A predictable compressive strength is essential for the utilisation of concrete structures and is the main determinant of their safety and durability. Machine learning has recently been gaining significant attention, and future prospects for this technology are even more promising. Data mining on large datasets attracts attention because machine learning algorithms have reached a level at which they can recognise patterns that are difficult to recognise with human cognitive skills. In this paper, we utilise state-of-the-art machine learning techniques for concrete mix design. We prepared an extensive database of concrete recipes together with the corresponding destructive laboratory tests, which we used to train an artificial neural network with a selected optimal architecture. We then translated the architecture of the artificial neural network into a mathematical equation that can be used in practical applications.
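Translating a trained network into an explicit formula is straightforward for a small architecture: a one-hidden-layer network is just f(x) = W2·tanh(W1·x + b1) + b2, which can be written out term by term once the weights are known. The sketch below illustrates this with random weights and an assumed set of six mix ingredients; it is not the equation derived in the paper.

```python
# Sketch of how a trained one-hidden-layer network can be written out as an
# explicit equation: f(x) = W2 * tanh(W1 @ x + b1) + b2. The feature set and
# layer size here are assumptions, not the network selected in the paper.
import numpy as np

n_ingredients, n_hidden = 6, 8          # e.g. cement, water, aggregates, admixtures...
W1 = np.random.randn(n_hidden, n_ingredients)
b1 = np.random.randn(n_hidden)
W2 = np.random.randn(1, n_hidden)
b2 = np.random.randn(1)

def compressive_strength(mix):          # mix: vector of ingredient proportions
    return (W2 @ np.tanh(W1 @ mix + b1) + b2).item()

print(compressive_strength(np.ones(n_ingredients)))
```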


IoT ◽  
2021 ◽  
Vol 2 (2) ◽  
pp. 222-235
Author(s):  
Guillaume Coiffier ◽  
Ghouthi Boukli Hacene ◽  
Vincent Gripon

Deep neural networks are state-of-the-art in a large number of challenges in machine learning. However, to reach the best performance they require a huge pool of parameters. Indeed, typical deep convolutional architectures present an increasing number of feature maps as we go deeper in the network, whereas the spatial resolution of inputs is decreased through downsampling operations. This means that most of the parameters lie in the final layers, while a large portion of the computations is performed by a small fraction of the total parameters in the first layers. In an effort to use every parameter of a network to its maximum, we propose a new convolutional neural network architecture, called ThriftyNet. In ThriftyNet, only one convolutional layer is defined and used recursively, leading to a maximal parameter factorization. In complement, normalization layers, non-linearities, downsampling operations, and shortcut connections ensure sufficient expressivity of the model. ThriftyNet achieves competitive performance on a tiny parameter budget, exceeding 91% accuracy on CIFAR-10 with fewer than 40k parameters in total, 74.3% on CIFAR-100 with fewer than 600k parameters, and 67.1% on ImageNet ILSVRC 2012 with no more than 4.15M parameters. However, the proposed method typically requires more computations than existing counterparts.
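A hedged sketch of the recursive idea: a single shared convolution applied repeatedly, wrapped with batch normalization, a ReLU non-linearity, a residual shortcut, and periodic max-pooling for downsampling. The channel count, iteration count, and downsampling schedule below are assumptions, not ThriftyNet's published hyperparameters.

```python
# Sketch of the ThriftyNet idea: a single convolutional layer reused at every
# iteration, with normalization, a non-linearity, a residual shortcut, and
# periodic downsampling. Channel count and iteration schedule are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ThriftyBlock(nn.Module):
    def __init__(self, channels=128, n_iters=12, downsample_every=4):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, 3, padding=1)   # the only conv
        self.norms = nn.ModuleList([nn.BatchNorm2d(channels) for _ in range(n_iters)])
        self.n_iters, self.downsample_every = n_iters, downsample_every

    def forward(self, x):
        for i in range(self.n_iters):
            x = x + F.relu(self.norms[i](self.conv(x)))           # shared weights, shortcut
            if (i + 1) % self.downsample_every == 0:
                x = F.max_pool2d(x, 2)
        return x

out = ThriftyBlock()(torch.randn(2, 128, 32, 32))   # dummy batch of embedded images
```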


2021 ◽  
Vol 11 (16) ◽  
pp. 7181
Author(s):  
Jakub Caputa ◽  
Daria Łukasik ◽  
Maciej Wielgosz ◽  
Michał Karwatowski ◽  
Rafał Frączek ◽  
...  

We present experimental results on using the YOLOv3 neural network architecture to automatically detect tumor cells in cytological samples taken from canine skin. A rich dataset of 1219 smeared sample images with 28,149 objects was gathered and annotated by a veterinary doctor to perform the experiments. It covers three types of common round cell neoplasms: mastocytoma, histiocytoma, and lymphoma. The dataset is thoroughly described in the paper and is publicly available. The YOLOv3 neural network architecture was trained using various schemes involving modifications of the original dataset and different model parameters. The experiments showed that the prototype model achieved 0.7416 mAP, which outperforms state-of-the-art machine learning results as well as human estimates. We also provide a series of analyses that may facilitate ML-based solutions by casting more light on some aspects of their performance, and we present the main discrepancies between ML-based and human-based diagnoses. This outline may help depict the scenarios in which automated tools can support the diagnostic process.
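Since the headline result is reported as mAP, the small helper below shows the intersection-over-union test typically used when matching predicted boxes to the annotated ground truth during mAP computation. The (x1, y1, x2, y2) box format is an assumption, and this is generic evaluation code rather than part of the YOLOv3 pipeline used in the study.

```python
# Intersection-over-union between two boxes in (x1, y1, x2, y2) format,
# as typically used to match detections to ground truth when computing mAP.
def iou(box_a, box_b):
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # ~0.143
```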


A vast number of image processing and neural network approaches are currently being utilized in the analysis of various medical conditions. Malaria is a disease that can be diagnosed by examining blood smears. However, when smears are examined manually by a microscopist, the diagnosis can be error-prone because its accuracy depends on the quality of the smear and the expertise of the microscopist examining it. Among the various machine learning techniques, convolutional neural networks (CNNs) promise relatively higher accuracy. We propose an Optimized Step-Increase CNN (OSICNN) model to classify red blood cell images taken from thin blood smear samples into those infected and not infected with the malaria parasite. The proposed OSICNN model consists of four convolutional layers and shows results comparable to other state-of-the-art models. The accuracy of identifying the parasite in red blood cells (RBCs) with the proposed model has been found to be 98.3%.
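As an illustration of a compact four-convolutional-layer classifier in the spirit of OSICNN, the sketch below steps the filter count up at each layer and ends in a two-class output for infected versus non-infected cells. The filter counts, input patch size, and pooling scheme are assumptions, not the optimized configuration reported above.

```python
# Illustrative four-convolutional-layer classifier in the spirit of OSICNN
# ("step-increase" filter counts are an assumption here), classifying RBC
# patches as parasitized or uninfected.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    return nn.Sequential(nn.Conv2d(c_in, c_out, 3, padding=1),
                         nn.ReLU(), nn.MaxPool2d(2))

model = nn.Sequential(
    conv_block(3, 16), conv_block(16, 32),      # filter count steps up per layer
    conv_block(32, 64), conv_block(64, 128),
    nn.Flatten(),
    nn.Linear(128 * 8 * 8, 2),                  # assumes 128x128 input patches
)

logits = model(torch.randn(4, 3, 128, 128))     # dummy batch of 4 RBC patches
```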


Author(s):  
Daniel Ray ◽  
Tim Collins ◽  
Prasad Ponnapalli

Extracting accurate heart rate estimations from wrist-worn photoplethysmography (PPG) devices is challenging because the signal contains artifacts from several sources. Deep learning approaches have shown very promising results, outperforming classical methods with improvements of 21% and 31% on two state-of-the-art datasets. This paper provides an analysis of several data-driven methods for creating deep neural network architectures, with the aim of further improvements.
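A minimal sketch of the kind of data-driven architecture involved: a 1D convolutional regressor that maps a window of PPG (and, optionally, accelerometer) channels to a single heart-rate estimate. The channel count, window length, and layer sizes are assumptions for illustration, not one of the architectures analyzed in the paper.

```python
# Hedged sketch of a 1D convolutional regressor mapping a window of PPG and
# accelerometer channels to a heart-rate estimate in BPM. Window length,
# channel count, and layer sizes are assumptions for illustration.
import torch
import torch.nn as nn

n_channels, window = 4, 256          # e.g. 1 PPG + 3 accelerometer channels

hr_net = nn.Sequential(
    nn.Conv1d(n_channels, 32, kernel_size=7, padding=3), nn.ReLU(), nn.MaxPool1d(2),
    nn.Conv1d(32, 64, kernel_size=7, padding=3), nn.ReLU(), nn.AdaptiveAvgPool1d(1),
    nn.Flatten(),
    nn.Linear(64, 1),                # predicted heart rate for the window
)

bpm = hr_net(torch.randn(8, n_channels, window))  # dummy batch of 8 signal windows
```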

