Prediction of protein subcellular localization using deep learning and data augmentation

Knowledge of protein subcellular localization is vitally important for both basic research and drug development. With the avalanche of protein sequences emerging in the post-genomic age, it is highly desired to develop computational tools for timely and effectively identifying their subcellular localization based on the sequence information alone. Recently, a predictor called “pLoc-mPlant” was developed for identifying the subcellular localization of plant proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems in which some proteins, called “multiplex proteins”, may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mPlant was trained by an extremely skewed dataset in which some subsets (i.e., the protein numbers for some subcellular locations) were more than 10 times larger than the others. Accordingly, it cannot avoid the biased consequence caused by such an uneven training dataset. To overcome such biased consequence, we have developed a new and bias-free predictor called pLoc_bal-mPlant by balancing the training dataset. Cross-validation tests on exactly the same experimentconfirmed dataset have indicated that the proposed new predictor is remarkably superior to pLoc-mPlant, the existing state-of-the-art predictor in identifying the subcellular localization of plant proteins. To maximize the convenience for the majority of experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc_bal-mPlant/, by which users can easily get their desired results without the need to go through the detailed mathematics.

Download Full-text

Deep Learning-Based Classification of Protein Subcellular Localization from Immunohistochemistry Images

2017 4th IAPR Asian Conference on Pattern Recognition (ACPR) ◽

10.1109/acpr.2017.125 ◽

2017 ◽

Author(s):

Jin-Xian Hu ◽

Ying-Ying Xu ◽

Yang-Yang ◽

Hong-Bin Shen

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

Protein Subcellular Localization

Download Full-text

DeepLoc: prediction of protein subcellular localization using deep learning

Bioinformatics ◽

10.1093/bioinformatics/btx548 ◽

2017 ◽

Vol 33 (24) ◽

pp. 4049-4049 ◽

Cited By ~ 10

Author(s):

Jose Juan Almagro Armenteros ◽

Casper Kaae Sønderby ◽

Søren Kaae Sønderby ◽

Henrik Nielsen ◽

Ole Winther

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

Protein Subcellular Localization

Download Full-text

Prediction of human protein subcellular localization using deep learning

Journal of Parallel and Distributed Computing ◽

10.1016/j.jpdc.2017.08.009 ◽

2018 ◽

Vol 117 ◽

pp. 212-217 ◽

Cited By ~ 86

Author(s):

Leyi Wei ◽

Yijie Ding ◽

Ran Su ◽

Jijun Tang ◽

Quan Zou

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

Human Protein ◽

Protein Subcellular Localization

Download Full-text

Deep Learning for Cyber Security Applications: A Comprehensive Survey

10.36227/techrxiv.16748161 ◽

2021 ◽

Author(s):

vinayakumar R ◽

Mamoun Alazab ◽

Soman KP ◽

Sriram Srinivasan ◽

Sitalakshmi Venkatraman ◽

...

Keyword(s):

Deep Learning ◽

Language Processing ◽

Cyber Security ◽

Smart Cities ◽

Critical Discussion ◽

Future Research ◽

Next Generation ◽

Security Applications ◽

The Past ◽

Comprehensive Survey

Deep Learning (DL), a novel form of machine learning (ML) is gaining much research interest due to its successful application in many classical artificial intelligence (AI) tasks as compared to classical ML algorithms (CMLAs). Recently, DL architectures are being innovatively modelled for diverse applications in the area of cyber security. The literature is now growing with DL architectures and their variations for exploring different innovative DL models and prototypes that can be tailored to suit specific cyber security applications. However, there is a gap in literature for a comprehensive survey reporting on such research studies. Many of the survey-based research have a focus on specific DL architectures and certain types of malicious attacks within a limited cyber security problem scenario of the past and lack futuristic review. This paper aims at providing a well-rounded and thorough survey of the past, present, and future DL architectures including next-generation cyber security scenarios related to intelligent automation, Internet of Things (IoT), Big Data (BD), Blockchain, cloud and edge technologies. <br>This paper presents a tutorial-style comprehensive review of the state-of-the-art DL architectures for diverse applications in cyber security by comparing and analysing the contributions and challenges from various recent research papers. Firstly, the uniqueness of the survey is in reporting the use of DL architectures for an extensive set of cybercrime detection approaches such as intrusion detection, malware and botnet detection, spam and phishing detection, network traffic analysis, binary analysis, insider threat detection, CAPTCHA analysis, and steganography. Secondly, the survey covers key DL architectures in cyber security application domains such as cryptography, cloud security, biometric security, IoT and edge computing. Thirdly, the need for DL based research is discussed for the next generation cyber security applications in cyber physical systems (CPS) that leverage on BD analytics, natural language processing (NLP), signal and image processing and blockchain technology for smart cities and Industry 4.0 of the future. Finally, a critical discussion on open challenges and new proposed DL architecture contributes towards future research directions.

Download Full-text

Deep Learning for Cyber Security Applications: A Comprehensive Survey

10.36227/techrxiv.16748161.v1 ◽

2021 ◽

Author(s):

vinayakumar R ◽

Mamoun Alazab ◽

Soman KP ◽

Sriram Srinivasan ◽

Sitalakshmi Venkatraman ◽

...

Keyword(s):

Deep Learning ◽

Language Processing ◽

Cyber Security ◽

Smart Cities ◽

Critical Discussion ◽

Future Research ◽

Next Generation ◽

Security Applications ◽

The Past ◽

Comprehensive Survey

Deep Learning (DL), a novel form of machine learning (ML) is gaining much research interest due to its successful application in many classical artificial intelligence (AI) tasks as compared to classical ML algorithms (CMLAs). Recently, DL architectures are being innovatively modelled for diverse applications in the area of cyber security. The literature is now growing with DL architectures and their variations for exploring different innovative DL models and prototypes that can be tailored to suit specific cyber security applications. However, there is a gap in literature for a comprehensive survey reporting on such research studies. Many of the survey-based research have a focus on specific DL architectures and certain types of malicious attacks within a limited cyber security problem scenario of the past and lack futuristic review. This paper aims at providing a well-rounded and thorough survey of the past, present, and future DL architectures including next-generation cyber security scenarios related to intelligent automation, Internet of Things (IoT), Big Data (BD), Blockchain, cloud and edge technologies. <br>This paper presents a tutorial-style comprehensive review of the state-of-the-art DL architectures for diverse applications in cyber security by comparing and analysing the contributions and challenges from various recent research papers. Firstly, the uniqueness of the survey is in reporting the use of DL architectures for an extensive set of cybercrime detection approaches such as intrusion detection, malware and botnet detection, spam and phishing detection, network traffic analysis, binary analysis, insider threat detection, CAPTCHA analysis, and steganography. Secondly, the survey covers key DL architectures in cyber security application domains such as cryptography, cloud security, biometric security, IoT and edge computing. Thirdly, the need for DL based research is discussed for the next generation cyber security applications in cyber physical systems (CPS) that leverage on BD analytics, natural language processing (NLP), signal and image processing and blockchain technology for smart cities and Industry 4.0 of the future. Finally, a critical discussion on open challenges and new proposed DL architecture contributes towards future research directions.

Download Full-text

DeepLoc: prediction of protein subcellular localization using deep learning

Bioinformatics ◽

10.1093/bioinformatics/btx431 ◽

2017 ◽

Vol 33 (21) ◽

pp. 3387-3395 ◽

Cited By ~ 248

Author(s):

José Juan Almagro Armenteros ◽

Casper Kaae Sønderby ◽

Søren Kaae Sønderby ◽

Henrik Nielsen ◽

Ole Winther

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

Protein Subcellular Localization

Download Full-text

Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning

G3 Genes|Genome|Genetics ◽

10.1534/g3.116.033654 ◽

2017 ◽

Vol 7 (5) ◽

pp. 1385-1392 ◽

Cited By ~ 58

Author(s):

Tanel Pärnamaa ◽

Leopold Parts

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

High Throughput ◽

Protein Subcellular Localization ◽

Microscopy Images

Download Full-text

Deep Learning for Sentiment Analysis

Advances in Business Information Systems and Analytics - Natural Language Processing for Global and Local Business ◽

10.4018/978-1-7998-4240-8.ch005 ◽

2021 ◽

pp. 97-132

Author(s):

Vincent Karas ◽

Björn W. Schuller

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Data Augmentation ◽

Future Research ◽

Network Architectures ◽

Business Decisions ◽

Adversarial Networks ◽

Current Trends ◽

Recent Developments

Sentiment analysis is an important area of natural language processing that can help inform business decisions by extracting sentiment information from documents. The purpose of this chapter is to introduce the reader to selected concepts and methods of deep learning and show how deep models can be used to increase performance in sentiment analysis. It discusses the latest advances in the field and covers topics including traditional sentiment analysis approaches, the fundamentals of sentence modelling, popular neural network architectures, autoencoders, attention modelling, transformers, data augmentation methods, the benefits of transfer learning, the potential of adversarial networks, and perspectives on explainable AI. The authors' intent is that through this chapter, the reader can gain an understanding of recent developments in this area as well as current trends and potentials for future research.

Download Full-text

A Review on Deep Learning Techniques for 3D Sensed Data Classification

Remote Sensing ◽

10.3390/rs11121499 ◽

2019 ◽

Vol 11 (12) ◽

pp. 1499 ◽

Cited By ~ 31

Author(s):

David Griffiths ◽

Jan Boehm

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Point Clouds ◽

Image Understanding ◽

Future Research ◽

National Scale ◽

The Past ◽

Current State ◽

Learning Techniques ◽

Learning Architectures

Over the past decade deep learning has driven progress in 2D image understanding. Despite these advancements, techniques for automatic 3D sensed data understanding, such as point clouds, is comparatively immature. However, with a range of important applications from indoor robotics navigation to national scale remote sensing there is a high demand for algorithms that can learn to automatically understand and classify 3D sensed data. In this paper we review the current state-of-the-art deep learning architectures for processing unstructured Euclidean data. We begin by addressing the background concepts and traditional methodologies. We review the current main approaches, including RGB-D, multi-view, volumetric and fully end-to-end architecture designs. Datasets for each category are documented and explained. Finally, we give a detailed discussion about the future of deep learning for 3D sensed data, using literature to justify the areas where future research would be most valuable.

Download Full-text