VUDENC: Vulnerability Detection with Deep Learning on a Natural Codebase for Python

The object of research of this work is the methods of deep learning for source code vulnerability detection. One of the most problematic areas is the use of only one approach in the code analysis process: the approach based on the AST (abstract syntax tree) or the approach based on the program dependence graph (PDG). In this paper, a comparative analysis of two approaches for source code vulnerability detection was conducted: approaches based on AST and approaches based on the PDG. In this paper, various topologies of neural networks were analyzed. They are used in approaches based on the AST and PDG. As the result of the comparison, the advantages and disadvantages of each approach were determined, and the results were summarized in the corresponding comparison tables. As a result of the analysis, it was determined that the use of BLSTM (Bidirectional Long Short Term Memory) and BGRU (Bidirectional Gated Linear Unit) gives the best result in terms of problems of source code vulnerability detection. As the analysis showed, the most effective approach for source code vulnerability detection systems is a method that uses an intermediate representation of the code, which allows getting a language-independent tool. Also, in this work, our own algorithm for the source code analysis system is proposed, which is able to perform the following operations: predict the source code vulnerability, classify the source code vulnerability, and generate a corresponding patch for the found vulnerability. A detailed analysis of the proposed system’s unresolved issues is provided, which is planned to investigate in future researches. The proposed system could help speed up the software development process as well as reduce the number of software code vulnerabilities. Software developers, as well as specialists in the field of cybersecurity, can be stakeholders of the proposed system.

Download Full-text

PreNNsem: A Heterogeneous Ensemble Learning Framework for Vulnerability Detection in Software

Applied Sciences ◽

10.3390/app10227954 ◽

2020 ◽

Vol 10 (22) ◽

pp. 7954

Author(s):

Lu Wang ◽

Xin Li ◽

Ruiheng Wang ◽

Yang Xin ◽

Mingcheng Gao ◽

...

Keyword(s):

Deep Learning ◽

Ensemble Learning ◽

Detection Efficiency ◽

False Positive Rate ◽

False Negative ◽

False Negative Rate ◽

Training Methods ◽

Vulnerability Detection ◽

Sample Data ◽

Heterogeneous Ensemble

Automated vulnerability detection is one of the critical issues in the realm of software security. Existing solutions to this problem are mostly based on features that are defined by human experts and directly lead to missed potential vulnerability. Deep learning is an effective method for automating the extraction of vulnerability characteristics. Our paper proposes intelligent and automated vulnerability detection while using deep representation learning and heterogeneous ensemble learning. Firstly, we transform sample data from source code by removing segments that are unrelated to the vulnerability in order to reduce code analysis and improve detection efficiency in our experiments. Secondly, we represent the sample data as real vectors by pre-training on the corpus and maintaining its semantic information. Thirdly, the vectors are fed to a deep learning model to obtain the features of vulnerability. Lastly, we train a heterogeneous ensemble classifier. We analyze the effectiveness and resource consumption of different network models, pre-training methods, classifiers, and vulnerabilities separately in order to evaluate the detection method. We also compare our approach with some well-known vulnerability detection commercial tools and academic methods. The experimental results show that our proposed method provides improvements in false positive rate, false negative rate, precision, recall, and F1 score.

Download Full-text

The Efficiency of Vulnerability Detection Based on Deep Learning

Advances in Intelligent Systems and Computing - Advancements in Mechatronics and Intelligent Robotics ◽

10.1007/978-981-16-1843-7_52 ◽

2021 ◽

pp. 449-455

Author(s):

Xue Yuan ◽

Peng Zeng ◽

Yonghang Tai ◽

Feiyan Cheng

Keyword(s):

Deep Learning ◽

Vulnerability Detection

Download Full-text

Vulnerability Detection and Analysis in Adversarial Deep Learning

Computer Communications and Networks - Guide to Vulnerability Analysis for Computer Networks and Systems ◽

10.1007/978-3-319-92624-7_9 ◽

2018 ◽

pp. 211-234 ◽

Cited By ~ 2

Author(s):

Yi Shi ◽

Yalin E. Sagduyu ◽

Kemal Davaslioglu ◽

Renato Levy

Keyword(s):

Deep Learning ◽

Vulnerability Detection

Download Full-text

DeeSCVHunter: A Deep Learning-Based Framework for Smart Contract Vulnerability Detection

10.1109/ijcnn52387.2021.9534324 ◽

2021 ◽

Author(s):

Xingxin Yu ◽

Haoyue Zhao ◽

Botao Hou ◽

Zonghao Ying ◽

Bin Wu

Keyword(s):

Deep Learning ◽

Vulnerability Detection ◽

Smart Contract

Download Full-text

μVulDeePecker: A Deep Learning-Based System for Multiclass Vulnerability Detection

IEEE Transactions on Dependable and Secure Computing ◽

10.1109/tdsc.2019.2942930 ◽

2019 ◽

pp. 1-1 ◽

Cited By ~ 4

Author(s):

Deqing Zou ◽

Sujuan Wang ◽

Shouhuai Xu ◽

Zhen Li ◽

Hai Jin

Keyword(s):

Deep Learning ◽

Vulnerability Detection

Download Full-text

DeeDP: vulnerability detection and patching based on deep learning

Theoretical and Applied Cybersecurity ◽

10.20535/tacs.2664-29132020.1.209465 ◽

2020 ◽

Vol 2 (1) ◽

Author(s):

A. Savchenko ◽

O. Fokin ◽

A. Chernousov ◽

O. Sinelnikova ◽

S. Osadchyi

Keyword(s):

Deep Learning ◽

Vulnerability Detection

Download Full-text

Detecting Vulnerability in Source Code Using CNN and LSTM Network

10.21203/rs.3.rs-174281/v1 ◽

2021 ◽

Author(s):

Junjun Guo ◽

Zhengyuan Wang ◽

Haonan Li ◽

Yang Xue

Keyword(s):

Deep Learning ◽

Rapid Development ◽

False Positive Rate ◽

Vulnerability Detection ◽

Detection Model ◽

Subjective Awareness ◽

Lower False Positive Rate ◽

Positive Rate ◽

Code Metrics ◽

Lstm Network

Abstract Vulnerabilities can have very serious consequences for information security, with huge implications for economic, social, and even national security. Automated vulnerability detection has always been a keen topic for researchers. From traditional manual vulnerability mining to static detection and dynamic detection, all rely on human experts to define features. The rapid development of machine learning and deep learning has alleviated the tedious task of manually defining features by human experts while reducing the lack of objectivity caused by human subjective awareness. However, we still need to find an objective characterization method to define the features of vulnerabilities. Therefore, we use code metrics for code characterization, which are sequences of metrics that represent code. To use code metrics for vulnerability detection, we propose VulnExplore, a deep learning-based vulnerability detection model that uses a composite neural network of CNN + LSTM for feature extraction and learning of code metrics. Experimental results show that VulnExplore has a lower false positive rate, a lower miss rate, and a better accuracy rate compared to other deep learning-based vulnerability detection models.

Download Full-text