A Novel Machine Learning-Based Analysis Model for Smart Contract Vulnerability

In recent years, many vulnerabilities have been found in smart contracts. Attackers have exploited these vulnerabilities in contracts developed on blockchain systems such as Ethereum, causing substantial economic losses. It is therefore important to identify potential problems in smart contracts and to develop more secure ones. As blockchain security incidents have drawn increasing attention, a growing number of smart contract security analysis methods have been developed. Most of these methods are based on traditional static or dynamic analysis; only a few use emerging technologies such as machine learning, and some of the machine learning models that detect smart contract vulnerabilities spend considerable time on manual feature extraction. In this paper, we introduce a novel machine learning-based analysis model for smart contract vulnerabilities that is built on shared child nodes. First, we build the abstract syntax trees (ASTs) of smart contracts with known vulnerabilities from two data sets, SmartBugs and SolidiFI-benchmark. Then, we build the ASTs of the labeled smart contracts in the data set named Smartbugs-wilds. Next, we extract the child nodes shared by both sets of ASTs to obtain their structural similarity, and we automatically construct a feature vector from the values that measure this similarity to build our machine learning model. Finally, we obtain a KNN model that can predict eight types of vulnerabilities, including Re-entrancy, Arithmetic, Access Control, Denial of Service, Unchecked Low Level Calls, Bad Randomness, and Front Running. The accuracy, recall, and precision of our KNN model are all higher than 90%. In addition, our model achieves higher accuracy than other analysis tools, including Oyente and SmartCheck. It also requires less time for training.
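The pipeline described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses Python's built-in `ast` module on Python snippets as a stand-in for a Solidity parser, it assumes that "shared child nodes" can be approximated by the parent-child node-type pairs that two trees have in common, and it replaces a library KNN with a small hand-written majority vote. The names `child_edges`, `shared_child_similarity`, `feature_vector`, and `knn_predict` are hypothetical.

```python
import ast
from collections import Counter


def child_edges(source):
    """Multiset of (parent node type, child node type) pairs in the AST."""
    edges = Counter()
    for parent in ast.walk(ast.parse(source)):
        for child in ast.iter_child_nodes(parent):
            edges[(type(parent).__name__, type(child).__name__)] += 1
    return edges


def shared_child_similarity(a, b):
    """Share of parent-child pairs common to both ASTs, between 0.0 and 1.0."""
    edges_a, edges_b = child_edges(a), child_edges(b)
    shared = sum((edges_a & edges_b).values())  # multiset intersection
    total = max(sum(edges_a.values()), sum(edges_b.values()))
    return shared / total if total else 0.0


def feature_vector(contract, references):
    """One similarity value per reference (known-vulnerable) program."""
    return [shared_child_similarity(contract, ref) for ref in references]


def knn_predict(query, training, k=3):
    """Majority label among the k training vectors closest to `query`.

    `training` is a list of (vector, label) pairs; distance is squared
    Euclidean distance.
    """
    def distance(item):
        return sum((x - y) ** 2 for x, y in zip(query, item[0]))

    nearest = sorted(training, key=distance)[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

In this sketch each labeled contract becomes a vector of similarities to the reference programs, so no features are extracted by hand; the classifier only ever sees those similarity values.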

Abstract Syntax Tree (AST) and Control Flow Graph (CFG) Construction of Notasi Algoritmik

10.1109/icodse53690.2021.9648437 ◽ 2021

Author(s): Irfan Sofyana Putra ◽ Satrio Adi Rukmono ◽ Riza Satria Perdana

Keyword(s): Control Flow ◽ Control Flow Graph ◽ Abstract Syntax ◽ Flow Graph ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ And Control

Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree

International Journal of Computer Applications ◽ 10.5120/15796-4494 ◽ 2014 ◽ Vol 90 (15) ◽ pp. 19-24 ◽ Cited By ~ 1

Author(s): D. Poongodi ◽ G. Tholkkappia Arasu

Keyword(s): Source Code ◽ Abstract Syntax ◽ Agent Based ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ Multi Agent

Multi-purpose Syntax Definition with SDF3

Software Engineering and Formal Methods - Lecture Notes in Computer Science ◽ 10.1007/978-3-030-58768-0_1 ◽ 2020 ◽ pp. 1-23 ◽ Cited By ~ 1

Author(s): Luís Eduardo de Souza Amorim ◽ Eelco Visser

Keyword(s): Error Recovery ◽ Abstract Syntax ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ Full Class ◽ High Level ◽ Modular Composition ◽ Tree Representations ◽ Context Free ◽ Context Free Grammars

SDF3 is a syntax definition formalism that extends plain context-free grammars with features such as constructor declarations, declarative disambiguation rules, character-level grammars, permissive syntax, layout constraints, formatting templates, placeholder syntax, and modular composition. These features support the multi-purpose interpretation of syntax definitions, including derivation of type schemas for abstract syntax tree representations, scannerless generalized parsing of the full class of context-free grammars, error recovery, layout-sensitive parsing, parenthesization and formatting, and syntactic completion. This paper gives a high-level overview of SDF3 by means of examples and provides a guide to the literature for further details.

Visualizing Project Evolution through Abstract Syntax Tree Analysis

2016 IEEE Working Conference on Software Visualization (VISSOFT) ◽ 10.1109/vissoft.2016.6 ◽ 2016 ◽ Cited By ~ 7

Author(s): Michael D. Feist ◽ Eddie Antonio Santos ◽ Ian Watts ◽ Abram Hindle

Keyword(s): Abstract Syntax ◽ Tree Analysis ◽ Abstract Syntax Tree ◽ Syntax Tree

Research on technology of transforming Abstract Syntax Tree of JAVA Language to Implementation Layer of Procedure Blueprint

The 2nd International Conference on Information Science and Engineering ◽ 10.1109/icise.2010.5689146 ◽ 2010

Author(s): Pei-Hong Tu ◽ Jian-Bin Liu

Keyword(s): Abstract Syntax ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ Java Language

WASTK: A Weighted Abstract Syntax Tree Kernel Method for Source Code Plagiarism Detection

Scientific Programming ◽ 10.1155/2017/7809047 ◽ 2017 ◽ Vol 2017 ◽ pp. 1-8 ◽ Cited By ~ 12

Author(s): Deqiang Fu ◽ Yanyan Xu ◽ Haoran Yu ◽ Boyang Yang

Keyword(s): Kernel Method ◽ Source Code ◽ Detection Methods ◽ Abstract Syntax ◽ Plagiarism Detection ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ Tree Kernel ◽ Document Frequency ◽ Abstract Syntax Trees

In this paper, we introduce a source code plagiarism detection method for computer science education, named WASTK (Weighted Abstract Syntax Tree Kernel). Unlike other plagiarism detection methods, WASTK takes into account aspects other than the similarity between programs. WASTK first converts the source code of a program into an abstract syntax tree and then obtains the similarity by calculating the tree kernel of two abstract syntax trees. To avoid misjudgment caused by trivial code snippets or by frameworks given by instructors, an idea similar to TF-IDF (Term Frequency-Inverse Document Frequency) from the field of information retrieval is applied: each node in an abstract syntax tree is assigned a weight by TF-IDF. WASTK is evaluated on different datasets and, as a result, performs much better than other popular methods such as Sim and JPlag.
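The weighting idea in this abstract can be illustrated with a short sketch. It is a deliberate simplification and not the WASTK kernel itself: instead of a full tree kernel it compares only depth-one fragments (a node type together with the types of its children), weights each fragment by its inverse document frequency across all submissions, and takes the cosine similarity. Python's built-in `ast` module stands in for whatever parser the authors used, and the names `productions`, `idf_weights`, and `weighted_similarity` are hypothetical.

```python
import ast
import math
from collections import Counter


def productions(source):
    """Count depth-one fragments: a node type plus the types of its children."""
    counts = Counter()
    for node in ast.walk(ast.parse(source)):
        children = tuple(type(c).__name__ for c in ast.iter_child_nodes(node))
        counts[(type(node).__name__, children)] += 1
    return counts


def idf_weights(corpus):
    """Inverse document frequency per fragment.

    A fragment that occurs in every submission (for example instructor
    template code) gets log(n / n) = 0 and stops contributing.
    """
    n = len(corpus)
    doc_freq = Counter()
    for counts in corpus:
        doc_freq.update(counts.keys())
    return {frag: math.log(n / df) for frag, df in doc_freq.items()}


def weighted_similarity(a, b, idf):
    """Cosine similarity of the TF-IDF weighted fragment counts."""
    wa = {f: c * idf.get(f, 0.0) for f, c in a.items()}
    wb = {f: c * idf.get(f, 0.0) for f, c in b.items()}
    dot = sum(w * wb.get(f, 0.0) for f, w in wa.items())
    norm_a = math.sqrt(sum(w * w for w in wa.values()))
    norm_b = math.sqrt(sum(w * w for w in wb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```

With this weighting, two submissions that share only the given scaffold score zero, while a copy with renamed variables still scores close to one because renaming does not change the node types.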

Specifying and Detecting Behavioral Changes in Source Code Using Abstract Syntax Tree Differencing

Trustworthy Computing and Services - Communications in Computer and Information Science ◽ 10.1007/978-3-642-35795-4_59 ◽ 2013 ◽ pp. 466-473

Author(s): Yuankui Li ◽ Linzhang Wang

Keyword(s): Source Code ◽ Behavioral Changes ◽ Abstract Syntax ◽ Abstract Syntax Tree ◽ Syntax Tree

Abstract Syntax Tree Based Source Code Antiplagiarism System for Large Projects Set

IEEE Access ◽ 10.1109/access.2020.3026422 ◽ 2020 ◽ Vol 8 ◽ pp. 175347-175359

Author(s): Michal Duracik ◽ Patrik Hrkut ◽ Emil Krsak ◽ Stefan Toth

Keyword(s): Source Code ◽ Abstract Syntax ◽ Abstract Syntax Tree ◽ Syntax Tree

Inferring Bug Patterns for Detecting Bugs in JavaScript By Analyzing Abstract Syntax Tree

2018 Joint 7th International Conference on Informatics, Electronics & Vision (ICIEV) and 2018 2nd International Conference on Imaging, Vision & Pattern Recognition (icIVPR) ◽ 10.1109/iciev.2018.8640995 ◽ 2018

Author(s): Afsana Tasnim ◽ Md. Rayhanur Rahman

Keyword(s): Abstract Syntax ◽ Abstract Syntax Tree ◽ Syntax Tree

Automatic Grading for Complex Multifile Programs

Complexity ◽ 10.1155/2020/3279053 ◽ 2020 ◽ Vol 2020 ◽ pp. 1-15

Author(s): Tiantian Wang ◽ Djoko Budi Santoso ◽ Kechao Wang ◽ Xiaohong Su

Keyword(s): Programming Problem ◽ Program Analysis ◽ Abstract Syntax ◽ Fusion Algorithm ◽ Abstract Syntax Tree ◽ Syntax Tree ◽ External Sources ◽ Standardization Process ◽ Automatic Grading ◽ And Function

This paper presents an automatic grading method, DGRADER, that handles complex multifile programs. Both its dynamic and its static grading support multifile program analysis, which is an advantage for complex programming problems that require more than one program file. Dynamic analysis uses the object file linker during compilation to link a complex multifile program. The static grading module consists of the following steps. First, the program is parsed into an abstract syntax tree, which is mapped into an abstract syntax tree data map. Then, preprocessor information is used to link the external sources called in the main program by means of a complex multifile program linker-fusion algorithm. Next, a standardization process performs problematic code removal, unused function removal, and function sequence ordering based on function calls. Finally, program matching tackles the structure variance problem by relying on the preceding standardization process and on simple tree matching using a tag classifier. The novelty of the approach is that it handles complex multifile program analysis with flexible grading that takes modularity and the large scale of programming problem complexity into account. The results show an improvement in grading precision, which gives a reliable grading score delivered by an intuitive system.
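Two of the standardization steps named in this abstract, unused function removal and function ordering based on function calls, can be sketched as follows. The abstract does not give DGRADER's algorithm, so this is only an assumed illustration: it works on Python source through the built-in `ast` module, treats a function named `main` as the entry point, and follows only direct calls by simple name. The function `standardize` and its `entry` parameter are hypothetical.

```python
import ast


def standardize(source, entry="main"):
    """Names of functions reachable from `entry`, in first-call order.

    Functions that are never reached are dropped, which corresponds to
    unused function removal; the returned order follows the call sequence.
    """
    tree = ast.parse(source)
    functions = {n.name: n for n in tree.body if isinstance(n, ast.FunctionDef)}

    def callees(func):
        # Direct calls by simple name, sorted into source order because
        # ast.walk does not guarantee it.
        calls = [
            n for n in ast.walk(func)
            if isinstance(n, ast.Call) and isinstance(n.func, ast.Name)
        ]
        calls.sort(key=lambda c: (c.lineno, c.col_offset))
        return [c.func.id for c in calls if c.func.id in functions]

    order = []

    def visit(name):
        if name in order or name not in functions:
            return
        order.append(name)
        for callee in callees(functions[name]):
            visit(callee)

    visit(entry)
    return order
```

Putting two submissions into the same canonical function order before tree matching means that differences in declaration order or leftover dead code no longer count as structural differences.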