Source Code Author Identification Method Combining Semantics and Statistical Features

Business Intelligence and Information Technology - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-030-92632-8_14 ◽

2021 ◽

pp. 141-151

Author(s):

Xu Sun ◽

Yutong Sun ◽

Leilei Kong ◽

Yong Han ◽

Hui Ning

Keyword(s):

Source Code ◽

Statistical Features ◽

Identification Method ◽

Author Identification

Download Full-text

A Temperature Identification Method Based on Chromaticity Statistical Features of Raw Format Visible Image and K-nearest Neighbor Algorithm

2020 IEEE 1st China International Youth Conference on Electrical Engineering (CIYCEE) ◽

10.1109/ciycee49808.2020.9332599 ◽

2020 ◽

Author(s):

Wenmao Li ◽

Qizheng Ye ◽

Zhe Yuan ◽

Yang He

Keyword(s):

Nearest Neighbor ◽

Statistical Features ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Visible Image ◽

Identification Method ◽

K Nearest Neighbor Algorithm

Download Full-text

ICodeNet - A Hierarchical Neural Network Approach For Source Code Author Identification

2021 13th International Conference on Machine Learning and Computing ◽

10.1145/3457682.3457709 ◽

2021 ◽

Author(s):

Pranali Bora ◽

Tulika Awalgaonkar ◽

Himanshu Palve ◽

Raviraj Joshi ◽

Purvi Goel

Keyword(s):

Neural Network ◽

Source Code ◽

Network Approach ◽

Neural Network Approach ◽

Author Identification ◽

Hierarchical Neural Network

Download Full-text

Zero-Shot Source Code Author Identification: A Lexicon and Layout Independent Approach

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9207647 ◽

2020 ◽

Author(s):

Pegah Hozhabrierdi ◽

Dunai Fuentes Hitos ◽

Chilukuri K. Mohan

Keyword(s):

Source Code ◽

Author Identification

Download Full-text

Author Identification of Software Source Code with Program Dependence Graphs

2010 IEEE 34th Annual Computer Software and Applications Conference Workshops ◽

10.1109/compsacw.2010.56 ◽

2010 ◽

Author(s):

Rong Chen ◽

Lina Hong ◽

Chunyan Lu ◽

Wu Deng

Keyword(s):

Source Code ◽

Author Identification ◽

Dependence Graphs ◽

Program Dependence

Download Full-text

Author Identification in Imbalanced Sets of Source Code Samples

2012 IEEE 24th International Conference on Tools with Artificial Intelligence ◽

10.1109/ictai.2012.112 ◽

2012 ◽

Author(s):

E. Chatzicharalampous ◽

G. Frantzeskou ◽

E. Stamatatos

Keyword(s):

Source Code ◽

Author Identification

Download Full-text

Deep Neural Networks for Source Code Author Identification

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-42042-9_46 ◽

2013 ◽

pp. 368-375 ◽

Author(s):

Upul Bandara ◽

Gamini Wijayarathna

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Source Code ◽

Author Identification

Download Full-text

On the Use of Discretized Source Code Metrics for Author Identification

2009 1st International Symposium on Search Based Software Engineering ◽

10.1109/ssbse.2009.18 ◽

2009 ◽

Author(s):

Maxim Shevertalov ◽

Jay Kothari ◽

Edward Stehle ◽

Spiros Mancoridis

Keyword(s):

Source Code ◽

Author Identification ◽

Code Metrics ◽

Source Code Metrics

Download Full-text

Source code author identification with unsupervised feature learning

Pattern Recognition Letters ◽

10.1016/j.patrec.2012.10.027 ◽

2013 ◽

Vol 34 (3) ◽

pp. 330-334 ◽

Author(s):

Upul Bandara ◽

Gamini Wijayarathna

Keyword(s):

Source Code ◽

Feature Learning ◽

Unsupervised Feature Learning ◽

Author Identification

Download Full-text

PyComm: Malicious commands detection model for python scripts

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211557 ◽

2021 ◽

pp. 1-13

Author(s):

Anmin Zhou ◽

Tianyi Huang ◽

Cheng Huang ◽

Dunhan Li ◽

Chuangchuang Song

Keyword(s):

Machine Learning ◽

Random Forest ◽

Static Analysis ◽

Source Code ◽

The Other ◽

Statistical Features ◽

Dynamic Object ◽

Excellent Performance ◽

Common Features ◽

Detection Model

Python is a concise language which can be used to build lightweight tools or dynamic object-orientated applications. The various attributes of Python have made it attractive to numerous malware authors. Attackers often embed malicious shell commands into Python scripts for illegal operations. However, traditional static analysis methods are not feasible to detect this kind of attack because they focus on common features and failure in finding those malicious commands. On the other hand, dynamic analysis is not optimal in this case for its time-consuming and inefficient. In this paper, we propose PyComm, a model for detecting malicious commands in Python scripts with multidimensional features based on machine learning, which considers both 12 statistical features and string sequences of Python source code. Meanwhile, three comparison experiments are designed to evaluate the validity of proposed method. Experimental results show that presented model has achieved an excellent performance based on those practical features and random forest (RF) algorithm, obtained an accuracy of 0.955 with a recall of 0.943.

Download Full-text

6mA-Pred: identifying DNA N6-methyladenine sites based on deep learning

PeerJ ◽

10.7717/peerj.10813 ◽

2021 ◽

Vol 9 ◽

pp. e10813

Author(s):

Qianfei Huang ◽

Wenyang Zhou ◽

Fei Guo ◽

Lei Xu ◽

Lichao Zhang

Keyword(s):

Deep Learning ◽

Mus Musculus ◽

Source Code ◽

Experimental Results ◽

Individual Species ◽

Identification Method ◽

Excellent Method ◽

Multiple Species ◽

Site Recognition

With the accumulation of data on 6mA modification sites, an increasing number of scholars have begun to focus on the identification of 6mA sites. Despite the recognized importance of 6mA sites, methods for their identification remain lacking, with most existing methods being aimed at their identification in individual species. In the present study, we aimed to develop an identification method suitable for multiple species. Based on previous research, we propose a method for 6mA site recognition. Our experiments prove that the proposed 6mA-Pred method is effective for identifying 6mA sites in genes from taxa such as rice, Mus musculus, and human. A series of experimental results show that 6mA-Pred is an excellent method. We provide the source code used in the study, which can be obtained from http://39.100.246.211:5004/6mA_Pred/.

Download Full-text