Android Malware Detection using Function Call Graph with Graph Convolutional Networks

The openness of Android operating system not only brings convenience to users, but also leads to the attack threat from a large number of malicious applications (apps). Thus malware detection has become the research focus in the field of mobile security. In order to solve the problem of more coarse-grained feature selection and larger feature loss of graph structure existing in the current detection methods, we put forward a method named DGCNDroid for Android malware detection, which is based on the deep graph convolutional network. Our method starts by generating a function call graph for the decompiled Android application. Then the function call subgraph containing the sensitive application programming interface (API) is extracted. Finally, the function call subgraphs with structural features are trained as the input of the deep graph convolutional network. Thus the detection and classification of malicious apps can be realized. Through experimentation on a dataset containing 11,120 Android apps, the method proposed in this paper can achieve detection accuracy of 98.2%, which is higher than other existing detection methods.

Download Full-text

Using G Features to Improve the Efficiency of Function Call Graph Based Android Malware Detection

Wireless Personal Communications ◽

10.1007/s11277-018-5982-0 ◽

2018 ◽

Vol 103 (4) ◽

pp. 2947-2955 ◽

Cited By ~ 4

Author(s):

Yu Liu ◽

Liqiang Zhang ◽

Xiangdong Huang

Keyword(s):

Malware Detection ◽

Android Malware ◽

Call Graph ◽

Android Malware Detection ◽

Function Call

Download Full-text

AFCGDroid: Deep Learning Based Android Malware Detection Using Attributed Function Call Graphs

Journal of Physics Conference Series ◽

10.1088/1742-6596/1693/1/012080 ◽

2020 ◽

Vol 1693 ◽

pp. 012080

Author(s):

Tong Lu ◽

Xiaoyuan Liu ◽

Jingwei Chen ◽

Naitian Hu ◽

Bo Liu

Keyword(s):

Deep Learning ◽

Malware Detection ◽

Android Malware ◽

Android Malware Detection ◽

Function Call ◽

Call Graphs

Download Full-text

Heterogeneous Graph Convolutional Networks for Android Malware Detection using Callback-Aware Caller-Callee Graphs

10.36227/techrxiv.15072087 ◽

2021 ◽

Author(s):

Vinayaka K V ◽

Jaidhar C D

Keyword(s):

Malware Detection ◽

Application Programming Interface ◽

Extraction Methods ◽

Android Application ◽

Convolutional Network ◽

Android Malware ◽

Detection Model ◽

Convolutional Networks ◽

Android Malware Detection ◽

Ablation Study

<pre> The popularity of the Android Operating System in the smartphone market has given rise to lots of Android malware. To accurately detect these malware, many of the existing works use machine learning and deep learning-based methods, in which feature extraction methods were used to extract fixed-size feature vectors using the files present inside the Android Application Package (APK). Recently, Graph Convolutional Network (GCN) based methods applied on the Function Call Graph (FCG) extracted from the APK are gaining momentum in Android malware detection, as GCNs are effective at learning tasks on variable-sized graphs such as FCG, and FCG sufficiently captures the structure and behaviour of an APK. However, the FCG lacks information about callback methods as the Android Application Programming Interface (API) is event-driven. This paper proposes enhancing the FCG to eFCG (enhanced-FCG) using the callback information extracted using Android Framework Space Analysis to overcome this limitation. Further, we add permission - API method relationships to the eFCG. The eFCG is reduced using node contraction based on the classes to get R-eFCG (Reduced eFCG) to improve the generalisation ability of the Android malware detection model. The eFCG and R-eFCG are then given as the inputs to the Heterogeneous GCN models to determine whether the APK file from which they are extracted is malicious or not. To test the effectiveness of eFCG and R-eFCG, we conducted an ablation study by removing their various components. To determine the optimal neighbourhood size for GCN, we experimented with a varying number of GCN layers and found that the Android malware detection model using R-eFCG with all its components with four convolution layers achieved maximum accuracy of 96.28%.</pre>

Download Full-text

Heterogeneous Graph Convolutional Networks for Android Malware Detection using Callback-Aware Caller-Callee Graphs

10.36227/techrxiv.15072087.v1 ◽

2021 ◽

Author(s):

Vinayaka K V ◽

Jaidhar C D

Keyword(s):

Malware Detection ◽

Application Programming Interface ◽

Extraction Methods ◽

Android Application ◽

Convolutional Network ◽

Android Malware ◽

Detection Model ◽

Convolutional Networks ◽

Android Malware Detection ◽

Ablation Study

<pre> The popularity of the Android Operating System in the smartphone market has given rise to lots of Android malware. To accurately detect these malware, many of the existing works use machine learning and deep learning-based methods, in which feature extraction methods were used to extract fixed-size feature vectors using the files present inside the Android Application Package (APK). Recently, Graph Convolutional Network (GCN) based methods applied on the Function Call Graph (FCG) extracted from the APK are gaining momentum in Android malware detection, as GCNs are effective at learning tasks on variable-sized graphs such as FCG, and FCG sufficiently captures the structure and behaviour of an APK. However, the FCG lacks information about callback methods as the Android Application Programming Interface (API) is event-driven. This paper proposes enhancing the FCG to eFCG (enhanced-FCG) using the callback information extracted using Android Framework Space Analysis to overcome this limitation. Further, we add permission - API method relationships to the eFCG. The eFCG is reduced using node contraction based on the classes to get R-eFCG (Reduced eFCG) to improve the generalisation ability of the Android malware detection model. The eFCG and R-eFCG are then given as the inputs to the Heterogeneous GCN models to determine whether the APK file from which they are extracted is malicious or not. To test the effectiveness of eFCG and R-eFCG, we conducted an ablation study by removing their various components. To determine the optimal neighbourhood size for GCN, we experimented with a varying number of GCN layers and found that the Android malware detection model using R-eFCG with all its components with four convolution layers achieved maximum accuracy of 96.28%.</pre>

Download Full-text