Android Smells Detection Using ML Algorithms with Static Code Metrics

Bug prediction aims at finding source code elements in a software system that are likely to contain defects. Being aware of the most error-prone parts of the program, one can efficiently allocate the limited amount of testing and code review resources. Therefore, bug prediction can support software maintenance and evolution to a great extent. In this paper, we propose a function level JavaScript bug prediction model based on static source code metrics with the addition of a hybrid (static and dynamic) code analysis based metric of the number of incoming and outgoing function calls (HNII and HNOI). Our motivation for this is that JavaScript is a highly dynamic scripting language for which static code analysis might be very imprecise; therefore, using a purely static source code features for bug prediction might not be enough. Based on a study where we extracted 824 buggy and 1943 non-buggy functions from the publicly available BugsJS dataset for the ESLint JavaScript project, we can confirm the positive impact of hybrid code metrics on the prediction performance of the ML models. Depending on the ML algorithm, applied hyper-parameters, and target measures we consider, hybrid invocation metrics bring a 2–10% increase in model performances (i.e., precision, recall, F-measure). Interestingly, replacing static NOI and NII metrics with their hybrid counterparts HNOI and HNII in itself improves model performances; however, using them all together yields the best results.

Download Full-text

Source Code Metrics to Predict the Properties of FPGA/VHDL-Based Synthesized Products

2018 6th International Conference in Software Engineering Research and Innovation (CONISOFT) ◽

10.1109/conisoft.2018.8645854 ◽

2018 ◽

Author(s):

Oscar E. Perez-Cham ◽

Carlos Soubervielle- Montalvo ◽

Alberto S. Nunez-Varela ◽

Cesar Puente ◽

Luis J. Ontanon-Garcia

Keyword(s):

Source Code ◽

Code Metrics ◽

Source Code Metrics

Download Full-text

Mutual Information-Based Feature Selection in Fuzzy Databases Applied to Searching for the Best Code Metrics in Automatic Grading

Lecture Notes in Computer Science - Hybrid Artificial Intelligence Systems ◽

10.1007/978-3-319-07617-1_30 ◽

2014 ◽

pp. 330-341

Author(s):

José Otero ◽

Rosario Suárez ◽

Luciano Sánchez

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Code Metrics ◽

Automatic Grading

Download Full-text

A Study of the Relationships between Source Code Metrics and Attractiveness in Free Software Projects

2010 Brazilian Symposium on Software Engineering ◽

10.1109/sbes.2010.27 ◽

2010 ◽

Cited By ~ 18

Author(s):

Paulo Meirelles ◽

Carlos Santos Jr. ◽

Joao Miranda ◽

Fabio Kon ◽

Antonio Terceiro ◽

...

Keyword(s):

Source Code ◽

Free Software ◽

Software Projects ◽

Code Metrics ◽

Source Code Metrics

Download Full-text

Automatic detection of Long Method and God Class code smells through neural source code embeddings

10.36227/techrxiv.17206010.v1 ◽

2021 ◽

Author(s):

Aleksandar Kovačević ◽

Jelena Slivka ◽

Dragan Vidaković ◽

Katarina-Glorija Grujić ◽

Nikola Luburić ◽

...

Keyword(s):

Machine Learning ◽

Large Scale ◽

Negative Impact ◽

Source Code ◽

Systematic Evaluation ◽

Small Scale ◽

Code Smells ◽

Code Metrics ◽

Code Smell ◽

F Measure

Code smells are structures in code that often have a negative impact on its quality. Manually detecting code smells is challenging and researchers proposed many automatic code smell detectors. Most of the studies propose detectors based on code metrics and heuristics. However, these studies have several limitations, including evaluating the detectors using small-scale case studies and an inconsistent experimental setting. Furthermore, heuristic-based detectors suffer from limitations that hinder their adoption in practice. Thus, researchers have recently started experimenting with machine learning (ML) based code smell detection. This paper compares the performance of multiple ML-based code smell detection models against multiple traditionally employed metric-based heuristics for detection of God Class and Long Method code smells. We evaluate the effectiveness of different source code representations for machine learning: traditionally used code metrics and code embeddings (code2vec, code2seq, and CuBERT). We perform our experiments on the large-scale, manually labeled MLCQ dataset. We consider the binary classification problem – we classify the code samples as smelly or non-smelly and use the F1-measure of the minority (smell) class as a measure of performance. In our experiments, the ML classifier trained using CuBERT source code embeddings achieved the best performance for both God Class (F-measure of 0.53) and Long Method detection (F-measure of 0.75). With the help of a domain expert, we perform the error analysis to discuss the advantages of the CuBERT approach. This study is the first to evaluate the effectiveness of pre-trained neural source code embeddings for code smell detection to the best of our knowledge. A secondary contribution of our study is the systematic evaluation of the effectiveness of multiple heuristic-based approaches on the same large-scale, manually labeled MLCQ dataset.

Download Full-text

An Empirical Analysis on Effectiveness of Source Code Metrics for Aging Related Bug Prediction

10.18293/jvlc2019-n2-022 ◽

2019 ◽

Vol 2019 (2) ◽

pp. 117-126

Author(s):

Chinmay Hota ◽

Lov Kumar ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Empirical Analysis ◽

Source Code ◽

Code Metrics ◽

Source Code Metrics

Download Full-text

An efficient Software Source Code Metrics for Implementing for Software quality analysis

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2019/01792019 ◽

2019 ◽

pp. 216-222

Author(s):

Varun K L Srivastava ◽

Keyword(s):

Software Quality ◽

Source Code ◽

Quality Analysis ◽

Code Metrics ◽

Source Code Metrics ◽

Efficient Software

Download Full-text

Software energy consumption prediction using software code metrics

10.32920/ryerson.14666424.v1 ◽

2021 ◽

Author(s):

Sedef Akinli Koçak

Keyword(s):

Energy Efficiency ◽

Energy Consumption ◽

Prediction Models ◽

Code Size ◽

Research Attention ◽

Software Products ◽

Code Metrics ◽

Software Code ◽

Energy Consumption Prediction ◽

Consumption Prediction

In recent years, a significant amount of energy consumption of ICT products has resulted in environmental concerns. Growing demand for mobile devices, personal computers, and the widespread adaptation of cloud computing and data centers are the main drivers for the energy consumption of the ICT systems. Finding solutions for improving the energy efficiency of the systems has become an important objective for both industry and academia. In order to address the increase in ICT energy consumption, hardware technology, such as production of energy efficient processors, has been substantially improved. However, demand for energy is growing faster than improvements are being made on these energy-aware technologies. Therefore, in addition to hardware, software technologies must also be a focus of research attention. Although software does not consume energy by itself, its characteristics determine which hardware resources are made available and how much electrical energy is used. Current literature on the energy efficiency of software, highlights, in particular, a lack of measurements and models. In this dissertation, first, the relationship between software code properties and energy consumption is explored. Second, using static code metrics regression based energy consumption prediction models are investigated. Finally, the models performance are assessed using within product and cross-product energy consumption prediction approaches. For this purpose, a quantitative based retrospective cohort study was employed. As research methods, observational data collection, mining software repositories, and regression analysis were utilized. This research results show inconsistent relationships between energy consumption and code size and complexity attributes considering different types of software products. Such results provide a foundation of knowledge that static code attributes may give some insights but would not be the sole predictors of energy consumption of software products.

Download Full-text

Predicting Android Application Security and Privacy Risk with Static Code Metrics

2017 IEEE/ACM 4th International Conference on Mobile Software Engineering and Systems (MOBILESoft) ◽

10.1109/mobilesoft.2017.14 ◽

2017 ◽

Cited By ~ 4

Author(s):

Akond Rahman ◽

Priysha Pradhan ◽

Asif Partho ◽

Laurie Williams

Keyword(s):

Security And Privacy ◽

Android Application ◽

Privacy Risk ◽

Application Security ◽

Code Metrics

Download Full-text

AN EMPIRICAL STUDY OF SOFTWARE METRICS FOR ASSESSING THE PHASES OF AN AGILE PROJECT

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194012500131 ◽

2012 ◽

Vol 22 (04) ◽

pp. 525-548 ◽

Cited By ~ 12

Author(s):

GIULIO CONCAS ◽

MICHELE MARCHESI ◽

GIUSEPPE DESTEFANIS ◽

ROBERTO TONELLI

Keyword(s):

Software Quality ◽

Web Application ◽

Software Metrics ◽

Pair Programming ◽

Application Development ◽

Agile Practices ◽

Code Metrics ◽

Agile Process ◽

Adoption Level

We present an analysis of the evolution of a Web application project developed with object-oriented technology and an agile process. During the development we systematically performed measurements on the source code, using software metrics that have been proved to be correlated with software quality, such as the Chidamber and Kemerer suite and Lines of Code metrics. We also computed metrics derived from the class dependency graph, including metrics derived from Social Network Analysis. The application development evolved through phases, characterized by a different level of adoption of some key agile practices — namely pair programming, test-based development and refactoring. The evolution of the metrics of the system, and their behavior related to the agile practices adoption level, is presented and discussed. We show that, in the reported case study, a few metrics are enough to characterize with high significance the various phases of the project. Consequently, software quality, as measured using these metrics, seems directly related to agile practices adoption.

Download Full-text