code smell Latest Research Papers

Categorical Analysis of Code Smell Detection Using Machine Learning Algorithms

Intelligent Sustainable Systems - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-6309-3_66 ◽

2022 ◽

pp. 703-712

Author(s):

Ankita Bansal ◽

Utkarsh Jayant ◽

Abha Jain

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Categorical Analysis ◽

Code Smell

Automatic detection of Long Method and God Class code smells through neural source code embeddings

10.36227/techrxiv.17206010.v1 ◽

2021 ◽

Author(s):

Aleksandar Kovačević ◽

Jelena Slivka ◽

Dragan Vidaković ◽

Katarina-Glorija Grujić ◽

Nikola Luburić ◽

...

Keyword(s):

Machine Learning ◽

Large Scale ◽

Negative Impact ◽

Source Code ◽

Systematic Evaluation ◽

Small Scale ◽

Code Smells ◽

Code Metrics ◽

Code Smell ◽

F Measure

<p>Code smells are structures in code that often have a negative impact on its quality. Manually detecting code smells is challenging and researchers proposed many automatic code smell detectors. Most of the studies propose detectors based on code metrics and heuristics. However, these studies have several limitations, including evaluating the detectors using small-scale case studies and an inconsistent experimental setting. Furthermore, heuristic-based detectors suffer from limitations that hinder their adoption in practice. Thus, researchers have recently started experimenting with machine learning (ML) based code smell detection. </p><p>This paper compares the performance of multiple ML-based code smell detection models against multiple traditionally employed metric-based heuristics for detection of God Class and Long Method code smells. We evaluate the effectiveness of different source code representations for machine learning: traditionally used code metrics and code embeddings (code2vec, code2seq, and CuBERT).<br></p><p>We perform our experiments on the large-scale, manually labeled MLCQ dataset. We consider the binary classification problem – we classify the code samples as smelly or non-smelly and use the F1-measure of the minority (smell) class as a measure of performance. In our experiments, the ML classifier trained using CuBERT source code embeddings achieved the best performance for both God Class (F-measure of 0.53) and Long Method detection (F-measure of 0.75). With the help of a domain expert, we perform the error analysis to discuss the advantages of the CuBERT approach.<br></p><p>This study is the first to evaluate the effectiveness of pre-trained neural source code embeddings for code smell detection to the best of our knowledge. A secondary contribution of our study is the systematic evaluation of the effectiveness of multiple heuristic-based approaches on the same large-scale, manually labeled MLCQ dataset.<br></p>

Automatic detection of Long Method and God Class code smells through neural source code embeddings

10.36227/techrxiv.17206010 ◽

2021 ◽

Author(s):

Aleksandar Kovačević ◽

Jelena Slivka ◽

Dragan Vidaković ◽

Katarina-Glorija Grujić ◽

Nikola Luburić ◽

...

Keyword(s):

Machine Learning ◽

Large Scale ◽

Negative Impact ◽

Source Code ◽

Systematic Evaluation ◽

Small Scale ◽

Code Smells ◽

Code Metrics ◽

Code Smell ◽

F Measure

<p>Code smells are structures in code that often have a negative impact on its quality. Manually detecting code smells is challenging and researchers proposed many automatic code smell detectors. Most of the studies propose detectors based on code metrics and heuristics. However, these studies have several limitations, including evaluating the detectors using small-scale case studies and an inconsistent experimental setting. Furthermore, heuristic-based detectors suffer from limitations that hinder their adoption in practice. Thus, researchers have recently started experimenting with machine learning (ML) based code smell detection. </p><p>This paper compares the performance of multiple ML-based code smell detection models against multiple traditionally employed metric-based heuristics for detection of God Class and Long Method code smells. We evaluate the effectiveness of different source code representations for machine learning: traditionally used code metrics and code embeddings (code2vec, code2seq, and CuBERT).<br></p><p>We perform our experiments on the large-scale, manually labeled MLCQ dataset. We consider the binary classification problem – we classify the code samples as smelly or non-smelly and use the F1-measure of the minority (smell) class as a measure of performance. In our experiments, the ML classifier trained using CuBERT source code embeddings achieved the best performance for both God Class (F-measure of 0.53) and Long Method detection (F-measure of 0.75). With the help of a domain expert, we perform the error analysis to discuss the advantages of the CuBERT approach.<br></p><p>This study is the first to evaluate the effectiveness of pre-trained neural source code embeddings for code smell detection to the best of our knowledge. A secondary contribution of our study is the systematic evaluation of the effectiveness of multiple heuristic-based approaches on the same large-scale, manually labeled MLCQ dataset.<br></p>

A Novel Metric based Detection of Temporary Field Code Smell and its Empirical Analysis

Journal of King Saud University - Computer and Information Sciences ◽

10.1016/j.jksuci.2021.11.005 ◽

2021 ◽

Author(s):

Ruchin Gupta ◽

Sandeep Kumar Singh

Keyword(s):

Empirical Analysis ◽

Code Smell

Towards a systematic approach to manual annotation of code smells

10.36227/techrxiv.14159183.v2 ◽

2021 ◽

Author(s):

Nikola Luburić ◽

Simona Prokić ◽

Katarina-Glorija Grujić ◽

Jelena Slivka ◽

Aleksandar Kovačević ◽

...

Keyword(s):

Systematic Approach ◽

Manual Annotation ◽

Code Smells ◽

C Programming Language ◽

C Programming ◽

Software Engineers ◽

Code Smell ◽

Supporting Tool ◽

Almost All ◽

Primary Contribution

<div>Code smells are structures in code that indicate the presence of maintainability issues. A significant problem with code smells is their ambiguity. They are challenging to define, and software engineers have a different understanding of what a code smell is and which code suffers from code smells.</div><div>A solution to this problem could be an AI digital assistant that understands code smells and can detect (and perhaps resolve) them. However, it is challenging to develop such an assistant as there are few usable datasets of code smells on which to train and evaluate it. Furthermore, the existing datasets suffer from issues that mostly arise from an unsystematic approach used for their construction.</div><div>Through this work, we address this issue by developing a procedure for the systematic manual annotation of code smells. We use this procedure to build a dataset of code smells. During this process, we refine the procedure and identify recommendations and pitfalls for its use. The primary contribution is the proposed annotation model and procedure and the annotators’ experience report. The dataset and supporting tool are secondary contributions of our study. Notably, our dataset includes open-source projects written in the C# programming language, while almost all manually annotated datasets contain projects written in Java.</div>

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

Journal of Software Evolution and Process ◽

10.1002/smr.2403 ◽

2021 ◽

Author(s):

Yang Zhang ◽

Chunhao Dong

Keyword(s):

Attention Mechanism ◽

Residual Network ◽

Code Smell

How far are we from reproducible research on code smell detection? A systematic literature review

Information and Software Technology ◽

10.1016/j.infsof.2021.106783 ◽

2021 ◽

pp. 106783

Author(s):

Tomasz Lewowski ◽

Lech Madeyski

Keyword(s):

Literature Review ◽

Systematic Literature Review ◽

Reproducible Research ◽

Code Smell

A Study of Relevant Parameters Influencing Code Smell Prioritization in Object-Oriented Software Systems

10.1109/ispcc53510.2021.9609478 ◽

2021 ◽

Author(s):

Renu Verma ◽

Kuldeep Kumar ◽

Harsh K Verma

Keyword(s):

Object Oriented ◽

Software Systems ◽

Code Smell

Revealing Developers’ Arguments on Validating the Incidence of Code Smells: A Focus Group Experience

10.5753/vem.2021.17214 ◽

2021 ◽

Author(s):

Luis Felipi Junionello ◽

Rafael de Mello ◽

Roberto Oliveira ◽

Leonardo Sousa ◽

Alexander López ◽

...

Keyword(s):

Focus Group ◽

Tacit Knowledge ◽

Automated Detection ◽

Group Session ◽

Focus Group Session ◽

Code Smells ◽

Identifying Code ◽

Code Smell ◽

Human Validation ◽

Group Experience

Identifying code smells is considered a subjective task. Unfortunately, current automated detection tools cannot deal with such subjectivity, requiring human validation. Developers tend to follow different, albeit complementary, strategies when validating the identified smells. Intending to find out developers' arguments when validating the incidence of code smells, we conducted a focus group session with developers familiar with identifying code smells. We distributed them among two groups, in which they had to argue about the incidence of a code smell: either accepting or rejecting its presence. Based on their arguments, we compiled a set of general heuristics that developers follow when validating smells. We then used these heuristics for composing validation items. We understand that the set of validation items proposed may support developers in reflecting on the incidence of code smells. However, further studies are needed for reaching a more comprehensive and optimized set. The experience of this study reveals that conducting focus group sessions is helpful to emerge the tacit knowledge of developers when validating code smells.

How do Code Smell Co-occurrences Removal Impact Internal Quality Attributes? A Developers’ Perspective

10.1145/3474624.3474642 ◽

2021 ◽

Author(s):

Júlio Martins ◽

Carla Bezerra ◽

Anderson Uchôa ◽

Alessandro Garcia

Keyword(s):

Quality Attributes ◽

Internal Quality ◽

Code Smell

code smell
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Categorical Analysis of Code Smell Detection Using Machine Learning Algorithms

Automatic detection of Long Method and God Class code smells through neural source code embeddings

Automatic detection of Long Method and God Class code smells through neural source code embeddings

A Novel Metric based Detection of Temporary Field Code Smell and its Empirical Analysis

Towards a systematic approach to manual annotation of code smells

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

How far are we from reproducible research on code smell detection? A systematic literature review

A Study of Relevant Parameters Influencing Code Smell Prioritization in Object-Oriented Software Systems

Revealing Developers’ Arguments on Validating the Incidence of Code Smells: A Focus Group Experience

How do Code Smell Co-occurrences Removal Impact Internal Quality Attributes? A Developers’ Perspective

Export Citation Format

code smellRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Categorical Analysis of Code Smell Detection Using Machine Learning Algorithms

Automatic detection of Long Method and God Class code smells through neural source code embeddings

Automatic detection of Long Method and God Class code smells through neural source code embeddings

A Novel Metric based Detection of Temporary Field Code Smell and its Empirical Analysis

Towards a systematic approach to manual annotation of code smells

MARS: Detecting brain class/method code smell based on metric–attention mechanism and residual network

How far are we from reproducible research on code smell detection? A systematic literature review

A Study of Relevant Parameters Influencing Code Smell Prioritization in Object-Oriented Software Systems

Revealing Developers’ Arguments on Validating the Incidence of Code Smells: A Focus Group Experience

How do Code Smell Co-occurrences Removal Impact Internal Quality Attributes? A Developers’ Perspective

code smell
Recently Published Documents