bug reports
Recently Published Documents

TOTAL DOCUMENTS: 277 (FIVE YEARS: 132)
H-INDEX: 27 (FIVE YEARS: 7)
2022, Vol 31 (1), pp. 1-25
Author(s): Hui Xu, Zhuangbin Chen, Mingshen Sun, Yangfan Zhou, Michael R. Lyu

Rust is an emerging programming language that aims to prevent memory-safety bugs without sacrificing much efficiency. This claimed property is very attractive to developers, and many projects have started using the language. However, can Rust achieve the memory-safety promise? This article studies the question by surveying 186 real-world bug reports collected from several sources, including all existing Rust Common Vulnerabilities and Exposures (CVEs) for memory-safety issues as of 2020-12-31. We manually analyze each bug and extract its culprit pattern. Our analysis shows that Rust keeps its promise, in that all memory-safety bugs require unsafe code, and that many memory-safety bugs in our dataset are mild soundness issues that merely open the possibility of writing memory-safety bugs without unsafe code. Furthermore, we summarize three typical categories of memory-safety bugs: automatic memory reclaim, unsound function, and unsound generic or trait. While automatic memory reclaim bugs are a side effect of Rust's newly adopted ownership-based resource management scheme, unsound functions reveal the essential challenge of avoiding unsound code in Rust development, and unsound generics or traits intensify the risk of introducing unsoundness. Based on these findings, we propose two promising directions toward improving the security of Rust development: best practices for using specific APIs, and methods for detecting particular bugs involving unsafe code. Our work intends to raise more discussion of the memory-safety issues of Rust and to facilitate the maturity of the language.


Author(s): Som Gupta, Sanjai Kumar Gupta

Deep learning is one of the emerging and trending research areas of machine learning across various domains. This paper describes deep learning approaches applied to the domain of bug reports. It classifies the tasks performed in mining bug reports into bug report classification, bug localization, bug report summarization, and duplicate bug report detection. The paper systematically discusses the deep learning approaches used for each of these tasks and the future directions in this field of research.
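Of the tasks listed above, duplicate bug report detection is the easiest to illustrate concretely. Deep approaches learn dense embeddings of report text; the sketch below instead uses a simple lexical baseline (Jaccard token overlap) purely to show the task framing. The reports and the 0.5 threshold are illustrative, not taken from any cited study.

```python
def tokens(text):
    """Lowercase word tokens of a bug report title/description."""
    return set(text.lower().split())

def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def find_duplicates(new_report, known_reports, threshold=0.5):
    """Return known reports whose lexical similarity to new_report
    meets the threshold; deep models replace jaccard() with a learned
    embedding similarity."""
    new_toks = tokens(new_report)
    return [r for r in known_reports if jaccard(new_toks, tokens(r)) >= threshold]

known = [
    "app crashes when opening settings page",
    "login button unresponsive on mobile",
]
dups = find_duplicates("crashes when opening the settings page", known)
```

A learned model would score the second report pair by meaning rather than shared words, which is exactly where lexical baselines like this one fail.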


2021, Vol 12 (1), pp. 338
Author(s): Ömer Köksal, Bedir Tekinerdogan

Software bug report classification is a critical process for understanding the nature, implications, and causes of software failures. Furthermore, classification enables a fast and appropriate reaction to software bugs. However, for large-scale projects, one must deal with a broad set of bugs of multiple types. In this context, manually classifying bugs becomes cumbersome and time-consuming. Although several studies have addressed automated bug classification using machine learning techniques, they have mainly focused on academic case studies, open-source software, and unilingual text input. This paper presents our automated bug classification approach, applied and validated in an industrial case study. In contrast to earlier studies, our study targets a commercial software system based on unstructured bilingual bug reports written in English and Turkish. The presented approach adopts and integrates machine learning (ML), text mining, and natural language processing (NLP) techniques to support the classification of software bugs. Our results show that bug classification can be automated and even performs better than manual classification. Our study shows that the presented approach and the corresponding tools effectively reduce manual classification time and effort.
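The core of such a text-classification pipeline can be sketched in a few lines. The abstract does not name a specific model, so the following uses a minimal bag-of-words nearest-centroid classifier as a stand-in; the category names and training reports are hypothetical, and a production system would use a proper ML model and language-specific tokenization for the Turkish reports.

```python
from collections import Counter

def bow(text):
    """Bag-of-words term counts for one bug report."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = sum(v * v for v in a.values()) ** 0.5
    nb = sum(v * v for v in b.values()) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical labelled training reports (a real system trains on a
# far larger bilingual corpus).
training = {
    "ui": ["button overlaps label on settings screen", "menu icon misaligned"],
    "crash": ["application crashes on startup", "segfault when saving file"],
}

def centroid(texts):
    """Sum of bag-of-words vectors for a category's training reports."""
    c = Counter()
    for t in texts:
        c.update(bow(t))
    return c

def classify(report):
    """Assign the category whose training centroid is most similar."""
    vec = bow(report)
    return max(training, key=lambda cat: cosine(vec, centroid(training[cat])))
```

For example, `classify("app crashes when saving a file")` picks the "crash" category because its centroid shares the terms "crashes", "when", "saving", and "file" with the report.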


2021
Author(s): Aladdin Baarah, Ahmad Aloqaily, Hala Zyod, Nasser Mustafa

2021
Author(s): Gelareh Meidanipour Lahijany, Manuel Ohrndorf, Johannes Zenkert, Madjid Fathi, Udo Kelter

Author(s): Carolina Sokolowicz, Marcus Guidoti, Donat Agosti

Plazi is a non-profit organization focused on the liberation of data from taxonomic publications. In line with Plazi's goal of promoting the accessibility of taxonomic data, our team has developed different ways of getting the outside community involved. The Plazi community on GitHub encourages the scientific community and other contributors to post GGI-related (Golden Gate Imagine document editor) questions, requirements, ideas, and suggestions, including bug reports and feature requests. One can contact us via this GitHub community by creating either an Issue (to report problems with our data or related systems) or a Discussion (to post questions, ideas, or suggestions). We use GitHub's built-in label system to actively curate the content posted in this repository in order to facilitate further interaction, including filtering and searching before creating new entries. In the plazi/community repository, there is a Q&A (question and answer) section with selected questions and answers that may help solve encountered problems. Aiming to increase external participation in the task of liberating taxonomic data, we are developing training courses with independent learning modules that can be combined in different ways to target different audiences (e.g., undergraduates, researchers, developers) in various formats. This material will include text, screenshots, slides, screencasts, and, to a minor extent, online teaching. Each topic within a module will have one or more "inline tests", which will be HTML form-based with hard-coded answers to directly assess progress on the subject covered in that particular topic. At the end of each module, we will have a capstone (a form-based test asking questions about the topics covered in the respective module) which the user can access whenever needed. As examples of our independent learning modules, we can cite Modules I, II, and III and their respective topics.
Module I (Biodiversity Taxonomy Basis) includes introductory topics (e.g., Topic I — Why do we classify living things; Topic II — Linnaean binomial; Topic III — How is taxonomic information displayed in the literature) aimed at those who don't have a biology/taxonomy background. Module II (The Plazi way) topics (Topic I — Plazi mission; Topic II — Taxonomic treatments; Topic III — FAIR taxonomic treatments) are designed so that course takers can learn about Plazi processes. Module III (The Golden Gate Imagine) includes topics (Topic I — Introduction to GGI; Topic II — Other user-interface-based alternatives to annotate documents) about the document editor for marking up documents in XML. Other modules cover subjects such as individual extractions, material and treatment citations, data quality control, and more. On completion of a module, the user will be awarded a certificate. The combination of these certificates will grant badges that will translate into server permissions, allowing the user, for instance, to upload newly liberated taxonomic treatments and edit treatments already in the system. A taxonomic treatment is any piece of information about a given taxon concept that involves, includes, or results from an interpretation of that taxon concept. Additionally, the Plazi TreatmentBank APIs (Application Programming Interfaces) are currently being expanded and redesigned, and the documentation for these long-awaited endpoints will be presented, for the first time, in this talk.


2021
Author(s): Thi Mai Anh Bui, Nhat Hai Nguyen

Precisely locating buggy files for a given bug report is a cumbersome and time-consuming task, particularly in a large-scale project with thousands of source files and bug reports. An efficient bug localization module is desirable to improve the productivity of the software maintenance phase. Many previous approaches rank source files according to their relevance to a given bug report based on simple lexical matching scores. However, the lexical mismatch between the natural language expressions used to describe bug reports and the technical terms of software source code can reduce a bug localization system's accuracy. Incorporating domain knowledge through features such as semantic similarity, the fixing frequency of a source file, the code change history, and similar bug reports is crucial to efficiently locating buggy files. In this paper, we propose a bug localization model, BugLocGA, that leverages both lexical and semantic information and explores the relation between a bug report and a source file through several domain features. Given a bug report, we calculate a ranking score for every source file as a weighted sum of all features, where the weights are trained by a genetic algorithm to maximize the performance of the bug localization model on two evaluation metrics: mean reciprocal rank (MRR) and mean average precision (MAP). Empirical results on several widely used open-source software projects show that our model outperforms state-of-the-art approaches by effectively recommending the relevant files in which a bug should be fixed.
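The weighted-sum ranking and the MRR metric described above are easy to make concrete. In the sketch below the feature names, file names, and weight values are illustrative only; in BugLocGA the weights would be the output of the genetic algorithm rather than fixed constants.

```python
def score(features, weights):
    """Weighted sum of per-file relevance features for one bug report."""
    return sum(weights[k] * features[k] for k in weights)

def rank_files(bug_features, weights):
    """Rank candidate source files by descending relevance score."""
    return sorted(bug_features, key=lambda f: score(bug_features[f], weights),
                  reverse=True)

def mean_reciprocal_rank(rankings, buggy):
    """MRR over queries: rankings[i] is a ranked file list for bug i,
    buggy[i] is that bug's true buggy file."""
    total = 0.0
    for ranked, target in zip(rankings, buggy):
        total += 1.0 / (ranked.index(target) + 1)
    return total / len(rankings)

# Illustrative weights; a genetic algorithm would tune these to maximize MRR/MAP.
weights = {"lexical": 0.4, "semantic": 0.3, "fix_frequency": 0.3}
bug_features = {
    "Parser.java": {"lexical": 0.9, "semantic": 0.8, "fix_frequency": 0.5},
    "Utils.java":  {"lexical": 0.2, "semantic": 0.1, "fix_frequency": 0.9},
}
ranked = rank_files(bug_features, weights)
```

Here `Parser.java` scores 0.75 against 0.38 for `Utils.java`, so it is ranked first; if it is indeed the buggy file, this query contributes a reciprocal rank of 1.0 to the MRR.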


2021, Vol 26 (6)
Author(s): Camila Costa Silva, Matthias Galster, Fabian Gilson

Topic modeling using models such as Latent Dirichlet Allocation (LDA) is a text mining technique to extract human-readable semantic “topics” (i.e., word clusters) from a corpus of textual documents. In software engineering, topic modeling has been used to analyze textual data in empirical studies (e.g., to find out what developers talk about online), but also to build new techniques to support software engineering tasks (e.g., to support source code comprehension). Topic modeling needs to be applied carefully (e.g., depending on the type of textual data analyzed and modeling parameters). Our study aims at describing how topic modeling has been applied in software engineering research with a focus on four aspects: (1) which topic models and modeling techniques have been applied, (2) which textual inputs have been used for topic modeling, (3) how textual data was “prepared” (i.e., pre-processed) for topic modeling, and (4) how generated topics (i.e., word clusters) were named to give them a human-understandable meaning. We analyzed topic modeling as applied in 111 papers from ten highly-ranked software engineering venues (five journals and five conferences) published between 2009 and 2020. We found that (1) LDA and LDA-based techniques are the most frequent topic modeling techniques, (2) developer communication and bug reports have been modelled most, (3) data pre-processing and modeling parameters vary quite a bit and are often vaguely reported, and (4) manual topic naming (such as deducting names based on frequent words in a topic) is common.
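The pre-processing step that the survey found to be vaguely reported typically looks like the sketch below: lowercasing, punctuation stripping, stopword removal, and a minimum token length. The stopword list and `min_len` value here are arbitrary examples; exactly these choices (plus stemming, n-grams, and vocabulary cutoffs) are the parameters that vary between studies and affect the resulting topics.

```python
import re

# Illustrative stopword list; real pipelines use a standard list (e.g., NLTK's).
STOPWORDS = {"the", "a", "an", "is", "to", "of", "and", "in", "on", "when"}

def preprocess(document, min_len=3):
    """Prepare one document for topic modeling: lowercase, keep only
    alphabetic tokens, drop stopwords and very short tokens."""
    tokens = re.findall(r"[a-z]+", document.lower())
    return [t for t in tokens if t not in STOPWORDS and len(t) >= min_len]

docs = ["The parser crashes when reading large files.",
        "Developers discuss the new parser design."]
corpus = [preprocess(d) for d in docs]
```

The token lists in `corpus` would then be fed to an LDA implementation (e.g., gensim's), which infers word clusters such as one grouping "parser", "crashes", and "files".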


2021
Author(s): Pablo Restrepo Henao, Jannik Fischbach, Dominik Spies, Julian Frattini, Andreas Vogelsang
