A Prediction Model Based on ISOMAP for Software Defects

2013 ◽  
Vol 347-350 ◽  
pp. 3278-3282
Author(s):  
Sheng Li Shi ◽  
Jin Shi ◽  
Rui Wang

To improve and guarantee the quality of software, it is necessary to effectively predict which modules contain defects. Software quality prediction typically involves many metric attributes, which often leads to the curse of dimensionality. To address this, a new algorithm based on ISOMAP, combining manifold learning with classification methods, was presented to predict software defects. In the model, the high-dimensional software metric data are first mapped into a low-dimensional space through ISOMAP. The low-dimensional features are then classified with KNN, SVM, and NB. Experiments demonstrate that the new model improves the prediction precision of software defects and greatly improves the efficiency of the algorithm.
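A minimal sketch of the described pipeline, not the authors' implementation: reduce software-metric features with ISOMAP, then classify the embedded points with KNN. The dataset, labels, and parameter values below are illustrative assumptions; SVM or Naive Bayes could be substituted for the classifier.

```python
import numpy as np
from sklearn.manifold import Isomap
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# X: one row per module, columns are software metrics; y: 1 = defective.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 21))           # placeholder for real metric data
y = (X[:, 0] + X[:, 1] > 0).astype(int)  # placeholder defect labels

# Map the high-dimensional metric data into a low-dimensional manifold space.
embedding = Isomap(n_neighbors=10, n_components=5)
X_low = embedding.fit_transform(X)

# Classify the embedded points with KNN.
X_tr, X_te, y_tr, y_te = train_test_split(X_low, y, random_state=0)
clf = KNeighborsClassifier(n_neighbors=5).fit(X_tr, y_tr)
print("accuracy:", clf.score(X_te, y_te))
```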

Author(s):  
Feidu Akmel ◽  
Ermiyas Birihanu ◽  
Bahir Siraj

Software systems are software products or applications that support business domains such as manufacturing, aviation, health care, insurance, and so on. Software quality is a means of measuring how software is designed and how well the software conforms to that design. Some of the variables we look for in software quality are correctness, product quality, scalability, completeness, and absence of bugs. However, the quality standard used by one organization differs from that of another, so it is better to apply software metrics to measure the quality of software. Attributes gathered from source code through software metrics can serve as input for a software defect predictor. Software defects are errors introduced by software developers and stakeholders. Finally, in this study we review the application of machine learning to software defect data gathered from previous research works.
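As a small illustration of gathering attributes from source code, the sketch below computes a few simple metrics (lines of code, function count, average function length) of the kind that could feed a defect predictor. It assumes Python source as input; the metric set is illustrative, not the one used in the study.

```python
import ast

def simple_metrics(source: str) -> dict:
    """Extract a few surface-level code metrics from Python source."""
    tree = ast.parse(source)
    funcs = [n for n in ast.walk(tree) if isinstance(n, ast.FunctionDef)]
    loc = len([ln for ln in source.splitlines() if ln.strip()])
    avg_len = (sum(f.end_lineno - f.lineno + 1 for f in funcs) / len(funcs)
               if funcs else 0.0)
    return {"loc": loc, "functions": len(funcs), "avg_function_loc": avg_len}

example = "def add(a, b):\n    return a + b\n"
print(simple_metrics(example))
```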


2019 ◽  
Vol 9 (13) ◽  
pp. 2764 ◽  
Author(s):  
Abdullateef Oluwagbemiga Balogun ◽  
Shuib Basri ◽  
Said Jadid Abdulkadir ◽  
Ahmad Sobri Hashim

Software Defect Prediction (SDP) models are built using software metrics derived from software systems. The quality of SDP models depends largely on the quality of the software metrics (dataset) used to build them. High dimensionality is one of the data quality problems that affect the performance of SDP models. Feature selection (FS) is a proven method for addressing the dimensionality problem. However, the choice of FS method for SDP is still a problem, as most empirical studies on FS methods for SDP produce contradictory and inconsistent quality outcomes. These FS methods behave differently due to their different underlying computational characteristics. This could be due to the choice of search method used in FS, because the impact of FS depends on the choice of search method. It is hence imperative to comparatively analyze the performance of FS methods based on different search methods in SDP. In this paper, four filter feature ranking (FFR) and fourteen filter feature subset selection (FSS) methods were evaluated using four different classifiers over five software defect datasets obtained from the National Aeronautics and Space Administration (NASA) repository. The experimental analysis showed that the application of FS improves the predictive performance of classifiers and that the performance of FS methods can vary across datasets and classifiers. Among the FFR methods, Information Gain demonstrated the greatest improvements in the performance of the prediction models. Among the FSS methods, Consistency Feature Subset Selection based on Best First Search had the best influence on the prediction models. However, prediction models based on FFR proved to be more stable than those based on FSS methods. Hence, we conclude that FS methods improve the performance of SDP models, and that there is no single best FS method, as their performance varied according to the datasets and the choice of prediction model. However, we recommend the use of FFR methods, as the prediction models based on FFR are more stable in terms of predictive performance.
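A minimal sketch of filter feature ranking, not the paper's full protocol: rank features by information gain (approximated here by mutual information) and compare a classifier trained with and without feature selection. The dataset, classifier, and number of retained features are illustrative assumptions; the consistency-based subset selection with Best First Search used in the paper is available in tools such as Weka rather than scikit-learn.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 40))           # placeholder for a NASA defect dataset
y = (X[:, 0] - X[:, 3] > 0).astype(int)  # placeholder defect labels

# Baseline: classifier on all features vs. classifier on the top-ranked features.
baseline = cross_val_score(GaussianNB(), X, y, cv=5).mean()
ffr = make_pipeline(SelectKBest(mutual_info_classif, k=10), GaussianNB())
selected = cross_val_score(ffr, X, y, cv=5).mean()
print(f"without FS: {baseline:.3f}  with IG ranking: {selected:.3f}")
```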


Sensors ◽  
2019 ◽  
Vol 19 (23) ◽  
pp. 5097 ◽  
Author(s):  
David Agis ◽  
Francesc Pozo

This work presents a structural health monitoring (SHM) approach for the detection and classification of structural changes. The proposed strategy is based on t-distributed stochastic neighbor embedding (t-SNE), a nonlinear procedure that is able to represent the local structure of high-dimensional data in a low-dimensional space. The steps of the detection and classification procedure are: (i) the collected data are scaled using mean-centered group scaling (MCGS); (ii) principal component analysis (PCA) is applied to reduce the dimensionality of the data set; (iii) t-SNE is applied to represent the scaled and reduced data as points in a plane, defining as many clusters as there are structural states; and (iv) the current structure to be diagnosed is associated with a cluster or structural state based on three strategies: (a) the smallest point-centroid distance; (b) majority voting; and (c) the sum of the inverse distances. The combination of PCA and t-SNE improves the quality of the clusters related to the structural states. The method is evaluated using experimental data from an aluminum plate with four piezoelectric transducers (PZTs). Results are illustrated in the frequency domain, and they demonstrate the high classification accuracy and strong performance of this method.
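A minimal sketch of the embedding stage under assumed parameters: scale, reduce with PCA, embed with t-SNE, and assign an observation to the nearest cluster centroid (strategy (a) above). MCGS is approximated here by ordinary mean-centering; the sensor data and state labels are placeholders.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 500))        # placeholder PZT measurements
states = np.repeat(np.arange(4), 30)   # four known structural states

X_centered = X - X.mean(axis=0)        # stand-in for MCGS scaling
X_pca = PCA(n_components=20).fit_transform(X_centered)
X_2d = TSNE(n_components=2, perplexity=15, random_state=0).fit_transform(X_pca)

# Strategy (a): assign a point to the structural state with the closest centroid.
centroids = np.array([X_2d[states == s].mean(axis=0) for s in range(4)])
query = X_2d[0]
predicted_state = np.argmin(np.linalg.norm(centroids - query, axis=1))
print("assigned structural state:", predicted_state)
```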


2004 ◽  
Vol 3 (2) ◽  
pp. 109-122 ◽  
Author(s):  
Alistair Morrison ◽  
Matthew Chalmers

The problem of exploring or visualising data of high dimensionality is central to many tools for information visualisation. Through representing a data set in terms of inter-object proximities, multidimensional scaling may be employed to generate a configuration of objects in low-dimensional space in such a way as to preserve high-dimensional relationships. An algorithm is presented here for a heuristic hybrid model for the generation of such configurations. Building on a model introduced in 2002, the algorithm functions by means of sampling, spring model and interpolation phases. The most computationally complex stage of the original algorithm involved the execution of a series of nearest-neighbour searches. In this paper, we describe how the complexity of this phase has been reduced by treating all high-dimensional relationships as a set of discretised distances to a constant number of randomly selected items: pivots. In improving this computational bottleneck, the algorithmic complexity is reduced from O(N√N) to O(N^(5/4)). As well as documenting this improvement, the paper describes evaluation with a data set of 108,000 13-dimensional items and a set of 23,141 17-dimensional items. Results illustrate that the reduction in complexity is reflected in significantly improved run times and that no negative impact is made upon the quality of layout produced.
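A minimal sketch of the pivot idea, with assumed parameters and bucket width: each item is summarised by its discretised distances to a constant number of random pivots, and candidate nearest neighbours are items sharing the same distance signature, which avoids a full scan of the data per query.

```python
from collections import defaultdict
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(size=(10_000, 13))   # e.g. 13-dimensional items
n_pivots, bucket = 4, 0.5

# Signature: distances to each pivot, quantised into buckets of fixed width.
pivots = data[rng.choice(len(data), n_pivots, replace=False)]
signatures = np.floor(np.linalg.norm(
    data[:, None, :] - pivots[None, :, :], axis=2) / bucket).astype(int)

# Index items by their pivot-distance signature.
index = defaultdict(list)
for i, sig in enumerate(map(tuple, signatures)):
    index[sig].append(i)

def approx_neighbours(i):
    """Candidate neighbours: items whose signature matches item i's."""
    return [j for j in index[tuple(signatures[i])] if j != i]

print("candidates for item 0:", approx_neighbours(0)[:10])
```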


Author(s):  
Venkatesh Podugu

Software maintenance is one of the main phases of software evolution. This paper presents the relation between software metrics and maintainability, and explains the concept of software code readability and its relation to software quality. The quality of code is essential for the future and for reuse. A code readability model is generated to calculate the readability of code by selecting snippets and giving these snippets to experts to rate. The readability model is built by collecting features of the code and combining the expert judgments. This paper focuses on providing a graphical user interface (GUI) for the code readability model to improve the understanding of software code readability. By reporting the readability of code for many open source projects, the existing code quality is automatically surfaced so that it can be improved. We show that the developed readability model correlates strongly with three measures of software quality: code changes in software, defect log messages, and automated defect reports. Correlations are measured over many releases of the selected projects.
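A minimal sketch, not the paper's model: compute a few surface features of a code snippet (average line length, average identifier length, comment density) of the kind that could be combined with expert ratings to train a readability model. The feature choices are illustrative assumptions.

```python
import re

def readability_features(snippet: str) -> dict:
    """Surface-level features of a code snippet for a readability model."""
    lines = [ln for ln in snippet.splitlines() if ln.strip()]
    identifiers = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", snippet)
    comments = [ln for ln in lines if ln.strip().startswith("#")]
    return {
        "avg_line_length": sum(map(len, lines)) / len(lines) if lines else 0.0,
        "avg_identifier_length": (sum(map(len, identifiers)) / len(identifiers)
                                  if identifiers else 0.0),
        "comment_ratio": len(comments) / len(lines) if lines else 0.0,
    }

snippet = "# add two numbers\ndef add(a, b):\n    return a + b\n"
print(readability_features(snippet))
```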


NeuroImage ◽  
2021 ◽  
pp. 118200
Author(s):  
Sayan Ghosal ◽  
Qiang Chen ◽  
Giulio Pergola ◽  
Aaron L. Goldman ◽  
William Ulrich ◽  
...  

2021 ◽  
Vol 40 (5) ◽  
pp. 9361-9382 ◽  
Author(s):  
Naeem Iqbal ◽  
Rashid Ahmad ◽  
Faisal Jamil ◽  
Do-Hyeun Kim

Quality prediction plays an essential role in the business outcome of a product. Due to its business relevance, the concept has been studied extensively in the last few years. With the advancement of machine learning (ML) techniques and the advent of robust and sophisticated ML algorithms, it is necessary to analyze the factors influencing the success of movies. This paper presents a hybrid-features prediction model based on pre-released and social media data features, using multiple ML techniques to predict the quality of pre-released movies for effective business resource planning. This study aims to integrate pre-released and social media data features to form a hybrid features-based movie quality prediction (MQP) model. The proposed model comprises two different experimental models: (i) predicting movie quality using the original set of features and (ii) developing a subset of features based on the principal component analysis (PCA) technique to predict the movie success class. This work employs and implements different ML-based classification models, such as Decision Tree (DT), Support Vector Machines with linear and quadratic kernels (L-SVM and Q-SVM), Logistic Regression (LR), Bagged Tree (BT), and Boosted Tree (BOT), to predict the quality of movies. Different performance measures are used to evaluate the proposed ML-based classification models, such as Accuracy (AC), Precision (PR), Recall (RE), and F-Measure (FM). The experimental results reveal that the BT and BOT classifiers produced high accuracy compared to the other classifiers, such as DT, LR, L-SVM, and Q-SVM. The BT and BOT classifiers achieved accuracies of 90.1% and 89.7%, which shows the efficiency of the proposed MQP model compared to other state-of-the-art techniques. The proposed work is also compared with existing prediction models, and the experimental results indicate that the proposed MQP model performs slightly better than other models. The experimental results will help the movie industry to allocate business resources effectively, such as investment, number of screens, and release date planning.
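A minimal sketch of the two experimental settings under assumed data: train ensemble classifiers on the original hybrid features and on a PCA-reduced feature subset, then compare accuracy. The feature values, class labels, and number of principal components are placeholders, not the study's data.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import BaggingClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 25))                 # pre-release + social media features
y = (X[:, 0] + 0.5 * X[:, 5] > 0).astype(int)  # placeholder success class

models = {"Bagged Tree": BaggingClassifier(random_state=0),
          "Boosted Tree": GradientBoostingClassifier(random_state=0)}
X_pca = PCA(n_components=10).fit_transform(X)  # setting (ii): PCA-based subset

for name, model in models.items():
    full = cross_val_score(model, X, y, cv=5).mean()
    reduced = cross_val_score(model, X_pca, y, cv=5).mean()
    print(f"{name}: original features={full:.3f}  PCA subset={reduced:.3f}")
```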


Author(s):  
Xi Liu ◽  
Yongfeng Yin ◽  
Haifeng Li ◽  
Jiabin Chen ◽  
Chang Liu ◽  
...  

Existing intelligent software defect classification approaches do not consider radar characteristics and prior statistical information. Thus, when applying these approaches to radar software testing and validation, the precision and recall of defect classification are poor, which affects the reuse effectiveness of software defects. To solve this problem, a new intelligent defect classification approach based on the latent Dirichlet allocation (LDA) topic model is proposed for radar software in this paper. The proposed approach includes a defect text segmentation algorithm based on a radar-domain dictionary, a modified LDA model incorporating radar software requirements, and a topic acquisition and classification approach for radar software defects based on the modified LDA model. The proposed approach is applied to typical radar software defects to validate its effectiveness and applicability. The application results illustrate that the prediction precision and recall of the proposed approach are improved by up to 15-20% compared with other defect classification approaches. Thus, the proposed approach can be applied to the segmentation and classification of radar software defects effectively, improving the adequacy of defect identification in radar software.
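A minimal sketch of a generic LDA pipeline, not the modified radar-domain model: vectorise defect report text, fit an LDA topic model, and assign each report to its dominant topic. The toy reports and topic count are illustrative assumptions.

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Placeholder defect reports standing in for segmented radar defect text.
reports = [
    "timing error in signal processing module during pulse compression",
    "interface fault when tracking data exceeds buffer size",
    "pulse compression output overflow under high clutter",
    "tracking filter diverges after interface timeout",
]

counts = CountVectorizer().fit_transform(reports)
lda = LatentDirichletAllocation(n_components=2, random_state=0)
topic_dist = lda.fit_transform(counts)

# Each defect report is assigned to the topic with the highest probability.
print("dominant topic per report:", topic_dist.argmax(axis=1))
```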


Sensors ◽  
2019 ◽  
Vol 19 (20) ◽  
pp. 4454 ◽  
Author(s):  
Marek Piorecky ◽  
Vlastimil Koudelka ◽  
Jan Strobl ◽  
Martin Brunovsky ◽  
Vladimir Krajca

Simultaneous recordings of electroencephalogram (EEG) and functional magnetic resonance imaging (fMRI) are at the forefront of technologies of interest to physicians and scientists because they combine the benefits of both modalities: better time resolution (hdEEG) and better spatial resolution (fMRI). However, EEG measurements in the scanner are contaminated by an electromagnetic field induced in the leads as a result of gradient switching, slight head movements, and vibrations, and they are corrupted by changes in the measured potential due to the Hall phenomenon. The aim of this study is to design and test a methodology for inspecting hidden EEG structures with respect to artifacts. We propose a top-down strategy to obtain additional information that is not visible in a single recording. The time-domain independent component analysis algorithm was employed to obtain independent components and spatial weights. A nonlinear dimension reduction technique, t-distributed stochastic neighbor embedding (t-SNE), was used to create a low-dimensional space, which was then partitioned using density-based spatial clustering of applications with noise (DBSCAN). The relationships between the found data structure and the used criteria were investigated. As a result, we were able to extract information from the data structure regarding electrooculographic, electrocardiographic, electromyographic and gradient artifacts. This new methodology could facilitate the identification of artifacts and their residues in simultaneous EEG-fMRI recordings.
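A minimal sketch of the processing chain under assumed shapes and parameters: decompose multichannel EEG with ICA, describe each component with a few simple features, embed the features with t-SNE, and partition the embedding with DBSCAN. Real EEG-fMRI data and the clustering criteria of the study are not reproduced here.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.decomposition import FastICA
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
eeg = rng.normal(size=(32, 5000))   # placeholder: 32 channels x time samples

ica = FastICA(n_components=20, random_state=0)
sources = ica.fit_transform(eeg.T)  # independent components over time

# Simple per-component features before embedding (assumed, not the study's).
features = np.column_stack([
    sources.std(axis=0),                                    # amplitude spread
    np.abs(sources).max(axis=0),                            # peak amplitude
    (np.diff(np.sign(sources), axis=0) != 0).mean(axis=0),  # zero-crossing rate
])

embedded = TSNE(n_components=2, perplexity=5, random_state=0).fit_transform(features)
labels = DBSCAN(eps=2.0, min_samples=2).fit_predict(embedded)
print("cluster label per component:", labels)
```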


2021 ◽  
Vol 2021 (29) ◽  
pp. 136-140
Author(s):  
Dorukalp Durmus

The quality of building electric lighting systems can be assessed using color rendition metrics. However, color rendition metrics are limited in quantifying tunable solid-state light sources, since tunable lighting systems can generate a vast number of different white light spectra, providing flexibility in terms of color quality and energy efficiency. Previous research suggests that color rendition is multi-dimensional in nature, and it cannot be simplified to a single number. Color shifts under a test light source in comparison to a reference illuminant, changes in color gamut, and color discrimination are important dimensions of the quality of electric light sources, which are not captured by a single-numbered metric. To address the challenges in color rendition characterization of modern solid-state light sources, the development of a multi-dimensional color rendition space is proposed. The proposed continuous measure can quantify the change in color rendition ability of tunable solid-state light devices with caveats. Future work, discretization of the continuous color rendition space, will be carried out to address the shortcomings of a continuous three-dimensional space.

