K-Means Clustering Algorithms to Compute Software Effort Estimation

2016 ◽  
Vol 13 (10) ◽  
pp. 7093-7098 ◽  
Author(s):  
Shivakumar Nagarajan ◽  
Balaji Narayanan

Software development effort estimation is the process of predicting the effort a project will require, with the aim of improving software economics. Accurate effort estimation is one of the most tedious tasks in software projects, even though several methods have been proposed for it. Imprecise estimation, often caused by uncertain data, can lead to project failure. In this paper, a hybrid model that combines Particle Swarm Optimization (PSO), K-means clustering, a neural network, and the analogy-based estimation (ABE) method is proposed. The proposed method is intended to produce better clustering and more accurate estimates in the presence of the clustering difficulties and outliers found in software project data. The results show improved clustering, which in turn yields more accurate estimates. The neural network and analogy methods are then applied, which significantly enhance accuracy.
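The clustering step named in this abstract can be illustrated with a minimal sketch. It is not the authors' hybrid model: PSO and the neural network are omitted, the seeding rule (first k points) is a simplification, and the project data, feature choice, and function names are hypothetical.

```python
from math import dist
from statistics import mean

def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; the first k points seed the centroids (an assumption)."""
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: attach each project to its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(mean(col) for col in zip(*members))
    return centroids, labels

def estimate_effort(new_project, efforts, centroids, labels):
    """Mean effort of the historical projects in the nearest cluster."""
    c = min(range(len(centroids)), key=lambda i: dist(new_project, centroids[i]))
    return mean(e for e, lab in zip(efforts, labels) if lab == c)

# Hypothetical history: (size in KLOC, team size) and effort in person-months.
projects = [(10, 3), (100, 20), (12, 4), (110, 22), (9, 3), (95, 18)]
efforts = [20, 300, 26, 340, 18, 280]

centroids, labels = kmeans(projects, k=2)
estimate = estimate_effort((11, 3), efforts, centroids, labels)
print(labels, round(estimate, 2))
```

Restricting the comparison to one cluster keeps very different projects, including outliers that end up in other clusters, from influencing the estimate.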

Author(s):  
Fatima Azzahra Amazal ◽  
Ali Idri ◽  
Alain Abran

Software effort estimation is one of the most important tasks in software project management. Of several techniques suggested for estimating software development effort, the analogy-based reasoning, or Case-Based Reasoning (CBR), approaches stand out as promising techniques. In this paper, the benefits of using linguistic rather than numerical values in the analogy process for software effort estimation are investigated. The performance, in terms of accuracy and tolerance of imprecision, of two analogy-based software effort estimation models (Classical Analogy and Fuzzy Analogy, which use numerical and linguistic values respectively to describe software projects) is compared. Three research questions related to the performance of these two models are discussed and answered. This study uses the International Software Benchmarking Standards Group (ISBSG) dataset and confirms the usefulness of using linguistic instead of numerical values in analogy-based software effort estimation models.
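For orientation, the classical (numerical) analogy process can be sketched as a nearest-neighbour lookup over normalized project attributes. This is a minimal illustration under stated assumptions, not the models compared in the paper: the attributes, the data, the min-max scaling, the Euclidean distance, and the choice of k = 2 are all hypothetical, and the fuzzy (linguistic) variant is not shown.

```python
from math import dist

def min_max_scaler(rows):
    """Build a function that scales each attribute to [0, 1] using the history's range."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [(max(c) - min(c)) or 1 for c in cols]  # guard against constant columns
    return lambda row: tuple((v - lo) / s for v, lo, s in zip(row, lows, spans))

def analogy_estimate(target, history, efforts, k=2):
    """Average the efforts of the k historical projects most similar to the target."""
    scale = min_max_scaler(history)
    scaled = [scale(row) for row in history]
    t = scale(target)
    nearest = sorted(range(len(history)), key=lambda i: dist(t, scaled[i]))[:k]
    return sum(efforts[i] for i in nearest) / k

# Hypothetical history: (function points, team size) and effort in person-hours.
history = [(100, 5), (200, 8), (400, 12), (120, 6)]
efforts = [1000, 2100, 4500, 1300]

estimate = analogy_estimate((110, 5), history, efforts)
print(estimate)
```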


2022 ◽  
pp. 165-193
Author(s):  
Kamlesh Dutta ◽  
Varun Gupta ◽  
Vachik S. Dave

Prediction of software development effort is a key task for the effective management of any software industry. The accuracy and reliability of the prediction mechanisms used to estimate software development effort are also important. A series of experiments is conducted to progress gradually towards more accurate estimation of software development effort. However, while conducting these experiments, it was found that the size of the training set was not sufficient to train a large and complex artificial neural network (ANN). To overcome the limited size of the available training data set, a novel multilayered architecture based on a neural network model is proposed. The accuracy of the proposed multilayered model is assessed using different criteria, which demonstrate the pre-eminence of the proposed model.


2019 ◽  
Vol 21 (2) ◽  
pp. 88-112
Author(s):  
Kamlesh Dutta ◽  
Varun Gupta ◽  
Vachik S. Dave

Prediction of software development effort is a key task for the effective management of any software industry. The accuracy and reliability of the prediction mechanisms used to estimate software development effort are also important. A series of experiments is conducted to progress gradually towards more accurate estimation of software development effort. However, while conducting these experiments, it was found that the size of the training set was not sufficient to train a large and complex artificial neural network (ANN). To overcome the limited size of the available training data set, a novel multilayered architecture based on a neural network model is proposed. The accuracy of the proposed multilayered model is assessed using different criteria, which demonstrate the pre-eminence of the proposed model.


2015 ◽  
Vol 6 (4) ◽  
pp. 39-68 ◽  
Author(s):  
Maryam Hassani Saadi ◽  
Vahid Khatibi Bardsiri ◽  
Fahimeh Ziaaddini

One of the major activities in the effective and efficient production of software projects is the precise estimation of software development effort. Estimating effort in the early stages of software development is one of the most important challenges in managing software projects. Reasons for this challenge include inconsistent software projects, the complexity of the production process, the special role of human factors, and the high level of obscure and unusual features of software projects. The use of meta-heuristic optimization algorithms to predict the effort needed to develop software has made significant progress in this field, and these algorithms have the potential to be used in software effort estimation. The need to increase estimation precision led the authors to survey the efficiency of some meta-heuristic optimization algorithms and their effects on software projects. To do so, in this paper, they investigated the effect of combining various optimization algorithms, such as the genetic algorithm, particle swarm optimization, and the ant colony algorithm, with different models, such as COCOMO, estimation based on analogy, machine learning methods, and standard estimation models. These models have been evaluated on various data sets, such as COCOMO, Desharnais, NASA, Kemerer, CF, DPS, ISBSG, and Koten & Gary. The results of this survey can be used by researchers as a primary reference.


2013 ◽  
Vol 2013 ◽  
pp. 1-21 ◽  
Author(s):  
Mahmoud O. Elish ◽  
Tarek Helmy ◽  
Muhammad Imtiaz Hussain

Accurate estimation of software development effort is essential for effective management and control of software development projects. Many software effort estimation methods have been proposed in the literature including computational intelligence models. However, none of the existing models proved to be suitable under all circumstances; that is, their performance varies from one dataset to another. The goal of an ensemble model is to manage each of its individual models’ strengths and weaknesses automatically, leading to the best possible decision being taken overall. In this paper, we have developed different homogeneous and heterogeneous ensembles of optimized hybrid computational intelligence models for software development effort estimation. Different linear and nonlinear combiners have been used to combine the base hybrid learners. We have conducted an empirical study to evaluate and compare the performance of these ensembles using five popular datasets. The results confirm that individual models are not reliable as their performance is inconsistent and unstable across different datasets. Although none of the ensemble models was consistently the best, many of them were frequently among the best models for each dataset. The homogeneous ensemble of support vector regression (SVR), with the nonlinear combiner adaptive neurofuzzy inference systems-subtractive clustering (ANFIS-SC), was the best model when considering the average rank of each model across the five datasets.
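The idea of combining base learners can be illustrated with the simplest linear combiners. This sketch only shows how several effort predictions might be merged; the base predictions and weights are hypothetical, and the SVR base learners and the nonlinear ANFIS-SC combiner used in the paper are not implemented here.

```python
from statistics import mean, median

def weighted_combiner(predictions, weights):
    """Linear combiner: weighted average of the base models' predictions."""
    return sum(w * p for w, p in zip(weights, predictions)) / sum(weights)

# Hypothetical effort predictions (person-months) from three base models.
base_predictions = [120.0, 150.0, 90.0]

avg = mean(base_predictions)       # simple average combiner
med = median(base_predictions)     # median combiner, less sensitive to one bad model
# Hypothetical weights, e.g. derived from each model's validation accuracy.
wavg = weighted_combiner(base_predictions, weights=[5, 3, 2])
print(avg, med, wavg)
```

A combiner of this kind lets a weak prediction from one model be offset by the others, which is the motivation the abstract gives for ensembles.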


Author(s):  
Lucas Pereira dos Santos ◽  
Maurício Ferreira

This paper provides a real example of applying COCOMO II as an estimation technique for the required software development effort in a safety-critical software application project following the DO-178C processes. The main goal and contribution of the case study is to support the research on software effort estimation and to provide software practitioners with useful data based on a real project. We applied the method as it is, by correlating the effort multiplier factors with the complexity and objectives introduced by the DO-178C level A application, resulting in an estimated effort. The rationales for each scale factor and effort multiplier selection were also described in detail. By comparing the estimated values with the actual required data, we found a magnitude of relative error (MRE) of 40% and provided alternatives for future work in order to increase the effort estimation accuracy in safety-critical software projects.
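As a rough illustration of the quantities mentioned here, the sketch below computes a COCOMO II style effort figure and the magnitude of relative error. It assumes the post-architecture form PM = A × Size^E × ∏EM with E = B + 0.01 × ΣSF and the COCOMO II.2000 constants A = 2.94 and B = 0.91; the size, scale factor values, effort multipliers, and actual effort are hypothetical and are not taken from the case study.

```python
from math import prod

def cocomo2_effort(ksloc, scale_factors, effort_multipliers, a=2.94, b=0.91):
    """Effort in person-months: PM = A * Size^E * product(EM), E = B + 0.01 * sum(SF)."""
    exponent = b + 0.01 * sum(scale_factors)
    return a * ksloc ** exponent * prod(effort_multipliers)

def mre(actual, estimated):
    """Magnitude of relative error: |actual - estimated| / actual."""
    return abs(actual - estimated) / actual

# Hypothetical inputs: 10 KSLOC, five scale factors summing to 9.0,
# and two non-nominal effort multipliers.
effort = cocomo2_effort(10, [2.0, 2.0, 2.0, 1.5, 1.5], [1.0, 1.5])

# An estimate of 60 person-months against an actual of 100 gives an MRE of 40%.
error = mre(actual=100, estimated=60)
print(round(effort, 1), f"{error:.0%}")
```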


2011 ◽  
Vol 7 (3) ◽  
pp. 41-53 ◽  
Author(s):  
Jeremiah D. Deng ◽  
Martin Purvis ◽  
Maryam Purvis

Software development effort estimation is important for quality management in the software development industry, yet its automation still remains a challenging issue. Applying machine learning algorithms alone often cannot achieve satisfactory results. This paper presents an integrated data mining framework that incorporates domain knowledge into a series of data analysis and modeling processes, including visualization, feature selection, and model validation. An empirical study on the software effort estimation problem using a benchmark dataset shows the necessity and effectiveness of the proposed approach.

