Software Project Estimation with Machine Learning

Software Project Estimation is a challenging and important activity in developing software projects. Software Project Estimation includes Software Time Estimation, Software Resource Estimation, Software Cost Estimation, and Software Effort Estimation. Software Effort Estimation focuses on predicting the number of hours of work (effort in terms of person-hours or person-months) required to develop or maintain a software application. It is difficult to forecast effort during the initial stages of software development. Various machine learning and deep learning models have been developed to predict the effort estimation. In this paper, single model approaches and ensemble approaches were considered for estimation. Ensemble techniques are the combination of several single models. Ensemble techniques considered for estimation were averaging, weighted averaging, bagging, boosting, and stacking. Various stacking models considered and evaluated were stacking using a generalized linear model, stacking using decision tree, stacking using a support vector machine, and stacking using random forest. Datasets considered for estimation were Albrecht, China, Desharnais, Kemerer, Kitchenham, Maxwell, and Cocomo81. Evaluation measures used were mean absolute error, root mean squared error, and R-squared. The results proved that the proposed stacking using random forest provides the best results compared with single model approaches using the machine or deep learning algorithms and other ensemble techniques.

Download Full-text

Software Project Estimation Using Improved Use Case Point

2018 IEEE 16th International Conference on Software Engineering Research, Management and Applications (SERA) ◽

10.1109/sera.2018.8477225 ◽

2018 ◽

Cited By ~ 3

Author(s):

Sima Bagheri ◽

Alireza Shameli-Sendi

Keyword(s):

Software Project ◽

Use Case ◽

Project Estimation

Download Full-text

Machine Learning Techniques to Predict Software Defect

Encyclopedia of Business Analytics and Optimization ◽

10.4018/978-1-4666-5202-6.ch129 ◽

2014 ◽

pp. 1422-1434 ◽

Cited By ~ 1

Author(s):

Ramakanta Mohanty ◽

Vadlamani Ravi

Keyword(s):

Machine Learning ◽

Feature Subset Selection ◽

Machine Learning Techniques ◽

Group Method ◽

Software Project ◽

Feature Subset ◽

Software Defects ◽

Software Defect ◽

Learning Techniques ◽

Sensitivity Specificity

The past 10 years have seen the prediction of software defects proposed by many researchers using various metrics based on measurable aspects of source code entities (e.g. methods, classes, files or modules) and the social structure of software project in an effort to predict the software defects. However, these metrics could not predict very high accuracies in terms of sensitivity, specificity and accuracy. In this chapter, we propose the use of machine learning techniques to predict software defects. The effectiveness of all these techniques is demonstrated on ten datasets taken from literature. Based on an experiment, it is observed that PNN outperformed all other techniques in terms of accuracy and sensitivity in all the software defects datasets followed by CART and Group Method of data handling. We also performed feature selection by t-statistics based approach for selecting feature subsets across different folds for a given technique and followed by the feature subset selection. By taking the most important variables, we invoked the classifiers again and observed that PNN outperformed other classifiers in terms of sensitivity and accuracy. Moreover, the set of ‘if- then rules yielded by J48 and CART can be used as an expert system for prediction of software defects.

Download Full-text

Early Software Project Estimation the Six Sigma Way

Lecture Notes in Business Information Processing - Agile Methods. Large-Scale Development, Refactoring, Testing, and Estimation ◽

10.1007/978-3-319-14358-3_16 ◽

2014 ◽

pp. 193-208 ◽

Cited By ~ 1

Author(s):

Thomas Michael Fehlmann ◽

Eberhard Kranich

Keyword(s):

Six Sigma ◽

Software Project ◽

Project Estimation

Download Full-text

Influence of end user development on software project estimation

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.13010 ◽

2018 ◽

Vol 7 (3) ◽

pp. 1812

Author(s):

Archana Srivastava ◽

Dr. K. Singh ◽

Dr Syed Qamar Abbas

Keyword(s):

User Satisfaction ◽

Point Method ◽

Development Effort ◽

Software Project ◽

Use Case ◽

Effort Estimation ◽

End User ◽

Software Development Effort ◽

Project Estimation

Use Case Point Method (UCP) is used to estimate software development effort. UCP uses a project’s use cases to produce a reasonable estimate of a project’s complexity and required man hours. Advance Use Case Point Method (AUCP) is an extension of UCP. AUCP extends UCP by adding the additional effort required in incorporating end user development (EUD) features in the software for overall project effort estimation. Today user needs are diverse, complex, and frequently changing hence need of EUD is also increasing. EUD features if incorporated in the software increases end user satisfaction exponentially but incorporating EUD features increases design time complexity and increases the effort significantly based on the end users requirements. This paper provides a case study to demonstrate the comparative analysis of UCP and AUCP using paired t-test. It also observes that there can be on an average 20% increase in overall effort of development on adding EUD features.

Download Full-text

Comparison of Machine Learning Algorithms for Software Project Time Prediction

International Journal of Multimedia and Ubiquitous Engineering ◽

10.14257/ijmue.2015.10.9.01 ◽

2015 ◽

Vol 10 (9) ◽

pp. 1-8 ◽

Cited By ~ 3

Author(s):

Wan Jiang Han ◽

Li Xin Jiang ◽

Tian Bo Lu ◽

Xiao Yan Zhang

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Software Project ◽

Time Prediction

Download Full-text

An effective approach for software project effort and duration estimation with machine learning algorithms

Journal of Systems and Software ◽

10.1016/j.jss.2017.11.066 ◽

2018 ◽

Vol 137 ◽

pp. 184-196 ◽

Cited By ~ 31

Author(s):

Przemyslaw Pospieszny ◽

Beata Czarnacka-Chrobot ◽

Andrzej Kobylinski

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Software Project ◽

Project Effort ◽

Duration Estimation

Download Full-text

A framework for software project estimation based on cosmic, dsm and rework characterization

Proceedings of the 13th international workshop on Software architectures and mobility - EA '08 ◽

10.1145/1370837.1370842 ◽

2008 ◽

Cited By ~ 3

Author(s):

Sharareh Afsharian ◽

Marco Giacomobono ◽

Paola Inverardi

Keyword(s):

Software Project ◽

Project Estimation

Download Full-text

Improving Credibility of Machine Learner Models in Software Engineering

Advances in Machine Learning Applications in Software Engineering ◽

10.4018/978-1-59140-941-1.ch003 ◽

2011 ◽

pp. 52-72 ◽

Cited By ~ 5

Author(s):

Gary D. Boetticher

Keyword(s):

Machine Learning ◽

Software Engineering ◽

Nearest Neighbor ◽

Project Managers ◽

Empirical Software Engineering ◽

Software Project ◽

Series Of Experiments ◽

Test Sets ◽

Fold Cross Validation ◽

Machine Learning Models

Given a choice, software project managers frequently prefer traditional methods of making decisions rather than relying on empirical software engineering (empirical/machine learning- based models). One reason for this choice is the perceived lack of credibility associated with these models. To promote better empirical software engineering, a series of experiments are conducted on various NASA datasets to demonstrate the importance of assessing the ease/difficulty of a modeling situation. Each dataset is divided into three groups, a training set, and “nice/nasty” neighbor test sets. Using a nearest neighbor approach, “nice neighbors” align closest to same class training instances. “Nasty neighbors” align to the opposite class training instances. The “nice”, “nasty” experiments average 94% and 20%accuracy, respectively. Another set of experiments show how a ten-fold cross-validation is not sufficient in characterizing a dataset. Finally, a set of metric equations is proposed for improving the credibility assessment of empirical/machine learning models.

Download Full-text

Software Parallel Processing in Pervasive Computing

Strategic Pervasive Computing Applications ◽

10.4018/978-1-61520-753-4.ch003 ◽

2011 ◽

pp. 56-66

Author(s):

Jitesh Dundas

Keyword(s):

Project Management ◽

Parallel Processing ◽

Pervasive Computing ◽

Propagation Speed ◽

Periodic Wave ◽

Software Project ◽

Software System ◽

Periodic Waves ◽

Worst Case ◽

Project Estimation

This chapter proposes the application of periodic wave concepts in management of software Parallel processing projects or processes. This chapter lays special emphasis on Runaway project, which create a lot of problems in Project Management for the stakeholders. This chapter proposes a new and dynamic way to control the software project estimation activity of Runaway project/processes, thereby reducing their future occurrences. This chapter also explains how the equations of unidirectional periodic Waves can be applied in software Parallel processing to measure quality and project execution in a dynamic way at any point in time. The concepts proposed here are dynamic unlike PERT/CPM and other metrics, which fail in worst-case scenarios. The Propagation Speed ‘C’ at any point in time of a stage or part of a software system executing in Parallel can be given by: C = H/ k (1). Where, H = length of the Wave (i.e. highest point), k = time taken in completing the stage.

Download Full-text