Understanding Project Data

Author(s):  
Emilia Mendes

The objective of this chapter is to provide an introduction to statistical techniques and concepts that are frequently used when dealing with data for effort estimation. The concepts presented here are in no way exhaustive since statistics comprises a very large body of knowledge where entire books are devoted to specific topics. The parts that are the focus of this chapter are those that are necessary to use when building effort estimation models, and also when comparing different effort estimation techniques.

This book is the product of a two-year research programme entitled Restarting European Long-Term Investment Finance (RELTIF), organized by Assonime and the Centre for Economic Policy Research (CEPR) in London. The programme brought together leading researchers from across the world to consider the causes of the persistently low level of investment in Europe, to examine the extent to which the financial system was a contributory factor and to identify possible policy remedies for it. It considered the relation of finance to corporate sector investment, the lending behaviour of banks, the provision of equity financing, the role of public sector institutions, regulation, and taxation. The chapters in this volume provide one of the most comprehensive and thorough analyses of any financial system that has been undertaken to date. They reflect a large body of research using new and existing data sets, employing advanced empirical tools, and exploiting the unique insights provided by the tumultuous events of the financial and sovereign debt crises. Together they comprise an exceptional body of knowledge to advance academic thinking and guide policy formulation in the future.


2015 ◽  
Vol 47 ◽  
pp. 1-14 ◽  
Author(s):  
Pablo Pytel ◽  
Alejandro Hossian ◽  
Paola Britos ◽  
Ramón García-Martínez

2021 ◽  
Vol 14 (7) ◽  
pp. 1159-1165
Author(s):  
Immanuel Trummer

A large body of knowledge on database tuning is available in the form of natural language text. We propose to leverage natural language processing (NLP) to make that knowledge accessible to automated tuning tools. We describe multiple avenues to exploit NLP for database tuning, and outline associated challenges and opportunities. As a proof of concept, we describe a simple prototype system that exploits recent NLP advances to mine tuning hints from Web documents. We show that mined tuning hints improve performance of MySQL and Postgres on TPC-H, compared to the default configuration.


Author(s):  
Scott Henderson

The epidemiology of psychiatric disorders has shown the extent to which these are present in all human populations. It has also provided a large body of knowledge about aetiology. Now that biological including genetic information can be added to data on environmental exposures, opportunities for further advancement carry much promise.


2015 ◽  
Vol 6 (4) ◽  
pp. 39-68 ◽  
Author(s):  
Maryam Hassani Saadi ◽  
Vahid Khatibi Bardsiri ◽  
Fahimeh Ziaaddini

One of the major activities in effective and efficient production of software projects is the precise estimation of software development effort. Estimation of the effort in primary steps of software development is one of the most important challenges in managing software projects. Some reasons for these challenges such as: discordant software projects, the complexity of the manufacturing process, special role of human and high level of obscure and unusual features of software projects can be noted. Predicting the necessary efforts to develop software using meta-heuristic optimization algorithms has made significant progressions in this field. These algorithms have the potent to be used in estimation of the effort of the software. The necessity to increase estimation precision urged the authors to survey the efficiency of some meta-heuristic optimization algorithms and their effects on the software projects. To do so, in this paper, they investigated the effect of combining various optimization algorithms such as genetic algorithm, particle swarm optimization algorithm and ant colony algorithm on different models such as COCOMO, estimation based on analogy, machine learning methods and standard estimation models. These models have employed various data sets to evaluate the results such as COCOMO, Desharnais, NASA, Kemerer, CF, DPS, ISBSG and Koten & Gary. The results of this survey can be used by researchers as a primary reference.


Author(s):  
Emilia Mendes

Although numerous studies on Web effort estimation have been carried out to date, there is no consensus on what constitutes the best effort estimation technique to be used by Web companies. It seems that not only the effort estimation technique itself can influence the accuracy of predictions, but also the characteristics of the data set used (e.g., skewness, collinearity; Shepperd & Kadoda, 2001). Therefore, it is often necessary to compare different effort estimation techniques, looking for those that provide the best estimation accuracy for the data set being employed. With this in mind, the use of graphical aids such as boxplots is not always enough to assess the existence of significant differences between effort prediction models. The same applies to measures of prediction accuracy such as the mean magnitude of relative error (MMRE), median magnitude of relative error (MdMRE), and prediction at level l (Pred[25]). Other techniques, which correspond to the group of statistical significance tests, need to be employed to check if the different residuals obtained for each of the effort estimation techniques compared come from the same population. This chapter details how to use such techniques and how their results should be interpreted.


Sign in / Sign up

Export Citation Format

Share Document