Skip to main content
Log in

Insightful analogy-based software development effort estimation through selective classification and localization

  • Original Paper
  • Published:
Innovations in Systems and Software Engineering Aims and scope Submit manuscript

Abstract

Accurate development effort estimation is a challenging issue in the management of software projects because it can considerably affect the planning and scheduling of a software project. Over the past few years, many algorithmic and non-algorithmic methods have been proposed to estimate the development effort in the early stages of project. Due to simplicity and estimation capability, analogy-based estimation (ABE) method has been widely accepted by researchers in this area. In spite of the fact that ABE is an efficient estimation method, it suffers from the non-normality and heterogeneous nature of software project datasets. Although prior studies have strived to remedy this issue by weighting, soft computing, and clustering techniques, the estimate accuracy is still not convincing and attempts are ongoing to reach more reliable estimates. The problem is that prior ABE-based studies have not considered the nature of software projects in the estimation process. This paper aims to show the effect of selective project classification and estimation process localization on the performance of ABE. An exhaustive investigation is conducted based on different development types, organization types, and development platforms as three underlying attributes in software projects. An evaluation framework is designed to reveal the ABE performance when it is combined with the proposed classification. A real dataset that includes 448 software projects is utilized for the evaluation purposes. The promising results showed that the estimate accuracy is significantly improved and the estimation process is considerably expedited if the nature of software projects is considered in the ABE method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Albrecht AJ, Gaffney JA (1983) Software function, source lines of codes, and development effort prediction: a software science validation. IEEE Trans Softw Eng SE 9(6):639–648

    Article  Google Scholar 

  2. Angelis L, Stamelos I (2000) A simulation tool for efficient analogy based cost estimation. Empir Softw Eng 5:35–68

    Article  Google Scholar 

  3. Aroba J, Cuadrado-Gallego JJ, Sicilia M-A, Ramos I, García-Barriocanal E (2008) Segmented software cost estimation models based on fuzzy clustering. J Syst Softw 81:1944–1950

    Article  Google Scholar 

  4. Azzeh M (2012) A replicated assessment and comparison of adaptation techniques for analogy-based effort estimation. Empir Softw Eng 17:90–127

    Article  Google Scholar 

  5. Azzeh M, Neagu D, Cowling P (2010) Fuzzy grey relational analysis for software effort estimation. Empir Softw Eng 15:60–90

    Article  Google Scholar 

  6. Bajwa SS (2009) Investigating the nature of relationship between software size and development effort. Blekinge Institute of Technology

  7. Bettenburg N, Nagappan M, Hassan AE (2012) Think locally, act globally: improving defect and effort prediction models. In: 9th Working Conference on Mining Software Repositories. Zurich, Switzerland, pp 60–69

  8. Chiu NH, Huang SJ (2007) The adjusted analogy-based software effort estimation based on similarity distances. J Syst Softw 80:628–640

    Article  Google Scholar 

  9. Conte SD, Dunsmore HE, Shen VY (1986) Software engineering metrics and models. Benjamin-Cummings Publishing Co., Inc, Menlo Park

    Google Scholar 

  10. Cuadrado-Gallego JJ, Sicilia MA, Garre M, Rodrguez D (2006) An empirical study of process-related attributes in segmented software cost-estimation relationships. J Syst Softw 79:353–361

    Article  Google Scholar 

  11. Cuadrado JJ, Gallego M, Sicilia A (2007) An algorithm for the generation of segmented parametric software estimation models and its empirical evaluation. Comput Inform 26:1–15

    MATH  Google Scholar 

  12. Dolado J (2001) On the problem of the software cost function. Inf Softw Technol 43:61–72

    Article  Google Scholar 

  13. Hsu CJ, Huang CY (2011) Comparison of weighted grey relational analysis for software effort estimation. Softw Qual J 19:165–200

    Article  Google Scholar 

  14. Huang SJ, Chiu NH (2006) Optimization of analogy weights by genetic algorithm for software effort estimation. Inf Softw Technol 48:1034–1045

    Article  Google Scholar 

  15. ISBSG (2011) International Software Benchmarking standard Group. In

  16. Jianfeng W, Shixian L, Linyan T (2009) Improve analogy-based software effort estimation using principal components analysis and correlation weighting. In: International Conference on Software Engineering. Sun Yat-sen Univ., Guangzhou, China, pp 179–186

  17. Jingzhou L, Guenther R (2008) Analysis of attribute weighting heuristics for analogy-based software effort estimation method. Empir Softw Eng 13:63–96

    Article  Google Scholar 

  18. Kadoda G, Cartwright M, Chen L, Shepperd M (2000) Experiences using case-based reasoning to predict software project effort. In: International Conference on Empirical Assessment and Evaluation in Software Engineering. KEELE University

  19. Keung J, Kocaguneli E, Menzies T (2013) Finding conclusion stability for selecting the best effort predictor in software effort estimation. Autom Softw Eng 20:543–567

  20. Keung JW, Kitchenham BA, Jeffery DR (2008) Analogy-X: providing statistical inference to analogy-based software cost estimation. IEEE Trans Softw Eng 34:471–484

    Article  Google Scholar 

  21. Bardsiri VK, Jawawi DNA, Hashim SZM, Khatibi E (2012a) Increasing the accuracy of software development effort estimation using projects clustering. IET Softw 6:461–473

    Article  Google Scholar 

  22. Bardsiri Vk, Jawawi DNA, Hashim SZM, Khatibi E (2013) A PSO-based model to increase the accuracy of software development effort estimation. Softw Qual J 21:501–526. doi:10.1007/s11219-012-9183-x

  23. Kocaguneli E, Menzies T, Bener A, Keung JW (2012) Exploiting the essential assumptions of analogy-based effort estimation. IEEE Trans Softw Eng 38:425–438

    Article  Google Scholar 

  24. Lafferty MT (2010) Software effort estimation accuracy: a comparative study of estimaion based on software sizing and development methods. Capella University, Minneapolis

    Google Scholar 

  25. Li J, Ruhe G (2008) Analysis of attribute weighting heuristics for analogy-based software effort estimation method AQUA+. Empir Softw Eng 13:63–96

    Article  Google Scholar 

  26. Li JZ, Ruhe G (2008) Software effort estimation by analogy using attribute selection based on rough set analysis. Int J Softw Eng Knowl Eng 18:1–23

    Article  Google Scholar 

  27. Li YF, Xie M, Goh TN (2007) A study of genetic algorithm for project selection for analogy based software cost estimation. In: International Conference on Industrial Engineering and Engineering Management. Singapore, pp 1256–1260

  28. Li YF, Xie M, Goh TN (2009a) A study of the non-linear adjustment for analogy based software cost estimation. Empir Softw Eng 14:603–643

    Article  Google Scholar 

  29. Li YF, Xie M, Goh TN (2009b) A study of project selection and feature weighting for analogy based software cost estimation. J Syst Softw 82:241–252

    Article  Google Scholar 

  30. Lin J-C, Tzeng H-Y, Lin Y-T (2012) Automatically estimating software effort and cost using computing intelligence technique. Comput Sci Inf Technol 2:381–392

    Google Scholar 

  31. Mendes E, Watson I, Triggs C, Mosley N, Counsell S (2003) A comparative study of cost estimation models for web hypermedia applications. Empir Softw Eng 8:163–196

    Article  Google Scholar 

  32. Menzies T, Butcher A, Marcus A, Zimmermann T, Cok D (2011) Local vs. global models for effort estimation and defect prediction. In: 26th IEEE/ACM International Conference on Automated Software Engineering. IEEE Computer Society, Lawrence, KS, pp 343–351

  33. Milios D, Stamelos I, Chatzibagias C (2011) Global optimization of analogy-based software cost estimation with genetic algorithms artificial intelligence applications and innovations. In: Iliadis L, Maglogiannis I, Papadopoulos H (eds), vol 364. Springer Boston, pp 350–359

  34. Minku LL, Yao X (2012) Ensembles and locality, insight on improving software effort estimation. Information and Software Technology

  35. Mittas N, Angelis L (2010) Lseba: least squares regression and estimation by analogy. In: A Semi-Parametric Model for Software Cost Estimation. Empirical Software Engineering, vol 15, pp 523–555

  36. Posnett D, Filkov V, Devanbu P (2011) Ecological inference in empirical software engineering. In: 26th IEEE/ACM International Conference on Automated Software Engineering. IEEE Computer Society, pp 362–371

  37. Rodrguez D, Cuadrado JJ, Sicilia MA, Ruiz R (2006) Segmentation of Software Engineering Datasets Using the M5 Algorithm. In: 6th international conference on Computational Science—vol Part IV. Springer, Reading, UK, pp 789–796

  38. Seo Y-S, Bae D-H (2013) On the value of outlier elimination on software effort estimation research. Empir Softw Eng 18:659–698

  39. Shepperd M, Schofield C (1997) Estimating software project effort using analogies. IEEE Trans Softw Eng 23:736–743

    Article  Google Scholar 

  40. Song Q, Shepperd M (2011) Predicting software project effort: a grey relational analysis based method. Expert Syst Appl 38:7302–7316

  41. Walkerden F, Jeffery R (1997) Software cost estimation: a review of models, process, and practice. In: Marvin VZ (ed) Advances in Computers, vol 44. Elsevier, pp 59–125

  42. Walkerden F, Jeffery R (1999) An empirical study of analogy-based software effort estimation. Empir Softw Eng 4:135–158

    Article  Google Scholar 

  43. Wu D, Li J, Liang Y (2013) Linear combination of multiple case-based reasoning with optimized weight for software effort estimation. J Supercomput 64:898–918

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vahid Khatibi Bardsiri.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khatibi Bardsiri, V., Khatibi, E. Insightful analogy-based software development effort estimation through selective classification and localization. Innovations Syst Softw Eng 11, 25–38 (2015). https://doi.org/10.1007/s11334-014-0242-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11334-014-0242-2

Keywords

Navigation