On Deriving Actions for Improving Cost Overrun by Applying Association Rule Mining to Industrial Project Repository

  • Junya Debari
  • Osamu Mizuno
  • Tohru Kikuno
  • Nahomi Kikuchi
  • Masayuki Hirayama
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5007)


For software project management, it is very important to identify risk factors which make project into runaway. In this study, we propose a method to extract improvement action items for a software project by applying association rule mining to the software project repository for a metric of “cost overrun”. We first mine a number of association rules affecting cost overrun. We then group compatible rules, which include several common metrics having different values, from the mined rules and extract improvement action items of project improvement. In order to evaluate the applicability of our method, we applied our method to the project data repository collected from plural companies in Japan. The result of experiment showed that project improvement actions for cost overrun were semi-automatically extracted from the mined association rules. We can confirm feasibility of our method by comparing these actions with the results in the previous research.


association rule mining project improvement actions cost overrun 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Takagi, Y., Mizuno, O., Kikuno, T.: An empirical approach to characterizing risky software projects based on logistic regression analysis. Empirical Software Engineering 10(4), 495–515 (2005)CrossRefGoogle Scholar
  2. 2.
    Amasaki, S., Hamano, Y., Mizuno, O., Kikuno, T.: Characterization of runaway software projects using association rule mining. In: Münch, J., Vierimaa, M. (eds.) PROFES 2006. LNCS, vol. 4034, pp. 402–407. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  3. 3.
    Software Engineering Center, Information-technology Promotion Agency (ed.): The 2006 White Paper on Software Development Projects (in Japanese). Nikkei Business Publications (2006)Google Scholar
  4. 4.
    Masticola, S.P.: A simple estimate of the cost of software project failures and the breakeven effectiveness of project risk management. In: Proceedings of the First International Workshop on The Economics of Software and Computation (ESC 2007), p. 6 (2007)Google Scholar
  5. 5.
    Lederer, A.L., Prasad, J.: Nine management guidelines for better cost estimating. Commun. ACM 35(2), 51–59 (1992)CrossRefGoogle Scholar
  6. 6.
    Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2001)Google Scholar
  7. 7.
    Michail, A.: Data mining library reuse patterns using generalized association rules. In: Proceedings of the 22nd international conference on Software engineering, pp. 167–176 (2000)Google Scholar
  8. 8.
    Zimmermann, T., Weißgerber, P., Diehl, S., Zeller, A.: Mining version histories to guide software change. IEEE Trans. on Software Engineering 31(6), 429–445 (2005)CrossRefGoogle Scholar
  9. 9.
    García, M.N.M., Román, I.R., Penalvo, F.J.G., Bonilla, M.T.: An association rule mining method for estimating the impact of project management policies on software quality, development time and effort. Expert Syst. Appl. 34(1), 522–529 (2008)CrossRefGoogle Scholar
  10. 10.
    Chawla, S., Arunasalam, B., Davis, J.: Mining open source software (oss) data using association rules network. In: Whang, K.-Y., Jeon, J., Shim, K., Srivastava, J. (eds.) PAKDD 2003. LNCS (LNAI), vol. 2637, pp. 461–466. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  11. 11.
    Mitani, Y., Kikuchi, N., Matsumura, T., Ohsugi, N., Monden, A., Higo, Y., Inoue, K., Barker, M., Matsumoto, K.: A proposal for analysis and prediction for software projects using collaborative filtering, in-process measurements and a benchmarks database. In: Proc. of International Conference on Software Process and Product Measurement, pp. 98–107 (2006)Google Scholar
  12. 12.
    Ohsugi, N., Monden, A., Kikuchi, N., Barker, M.D., Tsunoda, M., Kakimoto, T., Matsumoto, K.: Is this cost estimate reliable? – the relationship between homogeneity of analogues and estimation reliability. In: Proc. of First International Symposium on Empirical Software Engineering and Measurement, pp. 384–392 (2007)Google Scholar
  13. 13.
    Furuyama, T., Kikuchi, N., Yasuda, M., Tsuruho, M.: Analysis of the factors that affect the performance of software projects (in Japanese). Trans. of Information Processing Society of Japan 48(8), 2608–2619 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Junya Debari
    • 1
  • Osamu Mizuno
    • 1
  • Tohru Kikuno
    • 1
  • Nahomi Kikuchi
    • 2
  • Masayuki Hirayama
    • 2
  1. 1.Graduate School of Information Science and TechnologyOsaka UniversityOsakaJapan
  2. 2.Software Engineering CenterInformation-technology Promotion AgencyTokyoJapan

Personalised recommendations