Searching Repositories of Web Application Models

  • Alessandro Bozzon
  • Marco Brambilla
  • Piero Fraternali
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6189)


Project repositories are a central asset in software development, as they preserve the technical knowledge gathered in past development activities. However, locating relevant information in a vast project repository is problematic, because it requires manually tagging projects with accurate metadata, an activity which is time consuming and prone to errors and omissions. This paper investigates the use of classical Information Retrieval techniques for easing the discovery of useful information from past projects. Differently from approaches based on textual search over the source code of applications or on querying structured metadata, we propose to index and search the models of applications, which are available in companies applying Model-Driven Engineering practices. We contrast alternative index structures and result presentations, and evaluate a prototype implementation on real-world experimental data.


Business Process Index Structure Query Language Index Size Content Processing 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Acerbis, R., Bongio, A., Brambilla, M., Butti, S.: Webratio 5: An eclipse-based case tool for engineering web applications. In: Baresi, L., Fraternali, P., Houben, G.-J. (eds.) ICWE 2007. LNCS, vol. 4607, pp. 501–505. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  2. 2.
    Antoniol, G., Canfora, G., de Lucia, A., Casazza, G.: Information retrieval models for recovering traceability links between code and documentation. In: IEEE International Conference on Software Maintenance, p. 40 (2000)Google Scholar
  3. 3.
    Awad, A., Polyvyanyy, A., Weske, M.: Semantic querying of business process models. In: Enterprise Distributed Object Computing Conference (EDOC), pp. 85–94 (2008)Google Scholar
  4. 4.
    Bajracharya, S., Ossher, J., Lopes, C.: Sourcerer: An internet-scale software repository. In: ICSE Workshop on Search-Driven Development-Users, Infrastructure, Tools and Evaluation. SUITE ’09, pp. 1–4 (May 2009)Google Scholar
  5. 5.
    Beeri, C., Eyal, A., Kamenkovich, S., Milo, T.: Querying business processes. In: VLDB, pp. 343–354. ACM, New York (2006)Google Scholar
  6. 6.
    Belhajjame, K., Brambilla, M.: Ontology-based description and discovery of business processes. In: Interval Mathematics. LNBIP, vol. 29. Springer, Heidelberg (2009)Google Scholar
  7. 7.
    Ben Khalifa, H., Khayati, O., Ghezala, H.: A behavioral and structural components retrieval technique for software reuse. In: Advanced Software Engineering and Its Applications. ASEA 2008, pp. 134–137 (December 2008)Google Scholar
  8. 8.
    Ceri, S., Fraternali, P., Bongio, A., Brambilla, M., Comai, S., Matera, M.: Designing Data-Intensive Web Applications. Morgan Kaufmann Publishers Inc., San Francisco (2002)Google Scholar
  9. 9.
    Chen, K., Madhavan, J., Halevy, A.: Exploring schema repositories with schemr. In: SIGMOD ’09: Proc. of the 35th SIGMOD Int. Conf. on Management of data, New York, NY, USA, pp. 1095–1098. ACM, New York (2009)CrossRefGoogle Scholar
  10. 10.
    Frakes, W.B., Nejmeh, B.A.: Software reuse through information retrieval. SIGIR Forum 21(1-2), 30–36 (1987)CrossRefGoogle Scholar
  11. 11.
    Gibb, F., McCartan, C., O’Donnell, R., Sweeney, N., Leon, R.: The integration of information retrieval techniques within a software reuse environment. Journal of Information Science 26(4), 211–226 (2000)CrossRefGoogle Scholar
  12. 12.
    Goderis, A., Li, P., Goble, C.A.: Workflow discovery: the problem, a case study from e-science and a graph-based solution. In: ICWS, pp. 312–319. IEEE Computer Society, Los Alamitos (2006)Google Scholar
  13. 13.
    Gomes, P., Pereira, F.C., Paiva, P., Seco, N., Carreiro, P., Ferreira, J.L., BentoI, C.: Using wordnet for case-based retrieval of uml models. AI Communications 17(1), 13–23 (2004)zbMATHMathSciNetGoogle Scholar
  14. 14.
    Holmes, R., Murphy, G.C.: Using structural context to recommend source code examples. In: ICSE ’05: Proceedings of the 27th international conference on Software engineering, pp. 117–125. ACM, New York (2005)Google Scholar
  15. 15.
    Inoue, K., Yokomori, R., Yamamoto, T., Matsushita, M., Kusumoto, S.: Ranking significance of software components based on use relations. IEEE Transactions on Software Engineering 31(3), 213–225 (2005)CrossRefGoogle Scholar
  16. 16.
    Kiefer, C., Bernstein, A., Lee, H.J., Klein, M., Stocker, M.: Semantic process retrieval with iSPARQL. In: Franconi, E., Kifer, M., May, W. (eds.) ESWC 2007. LNCS, vol. 4519, pp. 609–623. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  17. 17.
    Llorens, J., Fuentes, J.M., Morato, J.: Uml retrieval and reuse using xmi. In: IASTED Software Engineering. Acta Press (2004)Google Scholar
  18. 18.
    Lu, R., Sadiq, S.: Managing process variants as an information resource. In: Dustdar, S., Fiadeiro, J.L., Sheth, A.P. (eds.) BPM 2006. LNCS, vol. 4102, pp. 426–431. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  19. 19.
    Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (July 2008)zbMATHGoogle Scholar
  20. 20.
    Markovic, I., Pereira, A.C., Stojanovic, N.: A framework for querying in business process modelling. In: Multikonferenz Wirtschaftsinformatik (February 2008)Google Scholar
  21. 21.
    Moreno, N., Fraternali, P., Vallecillo, A.: Webml modelling in uml. IET Software 1(3), 67–80 (2007)CrossRefGoogle Scholar
  22. 22.
    Platzer, C., Dustdar, S.: A vector space search engine forweb services. In: ECOWS ’05: Proceedings of the Third European Conference on Web Services, Washington, DC, USA, pp. 62–71. IEEE Computer Society, Los Alamitos (2005)Google Scholar
  23. 23.
    Seacord, R.C., Hissam, S.A., Wallnau, K.C.: Agora: A search engine for software components. IEEE Internet Computing 2(6), 62–70 (1998)CrossRefGoogle Scholar
  24. 24.
    Settimi, R., Cleland-Huang, J., Ben Khadra, O., Mody, J., Lukasik, W., DePalma, C.: Supporting software evolution through dynamically retrieving traces to uml artifacts. In: Proceedings of 7th International Workshop on Principles of Software Evolution, pp. 49–54 (2004)Google Scholar
  25. 25.
    Shao, Q., Sun, P., Chen, Y.: Wise: A workflow information search engine. In: IEEE 25th International Conference on Data Engineering. ICDE ’09, pp. 1491–1494 (2009)Google Scholar
  26. 26.
    Zhuge, H.: A process matching approach for flexible workflow process reuse. Information & Software Technology 44(8), 445–450 (2002)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Alessandro Bozzon
    • 1
  • Marco Brambilla
    • 1
  • Piero Fraternali
    • 1
  1. 1.Dipartimento di Elettronica e InformazionePolitecnico di MilanoMilanoItaly

Personalised recommendations