Skip to main content

Automating Search Strings for Secondary Studies

  • Conference paper
  • First Online:
Information Technology - New Generations

Abstract

Background: secondary studies (SSs), in the form of systematic literature reviews and systematic mappings, have become a widely used evidence-based methodology to to create a classification scheme and structure research fields, thus giving an overview of what has been done in a given research field.

Problem: often, the conduction of high-quality SSs is hampered by the difficulties that stem from creating a proper “search string”. Creating sound search strings entails an array of skills and domain knowledge. Search strings are ill-defined because of a number of reasons. Two common reasons are (i) insuffient domain knowledge and (ii) time and resource constraints. When ill-defined search strings are used to carry out SSs, a potentially high number of pertinent studies is likely to be left out of the analysis.

Method: to overcome this limitation we propose an approach that applies a search-based algorithm called Hill Climbing to automate this key step in the conduction of SSs: search string generation and calibration.

Results: we conducted an experiment to evaluate our approach in terms of sensibility and precision. The results would seem to suggest that the precision and the sensibility our approach are 25.2% and 96.2%, respectively.

Conclusion: The results were promising given that our approach was able to generate and calibrate suitable search strings to support researchers during the conduction of SSs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Control studies are papers that must be retrieved when search in a given database.

  2. 2.

    see: http://ieeexplore.ieee.org/

  3. 3.

    see: http://www-01.ibm.com/software/analytics/spss/

References

  1. Basili, V., Caldiera, G., & Rombach, H. (1994). The goal question metric paradigm (1st ed.). Wiley.

    Google Scholar 

  2. Blum, C., & Roli, A. (2003). Metaheuristics in combinatorial optimization: Overview and conceptual comparison. ACM Computing Surveys, 35(3), 268–308.

    Article  Google Scholar 

  3. Dieste, O., & Padua, A. (2007). Developing search strategies for detecting relevant experiments for systematic reviews. In ESEM 2007, Madrid (pp. 215–224).

    Google Scholar 

  4. Gay, G. (2010). A baseline method for search-based software engineering. In PROMISE 2010, PROMISE ’10, Timisoara (pp. 2:1–2:11). New York, NY: ACM.

    Google Scholar 

  5. Hannay, J. E., Dybå, T., Arisholm, E., & Sjøberg, D. I. K. (2009). The effectiveness of pair programming: A meta-analysis. Information and Software Technology, 51(7), 1110–1122.

    Google Scholar 

  6. Harman, M., McMinn, P., de Souza, J. T., & Yoo, S. (2012). Search based software engineering: Techniques, taxonomy, tutorial. Empirical software engineering and verification (pp. 1–59). Berlin/Heidelberg: Springer.

    Google Scholar 

  7. Haynes, R. B., Wilczynski, K. A., McKibbon, C. J., & Sinclair, J. C. (1994). Developing optimal search strategies for detecting clinically sound studies in medline. Journal of the American Medical Informatics Association, 1, 447–458.

    Google Scholar 

  8. Kitchenham, B. (2011). Chapter three – What we can learn from systematic reviews. In Making software what really works, and why we believe it (Vol. 1, 1st ed.). Gravenstein Highway North, Sebastopol, CA: O’Reilly Media

    Google Scholar 

  9. Kitchenham, B., Mendes, E., & Travassos, G. H. (2007). A systematic review of cross vs. within-company cost estimation studies. IEEE Transactions on Software Engineering, 33(5), 361–329.

    Google Scholar 

  10. Kitchenham, B. A., Budgen, D., & Pearl Brereton, O. (2011). Using mapping studies as the basis for further research – A participant-observer case study. Information and Software Technology, 53(6), 638–651.

    Google Scholar 

  11. Kitchenham, B. A., Dyba, T., & Jorgensen, M. (2004). Evidence-based software engineering. In ICSE 2004, ICSE ’04, Edinburgh (pp. 273–281). Washington, DC: IEEE Computer Society.

    Google Scholar 

  12. Russell, S. J., & Norvig, P. (2003). Artificial intelligence: A modern approach (2nd ed.). Upper Saddle River, N.J: Pearson Education.

    Google Scholar 

  13. Straus, S., & Richardson, W. (2010). Evidence-based medicine: How to practice and teach it (4th ed.). Edinburgh: Churchill Livingstone.

    Google Scholar 

  14. Zhang, H., Babar, M. A., & Tell, P. (2011). Identifying relevant studies in software engineering. Information and Software Technology, 53(6), 625–637.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francisco Carlos Souza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Souza, F.C., Santos, A., Andrade, S., Durelli, R., Durelli, V., Oliveira, R. (2018). Automating Search Strings for Secondary Studies. In: Latifi, S. (eds) Information Technology - New Generations. Advances in Intelligent Systems and Computing, vol 558. Springer, Cham. https://doi.org/10.1007/978-3-319-54978-1_104

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54978-1_104

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54977-4

  • Online ISBN: 978-3-319-54978-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics