Learning Ranking Functions by Genetic Programming Revisited

  • Ricardo Baeza-Yates
  • Alfredo CuzzocreaEmail author
  • Domenico Crea
  • Giovanni Lo Bianco
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11030)


We revisit the use of Genetic Programming (GP) to learn ranking functions in the context of web documents, by adding linking information. Our results show that GP can cope with larger sets of features as well as bigger document collections, obtaining small improvements over the state-of-the-art of GP learned functions applied to web search.


  1. 1.
    Almeida, H.M., Gonçalves, M.A., Cristo, M., Calado, P.: A combined component approach for finding collection-adapted ranking functions based on genetic programming. In: Proceedings of the 30th ACM SIGIR, pp. 399–406 (2007)Google Scholar
  2. 2.
    Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval: The Concepts and Technology Behind Search Engines, 2nd edn. Addison-Wesley, Boston (2011)Google Scholar
  3. 3.
    Banzhaf, W., Francone, F.D., Keller, R.E., Nordin, P.: Genetic Programming: An Introduction – On the Automatic Evolution of Computer Programs and Its Applications. Morgan Kaufmann Publishers, Burlington (1998)Google Scholar
  4. 4.
    Bazen, S., Moyes, P.: Elitism and stochastic dominance. Soc. Choice Welfare 39(1), 207–251 (2012)MathSciNetCrossRefGoogle Scholar
  5. 5.
    Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the 7th International Conference on World Wide Web 7, pp. 107–117 (1998)Google Scholar
  6. 6.
    Buckley, C., Voorhees, E.M.: Evaluating evaluation measure stability. In: Proceedings of the 23th ACM SIGIR, pp. 33–40 (2000)Google Scholar
  7. 7.
    Fan, W., Pathak, P., Wu, H.: The effects of fitness functions on genetic programming-based ranking discovery for web search. JASIST 55, 628–636 (2004)CrossRefGoogle Scholar
  8. 8.
    Gevrey, J., Ruger, S.M.: Link based approaches for text retrieval. In: Proceedings of the 10th TREC (2001)Google Scholar
  9. 9.
    Lacerda, A., Cristo, M., Gonçalves, M.A., Fan, W., Ziviani, N., Ribeiro-Neto, N.: Learning to advertise. In: Proceedings of the 29th ACM SIGIR, pp. 549–556 (2006)Google Scholar
  10. 10.
    Oren, N.: Reexamining TF-IDF based information retrieval with genetic programming. In: Proceedings of SAICSIT, pp. 224–234 (2002)Google Scholar
  11. 11.
    Robertson, S.E., Zaragoza, H.: The probabilistic relevance framework: BM25 and beyond. Found. Trends Inf. Retrieval 3(4), 333–389 (2009)CrossRefGoogle Scholar
  12. 12.
    Trotman, A.: Learning to rank. Inf. Retrieval J. 8, 359–381 (2005)CrossRefGoogle Scholar
  13. 13.
    Cuzzocrea, A., Saccà, D., Ullman, J.D.: Big data: a research agenda. In: Proceedings of IDEAS, pp. 198–203 (2013)Google Scholar
  14. 14.
    Keyhanipour, A.H., Moshiri, B., Oroumchian, F., Rahgozar, M., Badie, K.: Learning to rank: new approach with the layered multi-population genetic programming on click-through features. Genet. Program Evolvable Mach. 17(3), 203–230 (2016)CrossRefGoogle Scholar
  15. 15.
    Khodadi, I., Abadeh, M.S.: Genetic programming-based feature learning for question answering. Inf. Process. Manag. 52(2), 340–357 (2016)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Ricardo Baeza-Yates
    • 1
  • Alfredo Cuzzocrea
    • 2
    • 3
    Email author
  • Domenico Crea
    • 4
  • Giovanni Lo Bianco
    • 4
  1. 1.Northeastern University at Silicon ValleySan JoseUSA
  2. 2.DIA DepartmentUniversity of TriesteTriesteItaly
  3. 3.ICAR-CNRRendeItaly
  4. 4.DIMES DepartmentUniversity of CalabriaRendeItaly

Personalised recommendations