Methods of Relevance Ranking and Hit-content Generation in Math Search

  • Abdou S. Youssef
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4573)

Abstract

To be effective and useful, math search systems must not only maximize precision and recall, but also present the query hits in a form that lets the user quickly identify the truly relevant ones. To meet that requirement, the search system must sort the hits according to domain-appropriate relevance criteria and provide, with each hit, a query-relevant summary of the hit target.

The standard relevance measures in text search, which rely mostly on keyword frequencies and document sizes, have proven inadequate for math search. Alternative relevance measures must therefore be defined, ones that give more weight to certain types of information than to others and that take cross-reference statistics into account. In this paper, new multidimensional relevance metrics are defined for math search, methods for computing and implementing them are discussed, and comparative performance evaluation results are presented.
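As a rough illustration of the idea only, and not the metrics defined in the paper, the following Python sketch combines two dimensions: query-term matches weighted by the type of content they occur in, and a damped bonus for incoming cross-references. The field names, the weights in `FIELD_WEIGHTS`, the `xref_weight` parameter, and the `Doc` structure are all hypothetical choices made for this example.

```python
import math
from dataclasses import dataclass

# Hypothetical weights: a match in a title or an equation counts more than one in body text.
FIELD_WEIGHTS = {"title": 3.0, "equation": 2.0, "body": 1.0}


@dataclass
class Doc:
    doc_id: str
    fields: dict       # field name -> list of tokens in that field
    cited_by: int = 0  # number of incoming cross-references (assumed to be precomputed)


def relevance(doc, query_terms, xref_weight=0.5):
    """Field-weighted match count plus a log-damped cross-reference bonus."""
    content = 0.0
    for name, tokens in doc.fields.items():
        matches = sum(1 for t in tokens if t in query_terms)
        content += FIELD_WEIGHTS.get(name, 1.0) * matches
    if content == 0.0:
        return 0.0  # a document with no query match is never promoted by citations alone
    return content + xref_weight * math.log1p(doc.cited_by)


def rank(docs, query_terms):
    """Return the documents sorted from most to least relevant."""
    terms = set(query_terms)
    return sorted(docs, key=lambda d: relevance(d, terms), reverse=True)
```

With these assumed weights, a single title match outranks two body matches, unless the body-only document is heavily cross-referenced.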

Query-relevant hit-summary generation is another factor that enables users to determine quickly the relevance of the presented hits. Showing the hit title with a few leading sentences from the target document is simple to produce, but it often fails to convey the document’s relevant excerpts. The burden then falls on the user to follow many of the hits, and to read significant portions of their target documents, before finally locating the wanted documents. This task is too time-consuming and should be largely automated. This paper presents query-relevant hit-summary generation methods, outlines implementation strategies, and presents performance evaluation results.
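The contrast between a leading-sentences summary and a query-relevant one can be sketched in a few lines of Python. This is a simple stand-in technique (pick the sentences containing the most query terms), not the generation method of the paper; the function name `summarize`, the sentence-splitting regex, and the `max_sentences` parameter are assumptions made for this example.

```python
import re


def summarize(text, query_terms, max_sentences=2):
    """Return the sentences with the most query-term hits, in document order."""
    terms = {t.lower() for t in query_terms}
    # Naive sentence split on whitespace that follows ., ! or ?
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    scored = []
    for pos, sentence in enumerate(sentences):
        words = re.findall(r"\w+", sentence.lower())
        hits = sum(1 for w in words if w in terms)
        if hits:
            scored.append((hits, pos, sentence))

    if not scored:
        # No sentence matches the query: fall back to the leading sentences.
        return " ".join(sentences[:max_sentences])

    # Most hits first; earlier sentences win ties.
    best = sorted(scored, key=lambda x: (-x[0], x[1]))[:max_sentences]
    best.sort(key=lambda x: x[1])  # restore document order for readability
    return " ... ".join(s for _, _, s in best)
```

A real math search system would also need to match and excerpt mathematical expressions, which plain word matching does not handle.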

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Abdou S. Youssef, Department of Computer Science, The George Washington University, Washington, DC 20052, USA
