Skip to main content

Methods of Relevance Ranking and Hit-content Generation in Math Search

  • Conference paper
Towards Mechanized Mathematical Assistants (MKM 2007, Calculemus 2007)

Abstract

To be effective and useful, math search systems must not only maximize precision and recall, but also present the query hits in a form that makes it easy for the user to identify quickly the truly relevant hits. To meet that requirement, the search system must sort the hits according to domain-appropriate relevance criteria, and provide with each hit a query-relevant summary of the hit target.

The standard relevance measures in text search, which rely mostly on keyword frequencies and document sizes, turned out to be inadequate in math search. Therefore, alternative relevance measures must be defined, which give more weight to certain types of information than to others and take into account cross-reference statistics. In this paper, new, multidimensional relevance metrics are defined for math search, methods for computing and implementing them are discussed, and comparative performance evaluation results are presented.

Query-relevant hit-summary generation is another factor that enables users to quickly determine the relevance of the presented hits. Although the hit title accompanied by a few leading sentences from the target document is simple to produce, this often fails to convey to the user the document’s relevant excerpts. This shifts the burden onto the user to pursue many of the hits, and read significant portions of their target documents, to finally locate the wanted documents. Clearly, this task is too time-consuming and should be largely automated. This paper presents query-relevant hit-summary generation methods, outlines implementation strategies, and presents performance evaluation results.

This work was done in part at the National Institute of Standards and Technology, USA, as part of the DLMF Project.

This work was supported in part by the National Science Foundation (NSF), USA, under Grant No. 0208818.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. MathSciNet. American Mathematical Society (AMS), http://www.ams.org/mathscinet

  2. Bancerek, G.: The 5th International Conference on Mathematical Knowledge Management, Wokingham, UK, pp. 266–279 (August 11-12, 2006)

    Google Scholar 

  3. Einwohner, T.H., Fateman, R.: Searching techniques for integral tables. In: International symposium on Symbolic and algebraic computation, ACM, New York (1995), http://torte.cs.berkeley.edu:8010/tilu

    Google Scholar 

  4. Guidi, F.: Searching and Retrieving in Content-based Repositories of Formal Mathematical Knowledge. Ph.D. Thesis in Computer Science, University of Bologna, Technical report UBLCS 2003-06 (March 2003)

    Google Scholar 

  5. Guidi, F., Schena, I.: A Query Language for a Metadata Framework about Mathematical Resources. In: The 2nd International Conf. Mathematical Knowledge Management, Bertinoro, Italy (February 2003)

    Google Scholar 

  6. Jahrbuch Database, http://www.emis.de/MATH/JFM/JFM.html

  7. Lozier, D.W.: The DLMF Project: A New Initiative in Classical Special Functions. In: International Workshop on Special Functions - Asymptotics, Harmonic Analysis and Mathematical Physics. Hong Kong (June 21-25, 1999)

    Google Scholar 

  8. Lozier, D.W., Miller, B.R., Saunders, B.V.: Design of a Digital Mathematical Library for Science, Technology and Education. In: Proceedings of the IEEE Forum on Research and Technology Advances in Digital Libraries, IEEE ADL 1999, Baltimore, Maryland (May 1999)

    Google Scholar 

  9. MathDi (Mathematics Didactics Database), http://www.emis.de/MATH/DI.html

  10. Mathdex search tool, http://www.mathdex.com:8080/mathfind/search

  11. Mathdex description, http://www.ima.umn.edu/2006-2007/SW12.8-9.06/activities/Miner-Robert/index.html

  12. Mathematica, http://www.mathematica.com

  13. Miller, B., Youssef, A.: Technical Aspects of the Digital Library of Mathematical Functions. Annals of Mathematics and Artificial Intelligence 38, 121–136 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  14. MoWGLI: Mathematics on the Web: Get It by Logics and Interfaces, http://mowgli.cs.unibo.it/

  15. Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw Hill, New York (1993)

    Google Scholar 

  16. Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. Addison-Wesley, London (1999)

    Google Scholar 

  17. Youssef, A.: Information Search And Retrieval of Mathematical Contents: Issues And Methods. In: Proceedings of the ISCA 14th International Conference on Intelligent and Adaptive Systems and Software Engineering (IASSE-2005), July 20-22, Toronto, Canada (2005)

    Google Scholar 

  18. Youssef, A.: Roles of Math Search in Mathematics. In: The 5th International Conference on Mathematical Knowledge Management, Wokingham, UK, pp. 2–16 (August 11-12, 2006)

    Google Scholar 

  19. Zentralblatt MATH database at European Mathematical Information Service (EMIS), http://www.emis.de/ZMATH/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Manuel Kauers Manfred Kerber Robert Miner Wolfgang Windsteiger

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Youssef, A.S. (2007). Methods of Relevance Ranking and Hit-content Generation in Math Search . In: Kauers, M., Kerber, M., Miner, R., Windsteiger, W. (eds) Towards Mechanized Mathematical Assistants. MKM Calculemus 2007 2007. Lecture Notes in Computer Science(), vol 4573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73086-6_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73086-6_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73083-5

  • Online ISBN: 978-3-540-73086-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics