Methods of Relevance Ranking and Hit-content Generation in Math Search
To be effective and useful, math search systems must not only maximize precision and recall, but also present the query hits in a form that makes it easy for the user to identify quickly the truly relevant hits. To meet that requirement, the search system must sort the hits according to domain-appropriate relevance criteria, and provide with each hit a query-relevant summary of the hit target.
The standard relevance measures in text search, which rely mostly on keyword frequencies and document sizes, turned out to be inadequate in math search. Therefore, alternative relevance measures must be defined, which give more weight to certain types of information than to others and take into account cross-reference statistics. In this paper, new, multidimensional relevance metrics are defined for math search, methods for computing and implementing them are discussed, and comparative performance evaluation results are presented.
Query-relevant hit-summary generation is another factor that enables users to quickly determine the relevance of the presented hits. Although the hit title accompanied by a few leading sentences from the target document is simple to produce, this often fails to convey to the user the document’s relevant excerpts. This shifts the burden onto the user to pursue many of the hits, and read significant portions of their target documents, to finally locate the wanted documents. Clearly, this task is too time-consuming and should be largely automated. This paper presents query-relevant hit-summary generation methods, outlines implementation strategies, and presents performance evaluation results.
Unable to display preview. Download preview PDF.
- 1.MathSciNet. American Mathematical Society (AMS), http://www.ams.org/mathscinet
- 2.Bancerek, G.: The 5th International Conference on Mathematical Knowledge Management, Wokingham, UK, pp. 266–279 (August 11-12, 2006)Google Scholar
- 4.Guidi, F.: Searching and Retrieving in Content-based Repositories of Formal Mathematical Knowledge. Ph.D. Thesis in Computer Science, University of Bologna, Technical report UBLCS 2003-06 (March 2003)Google Scholar
- 5.Guidi, F., Schena, I.: A Query Language for a Metadata Framework about Mathematical Resources. In: The 2nd International Conf. Mathematical Knowledge Management, Bertinoro, Italy (February 2003)Google Scholar
- 6.Jahrbuch Database, http://www.emis.de/MATH/JFM/JFM.html
- 7.Lozier, D.W.: The DLMF Project: A New Initiative in Classical Special Functions. In: International Workshop on Special Functions - Asymptotics, Harmonic Analysis and Mathematical Physics. Hong Kong (June 21-25, 1999)Google Scholar
- 8.Lozier, D.W., Miller, B.R., Saunders, B.V.: Design of a Digital Mathematical Library for Science, Technology and Education. In: Proceedings of the IEEE Forum on Research and Technology Advances in Digital Libraries, IEEE ADL 1999, Baltimore, Maryland (May 1999)Google Scholar
- 9.MathDi (Mathematics Didactics Database), http://www.emis.de/MATH/DI.html
- 10.Mathdex search tool, http://www.mathdex.com:8080/mathfind/search
- 11.Mathdex description, http://www.ima.umn.edu/2006-2007/SW12.8-9.06/activities/Miner-Robert/index.html
- 12.Mathematica, http://www.mathematica.com
- 14.MoWGLI: Mathematics on the Web: Get It by Logics and Interfaces, http://mowgli.cs.unibo.it/
- 15.Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw Hill, New York (1993)Google Scholar
- 16.Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. Addison-Wesley, London (1999)Google Scholar
- 17.Youssef, A.: Information Search And Retrieval of Mathematical Contents: Issues And Methods. In: Proceedings of the ISCA 14th International Conference on Intelligent and Adaptive Systems and Software Engineering (IASSE-2005), July 20-22, Toronto, Canada (2005)Google Scholar
- 18.Youssef, A.: Roles of Math Search in Mathematics. In: The 5th International Conference on Mathematical Knowledge Management, Wokingham, UK, pp. 2–16 (August 11-12, 2006)Google Scholar
- 19.Zentralblatt MATH database at European Mathematical Information Service (EMIS), http://www.emis.de/ZMATH/