Mathematical Document Retrieval Model Using Structural Information of Equations in Pseudo-documents
Math-aware search engines are needed to effectively retrieve mathematical documents that contain equations. In this paper, we propose a mathematical document retrieval system that lets users retrieve documents using any combination of keywords and equations. The proposed system indexes equations together with their surrounding keywords from mathematical documents. It then searches and ranks mathematical documents using a language model modified for heterogeneous indexing units (i.e., mixtures of equations and keywords). In the experiments, the proposed system performed well, especially at the top ranks of the result list.
Keywords: Mathematical Document Retrieval, Heterogeneous Indexing Term, Pseudo-document
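The indexing and ranking idea summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's actual model: the structural tokenizer (`equation_terms`), the `eq:`/`kw:` term prefixes, and the Jelinek-Mercer smoothed query-likelihood score with weight `lam` are all assumptions chosen only to show how equation terms and keyword terms might share one pseudo-document and one language-model score.

```python
import math
from collections import Counter


def equation_terms(equation: str) -> list[str]:
    """Hypothetical structural tokenizer: each symbol plus each adjacent symbol pair."""
    symbols = equation.split()
    pairs = [f"{a}_{b}" for a, b in zip(symbols, symbols[1:])]
    # The "eq:" prefix keeps equation terms distinct from keyword terms.
    return [f"eq:{t}" for t in symbols + pairs]


def pseudo_document(equation: str, context: str) -> Counter:
    """Build one pseudo-document from an equation and its surrounding keywords."""
    keywords = [f"kw:{w.lower()}" for w in context.split()]
    return Counter(equation_terms(equation) + keywords)


def score(query_terms, doc: Counter, collection: Counter, lam: float = 0.5) -> float:
    """Query log-likelihood with Jelinek-Mercer smoothing (an assumed choice)."""
    doc_len = sum(doc.values())
    coll_len = sum(collection.values())
    total = 0.0
    for term in query_terms:
        p_coll = collection[term] / coll_len if coll_len else 0.0
        if p_coll == 0.0:
            continue  # terms never seen in the collection are ignored
        p_doc = doc[term] / doc_len if doc_len else 0.0
        total += math.log((1 - lam) * p_doc + lam * p_coll)
    return total


# Toy collection of two pseudo-documents (illustrative data only).
docs = {
    "d1": pseudo_document("a ^ 2 + b ^ 2 = c ^ 2", "Pythagorean theorem right triangle"),
    "d2": pseudo_document("E = m c ^ 2", "mass energy equivalence"),
}
collection = sum(docs.values(), Counter())

# A query mixing an equation fragment with a keyword.
query = equation_terms("a ^ 2 + b ^ 2") + ["kw:triangle"]
ranked = sorted(docs, key=lambda d: score(query, docs[d], collection), reverse=True)
print(ranked)
```

In this toy setup the pseudo-document built around the Pythagorean equation ranks first, because it matches both the structural equation terms and the keyword. A real system would need a tokenizer that works on the equation's markup tree rather than on whitespace-separated symbols.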