Abstract
This paper clarifies the apparent randomness and confusion in the choice of different query methods using LSI to be found in the current text and image retrieval literature. We also propose and test some modified query approaches using the novel technique of singular value rescaling (SVR). Experiments on standardized TREC data set confirmed the effectiveness of SVR, showing an improvement ratio of 5.6% over the best conventional LSI query approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dumais, S.: Improving the Retrieval of Information from External Sources. Behavior Research Methods, Instruments and Computers, 229–236 (1991)
Berry, M., Dumais, S., O’Brien, G.: Using Linear Algebra for Intelligent Information Retrieval. SIAM Review, 573–595 (1995)
Kolda, T.G., O’Leary, D.P.: A Semi-Discrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval. ACM Transactions on Information Systems, 322–346 (1998)
Deerwester, S., Dumais, S., Furnas, G., et al.: Patent: Computer Information Retrieval using Latent Semantic Structure. U. S. Patent No. 4,839,853 (1989)
Berry, M., Fierro, R.: Low-Rank Orthogonal Decompositions for Information Retrieval Applications. Numerical Linear Algebra with Applications, 301–328 (1996)
Jiang, J.: Using Latent Semantic Indexing for Data Mining, MS Thesis, Department of Computer Science, University of Tennessee (1997)
Zhao, R., Grosky, W.I.: Negotiating the Semantic Gap: From Feature Maps to Semantic Landscapes. Pattern Recognition, 593–600 (2002)
Ding, C.H.: A Probabilistic Model for Dimensionality Reduction in Information Retrieval and Filtering. In: Proceedings of the First SIAM Computational Information Retrieval Workshop, Raleigh, NC (2000)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Ando, R.K.: Latent Semantic Space: Iterative Scaling Improves Precision of Inter-document Similarity Measurement. In: Proceedings of SIGIR, Athens, Greece, pp. 216–223 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yan, H., Grosky, W.I., Fotouhi, F. (2005). Augmenting the Power of the Various Versions of LSI Used in Document Retrieval. In: Bhalla, S. (eds) Databases in Networked Information Systems. DNIS 2005. Lecture Notes in Computer Science, vol 3433. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31970-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-31970-2_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25361-7
Online ISBN: 978-3-540-31970-2
eBook Packages: Computer ScienceComputer Science (R0)