Abstract
We introduce vector space based approaches to natural language processing and some of their similarities with quantum theory when applied to information retrieval. We explain how dimensional reduction is called for from both a practical and theoretical point of view and how this can be achieved through choice of product or through projectors onto subspaces.
Similar content being viewed by others
References
Aerts, D.: General quantum modeling of combining concepts: a quantum field model in Fock space (2007). arXiv:0705.1740
Aerts, D.: Quantum structure in cognition. J. Math. Psychol. 53, 314–348 (2009)
Czachor, M., Aerts, D.: Quantum aspects of semantic analysis and symbolic artificial intelligence, J. Phys. A. Math Gen, 37 (2004)
Dumais, S.T., Furnas, G.W., Landauer, T.K., Deerwester, S.: Using latent semantic analysis to improve information retrieval. In: Proceedings of CHI’88: Conference on Human Factors in Computing, pp. 281–285. ACM, New York (1988)
Eckart, C., Young, G.: The approximation of one matrix by another of lower rank. Psychometrika 1, 211–218 (1936)
Frieze, A.M., Kannan, R., Vempala, S.: Fast Monte-Carlo algorithms for finding low-rank approximations. In: IEEE Symposium on Foundations of Computer Science, pp. 370–378 (1998)
Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. MIT Press, Cambridge (2000)
Golub, G., Kahan, W.: Calculating the singular values and pseudo-inverse of a matrix. J. SIAM, Numer. Anal. Ser. B 2(2), 205–224 (1965)
Johnson, W., Lindenstrauss, J.: Extensions of Lipschitz mappings into a Hilbert space. Contemp. Math. 26, 189–206 (1984)
Kanerva, P., Kristofersson, J., Holst, A.: Random indexing of text samples for latent semantic analysis. In: Proceedings of the 22nd Annual Conference of the Cognitive Science Society, Mahwah, New Jersey, p. 1036 (2000)
Kontostathis, A., Pottenger, W.M.: A framework for understanding latent semantic indexing (LSI) performance. Information Processing and Management 42(1), 56–73 (2006)
Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: the latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychol. Rev. 104, 211–240 (1997)
Lund, K., Burgess, C.: Producing high-dimensional semantic spaces from lexical co-occurrence. Behav. Res. Methods Instrum. Comput 28(2), 203–208 (1996)
Sahlgren, M.: Vector-based semantic analysis: representing word meanings based on random labels. In: Lenci, A., Montemagni, S., Pirrelli, V. (eds.) The Acquisition and Representation of Word Meaning. Kluwer Academic, Norwell (2001)
Nerlich, B., Clarke, D.: Polysemy and Flexibility, Polysemy: Flexible Patterns of Meaning in Mind and Language. Walter de Gruyter, Berlin (2003)
Plate, T.: Holographic Reduced Representations. CSLI Lecture Notes, vol. 150. CSLI Publications, Stanford (2003)
Rosch, E.: Principles of categorization. In: Rosch, E., Lloyd, B. (eds.) Cognition and Categorization. Erlbaum, Hillsdale (1978)
Rosch, E.: Prototype classification and logical classification: the two systems. In: Scholnick, E.K. (ed.) New Trends in Conceptual Representation: Challenges to Piaget’s Theory? Erlbaum, Hillsdale (1983)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing, information retrieval and language processing. Commun. ACM 18(11), 613–620 (1975)
Smolensky, P.: Tensor product variable binding and the representation of symbolic structures in connectionist systems. Artif. Intell. 46, 159 (1990)
van Rijsbergen, C.J.: Geometry of Information Retrieval. Cambridge University Press, Cambridge (2004)
Widdows, D., Peters, S.: Word vectors and quantum logic: experiments with negation and disjunction. In: Proceedings of Mathematics of Language, vol. 8, pp. 141–154 (2003)
Widdows, D., Higgins, M.: Geometric ordering of concepts, logical disjunction, and leaning by induction. In Compositional Connectionism in Cognitive Science (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Aerts, S. Dimensional Reduction in Vector Space Methods for Natural Language Processing: Products and Projections. Int J Theor Phys 50, 3646–3653 (2011). https://doi.org/10.1007/s10773-011-0851-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10773-011-0851-6