Advertisement

Information Flow Analysis with Chinese Text

  • Paulo Cheong
  • Dawei Song
  • Peter Bruza
  • Kam-Fai Wong
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3248)

Abstract

This article investigates the effectiveness of an information inference mechanism on Chinese text. The information inference derives implicit associations via computation of information flow on a high dimensional conceptual space, which is approximated by a cognitively motivated lexical semantic space model, namely Hyperspace Analogue to Language (HAL). A dictionary-based Chinese word segmentation system was used to segment words. To evaluate the Chinese-based information flow model, it is applied to query expansion, in which a set of test queries are expanded automatically via information flow computations and documents are retrieved. Standard recall-precision measures are used to measure performance. Experimental results for TREC-5 Chinese queries and People Daily’s corpus suggest that the Chinese information flow model significantly increases average precision, though the increase is not as high as those achieved using English corpus. Nevertheless, there is justification to believe that the HAL-based information flow model, and in turn our psychologistic stance on the next generation of information processing systems, have a promising degree of language independence.

Keywords

Information Flow Average Precision Latent Semantic Analysis Query Term Query Expansion 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Barwise, J., Seligman, J.: Information Flow: The Logic of Distributed Systems. Cambridge Tracts in Theoretical Computer Science 44 (1997)Google Scholar
  2. 2.
    Bruza, P.D., Song, D.: Inferring Query Models by Information Flow Analysis. In: Proceedings of the 11th International ACM Conference on Information and Knowledge Management (CIKM 2002), pp. 260–269 (2002)Google Scholar
  3. 3.
    Burgess, C., Lund, K.: Parsing Constraints and High-Dimensional Semantic Space. Language and Cognitive Processes 12, 177–210 (1997)CrossRefGoogle Scholar
  4. 4.
    Burgess, C., Livesay, L., Lund, K.: Explorations in Context Space: Words, Sentences, Discourse. In: Foltz, P.W. (ed.) Quantitative Approaches to Semantic Knowledge Representation, vol. 25(2&3), pp. 179–210 (1998)Google Scholar
  5. 5.
    Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. MIT Press, Cambridge (2000)Google Scholar
  6. 6.
    Landauer, T., Dumais, S.: A Solution to Plato.s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review 104(2), 211–240 (1997)CrossRefGoogle Scholar
  7. 7.
    Lund, K., Burgess, C.: Producing High-dimensional Semantic Spaces from Lexical Co-occurrence. Behavior Research Methods. Instruments, & Computers 28(2), 203–208 (1996)CrossRefGoogle Scholar
  8. 8.
    Robertson, S.E., Walker, S., Spark-Jones, K., Hancock-Beaulieu, M.M., Gatford, M.: OKAPI at TREC-3. In: Proceedings of the 3rd Text Retrieval Conference (TREC-3), pp. 109–126 (1994)Google Scholar
  9. 9.
    Song, D., Bruza, P.D.: Towards context-sensitive information inference. Journal of the American Society for Information Science and Technology 54(4), 321–334 (2003)CrossRefGoogle Scholar
  10. 10.
    Text Retrieval Conference (TREC), National Institution of Standards and Technology( NIST), http://trec.nist.gov/data/
  11. 11.
    Cheong, P., Song, D., Bruza, P.D., Wong, K.F.: Information Flow Analysis with Chinese Text. In: Su, K.-Y., Tsujii, J., Lee, J.-H., Kwong, O.Y. (eds.) IJCNLP 2004. LNCS (LNAI), vol. 3248, pp. 91–98. Springer, Heidelberg (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Paulo Cheong
    • 1
  • Dawei Song
    • 1
  • Peter Bruza
    • 1
  • Kam-Fai Wong
    • 2
  1. 1.CRC for Enterprise Distributed Systems Technology (DSTC)The University of QueenslandBrisbane
  2. 2.Department of Systems Engineering and Engineering ManagementThe Chinese University of Hong KongShatin, N.T, Hong Kong

Personalised recommendations