Abstract
Synonymy — different words with the same meaning — is a major problem for text mining systems. We have proposed asymmetric word similarities as a possible solution to this problem, where the similarity between words is computed on the basis of the similarities between contexts in which the words appear, rather than on their syntactic identity. In this paper, we give details of an incremental algorithm to compute word similarities and outline some tests which show the method’s effectiveness.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hearst, M.A.: Invited Talk: Untangling Text Data Mining. Annual Meeting-Association for Computational Linguistics 37 (1999) 3–10.
Blake, C. and Pratt, W.: Better Rules, Few Features: A Semantic Approach to Selecting Features from Text. In Proc IEEE Data mining. San Jose, CA: (2001) 59–66.
Miller, G.A.: WordNet: A Lexical Database for English. Comm-ACM 38 (1995) 39.
Azmi-Murad, M. and Martin, T.P.: Information Retrieval using Contextual Word Similarity. In Proc UKCI-04 — UK Workshop on Computational Intell (2004) 144–149.
Azmi-Murad, M. and Martin, T.P.: Asymmetric Word Similarities for Information Retrieval, Document Grouping and Taxonomic Organisation. In Proc EUNITE 2004 — Aachen, Germany (2004) 277–282.
Hahn, U. and Ramscar, M., (eds.): Similarity and Categorization. OUP (2001).
Goodman, N.: Seven strictures on similarity, in Problems and Projects, N. Goodman, (Ed)., Bobbs-Merrill. (1972) 437–447
Zadeh, L.A.: Similarity Relations and Fuzzy Orderings. Inf Sciences 3 (1971) 177–200.
Dubois, D. and Prade, H.: Fuzzy Sets and Systems: Theory and Applications. Academic Press (1980).
Love, B.C.: Similarity and Categorization: A Review. AI Magazine 23 (2002) 103–105.
Pantel, P. and Lin, D.: Discovering Word Senses from Text. In Proc ACM SIGKDD international conference on knowledge discovery and data mining. (2002) 613–619.
Harris, Z.: Distributional Structure, in The Philosophy of Linguistics, J.J. Katz, (Ed., 111), Oxford University Press. (1985) 26–47
Baldwin, J.F.: The Management of Fuzzy and Probabilistic Uncertainties for Knowledge Based Systems, in Encyclopedia of AI, S.A. Shapiro, (Ed.), Wiley. (1992) 528–537
Baldwin, J.F., Martin, T.P., and Pilsworth, B.W.: FRIL — Fuzzy and Evidential Reasoning in AI. Research Studies Press (John Wiley), U.K. (1995).
Martin, T.P., Azvine, B., and Assadian, B.: Acquisition of Soft Taxonomies. In Proc IPMU-04. Perugia, Italy (2004) 1027–1032.
Martin, T.P. and Azvine, B.: Acquisition of Soft Taxonomies for Intelligent Personal Hierarchies and the Soft Semantic Web. BT Technology Journal 21 (2003) 113–122.
Azmi-Murad, M. and Martin, T.P.: Using Fuzzy Sets in Contextual Word Similarity, in Intell. Data Eng’ng and Automated Learning, LNCS 3177, Springer(2004) 517–522
Azmi-Murad, M. and Martin, T.P.: Sentence Extraction using Asymmetric Word Similiarity and Topic Similarity. In Proc 9th Online World Conference on Soft Computing in Industrial Applications.-http://wsc9.softcomputing.net/ (2004).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martin, T.P., Azmi-Murad, M. (2005). An Incremental Algorithm to find Asymmetric Word Similarities for Fuzzy Text Mining. In: Abraham, A., Dote, Y., Furuhashi, T., Köppen, M., Ohuchi, A., Ohsawa, Y. (eds) Soft Computing as Transdisciplinary Science and Technology. Advances in Soft Computing, vol 29. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32391-0_88
Download citation
DOI: https://doi.org/10.1007/3-540-32391-0_88
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25055-5
Online ISBN: 978-3-540-32391-4
eBook Packages: EngineeringEngineering (R0)