Chapter

Modelling with Words

Volume 2873 of the series Lecture Notes in Computer Science pp 153-167

A Hybrid Framework Using SOM and Fuzzy Theory for Textual Classification in Data Mining

  • Yi-Ping Phoebe ChenAffiliated withCentre for Information Technology Innovation, Faculty of Information Technology, Queensland University of Technology

* Final gross prices may vary according to local VAT.

Get Access

Abstract

This paper presents a hybrid framework combining self-organising map (SOM) and fuzzy theory for textual classification. Clustering using self-organizing maps is applied to produce multiple targets. In this paper, we propose that an amalgamation of SOM and association rule theory may hold the key to a more generic solution, less reliant on initial supervision and redundant user interaction. The results of clustering stem words from text documents could be utilised to derive association rules which designate the applicability of documents to the user. A four stage process is consequently detailed, demonstrating a generic example of how a graphical derivation of associations may be derived from a repository of text documents, or even a set of synopses of many such repositories. This research demonstrates the feasibility of applying such processes for data mining and knowledge discovery.