Abstract
The present paper reveals that fuzzy graphs are a basic tool for agglomerative hierarchical clustering. A new theorem is given that states equivalence between Wishart’s mode analysis using kth nearest neighbor element (Wishart, 1968, 1969) and connected components of a fuzzy graph with membership values on vertices. This theorem is proved by generalizing a basic result in agglomerative clustering, which states the equivalence between the nearest neighbor method and the connected components of the standard fuzzy graph (Miyamoto, 1990). As a consequence, it is easily seen that Wishart’s method can be replaced by the nearest neighbor method with a modified similarity. Second aim of this paper is to show how clusters are used in information retrieval with fuzziness in indices. A typical example is an input-output diagram that represents information retrieval process using fuzzy algebra (min-max algebra). A diagram representation for information retrieval with the ordinary algebra has been proposed and feedback has been introduced (Heaps, 1978). This system of the diagram has two drawbacks, one of which is that the system is unable to generalize the ordinary retrieval method of binary indexing, and the other is that feedback process fails to converge. The use of min-max algebra in fuzzy set theory solves these problems. The diagram with this algebra which corresponds to calculation of attainability of a fuzzy graph generalizes the binary retrieval method, and feedback always converges; the result shows a fuzzy equivalence relation, namely, a clustering. Application of Wishart’s mode method to information retrieval using the above equivalence is also discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderberg, M. R. (1973), Cluster Analysis for Applications, Academic Press, New Nork.
Bookstein, A. (1985), Probability and fuzzy-set applications to information retrieval, Annual Review of Information Science and Technology, Vol. 20, 331–342.
Garfield, E. (1979), Citation Indexing: Its Theory and Application in Science, Technology, and Humanities, John Wiley and Sons, New York.
Heaps, H. S. (1978), Information Retrieval: Computational and Theoretical Aspects, Academic Press, New York.
Miyamoto, S. (1990), Fuzzy Sets in Information Retrieval and Cluster Analysis, Kluwer Academic Publishers, Dordrecht.
Tarjan, R. E. (1983), An improved algorithm for hierarchical clustering using strong components. Information Processing Letters, Vol. 17, 37–41.
Wishart, D. (1968), Mode analysis: a generalization of nearest neighbour which reduces chaining effects, in A.J.Cole, ed., Numerical Taxonomy, Proc. Colloq. in Numerical Taxonomy, Univ., St. Andrews, 283-311.
Wishart, D. (1969), Numerical classification method for deriving natural classes, Nature, 221, 97–98.
Zadeh, L. A. (1965), Fuzzy sets, Information and Control, 8, 338–353.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1993 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Miyamoto, S. (1993). Fuzzy Graphs as a Basic Tool for Agglomerative Clustering and Information Retrieval. In: Opitz, O., Lausen, B., Klar, R. (eds) Information and Classification. Studies in Classification, Data Analysis and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-50974-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-50974-2_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-56736-3
Online ISBN: 978-3-642-50974-2
eBook Packages: Springer Book Archive