Abstract
The small world topology is known widespread in biological, social and man-made systems. This paper shows that the small world structure also exists in documents,such as papers. A document is represented by a network;the nodes represent terms,and the edges represent the co-occurrence of terms. This network is shown to have the characteristics of being a small world,i.e.,nodes are highly clustered yet the path length between them is small. Based on the topology,we develop an indexing system called KeyWorld,which extracts important terms by measuring their contribution to the graph being small world.
A term is a word or a word sequence.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Albert, H. Jeong,and A.-L. Barabasi.The diameter of the World Wide Web. Nature,401,1999.
L.C. Freeman.Centrality in social networks:Conceptual clarification.Social Networks,1:215–239,1979.
M. Granovetter.Strength of weak ties.American Journal of Sociology,78:1360–1380,1973.
H. Kautz, B. Selman,and M. Shah.The hidden Web.AI magagine,18(2),1997.
S. Milgram.The small-world problem.Psychology Today,2:60–67,1967.
Y. Ohsawa, N.E. Benson,and M. Yachida.KeyGraph:Automatic indexing by co-occurrence graph based on building construction metaphor. In Proc. Advanced Digital Library Conference (IEEE ADL’ 98),1998.
Y. Ohsawa and M. Yachida.Discover risky active faults by indexing an earthquake sequence. In Proc. Discovery Science,pages 208–219,1999.
G. Salton.Automatic Text Processing. Addison-Wesley,1988.
T. Walsh.Search in a small world. In Proc. IJCAI’ 99,pages 1172–1177,1999.
D. Watts.Small worlds:the dynamics of networks between order and randomness. Princeton,1999.
D. Watts and S. Strogatz.Collective dynamics of small-world networks.Nature, 393:440–442,1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matsuo, Y., Ohsawa, Y., Ishizuka, M. (2001). KeyWorld:Extracting Keywords from Document s Small World. In: Jantke, K.P., Shinohara, A. (eds) Discovery Science. DS 2001. Lecture Notes in Computer Science(), vol 2226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45650-3_24
Download citation
DOI: https://doi.org/10.1007/3-540-45650-3_24
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42956-2
Online ISBN: 978-3-540-45650-6
eBook Packages: Springer Book Archive