Abstract
In this paper we present a comprehensive approach to conceptual structuring and intelligent navigation of text databases. Given any collection of texts, we first automatically extract a set of index terms describing each text. Next, we use a particular lattice conceptual clustering method to build a network of clustered texts whose nodes are described using the index terms. We argue that the resulting network supports an hybrid navigational approach to text retrieval — implemented into an actual user interface — that combines browsing potentials with good retrieval performance. We present the results of an experiment on subject searching where this approach outperformed a conventional Boolean retrieval system.
Preview
Unable to display preview. Download preview PDF.
References
Barletta, R., Mark, W. (1988). Explanation-Based Indexing of Cases. Proceedings of AAAI-88, St. Paul, Minnesota, Morgan Kaufmann.
Baudin, C., Pell, B., Kedar, S. (1994). Incremental Acquisition of Conceptual Indices for Multimedia Design Documentation. Proceedings of the AAAI-94 Workshop on Indexing and Reuse in Multimedia Systems, Seattle, Washington.
Bowman, M., Danzig, P., Manber, U., & Schwartz, F. (1994). Scalable Internet Resource Discovery: Research Problems and Approaches. Communications of the ACM, 37, 8, pp. 98–114.
Carpineto, C., & Romano, G. (1993). GALOIS: An order-theoretic approach to conceptual clustering. Proceedings of the 10th International Conference on Machine Learning (pp. 33–40), Amherst, MA:Morgan Kaufmann.
Carpineto, C., & Romano, G. (1994a). A lattice conceptual clustering system and its application to browsing retrieval. Submitted to Machine Learning.
Carpineto, C., & Romano, G. (1994b). Dynamically bounding browsable retrieval spaces: an application to Galois lattices. In Proceedings of RIAO 94: Intelligent Multimedia Information Retrieval Systems and Management(pp. 520–533), New York.
Carpineto, C., & Romano, G. (1995). ULYSSES: A lattice-based multiple interaction strategy retrieval interface. To appear in Proceedings of EWHCI'95: 5th East-West Human Computer Interaction Conference, Moscow.
Chen, H., Hsu, P., Orwig, R., Hoopes, L., Nunamaker, J. (1994). Automatic concept classification of text from electronic meeting. Communications of the ACM, 37, 10, pp. 57–73.
Crouch, D., Crouch, C., & Andreas, G. (1989). The use of cluster hierarchies in hypertext information retrieval. Proceedings of the ACM Hypertext '89 Conference (pp. 225–237), Pittsburgh, PA: ACM.
Furnas, G. (1986). Generalized fisheye views. Proceedings of the Human Factors in Computing Systems (pp. 16–23). North Holland.
Karp, D., Schabes, Y., Zaidel, M., Egedi, D. (1992). A freely available wide coverage morphological analyzer for English. Proceedings of the 14th International Conference on Computational Linguistics (COLING '92), Nantes, France.
Lucarella, D., Parisotto, S., Zanzi, A. (1993). MORE: Multimedia Object Retrieval Environment. Proceedings of the Fifth ACM Conference on Hypertext (pp. 39–50). Seattle, WA.
Maarek, Y., Berry, D., & Kaiser, G. (1991). An Information Retrieval Approach For Automatically Constructing Software Libraries. IEEE Transactions on Software Engineering, 17, 8, 800–813.
Mellish, C. (1991). The description identification problem. Artificial Intelligence, 52, 2, 151–168.
Michalski, R., Stepp, R. (1983). Learning from observation: Conceptual clustering. In R. Michalski, J. Carbonell, T. Mitchell (Eds.), Machine Learning: An Artificial Intelligence Approach (Vol. 1). Palo Alto, CA: Tioga Publishing.
Salton, G. (1989). Automatic Text Processing: The transformation, Analysis and Retrieval of Information by Computer. Addison Wesley.
Sowa, J. (1984). Conceptual Structures: Information Processing in Mind and Machine, Addison-Wesley, 1984.
Srihari, R., Burhans, D. (1994). Visual Semantics: Extracting Visual Information from Text Accompanying Pictures. Proceedings of AAAI-94, Seattle, Washington, AAAI Press.
Thompson, R., & Croft, B. (1989). Support for browsing in an intelligent text retrieval system. International Journal of Man-machine Studies, 30, 639–668.
Willet, P. (1988). Recent trends in hierarchic document clustering: a critical review. Information Processing & Management, 24, 5, 577–597.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Carpineto, C., Romano, G. (1995). Automatic construction of navigable concept networks characterizing text databases. In: Gori, M., Soda, G. (eds) Topics in Artificial Intelligence. AI*IA 1995. Lecture Notes in Computer Science, vol 992. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60437-5_7
Download citation
DOI: https://doi.org/10.1007/3-540-60437-5_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60437-2
Online ISBN: 978-3-540-47468-5
eBook Packages: Springer Book Archive