Abstract
A text usually contains multiple semantic units that correspond to the different reading requests of its users. Each semantic unit represents a topic that people are interested in reading about. A meaningful combination of semantic units can represent a certain aspect of the text. This paper proposes a mechanism that extracts semantic units from a text according to keywords representing users’ interests and organises the semantic units into facets reflecting certain aspects of the text. The mechanism displays the facets of a text through a set of operations and takes the human reading process into account. With this mechanism, readers can quickly obtain the content of interest from a large text. Experiments show its effectiveness and robustness.
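To make the idea of keyword-driven semantic units and facets concrete, the following is a minimal sketch and not the mechanism proposed in the paper. It assumes, purely for illustration, that a semantic unit is the list of sentences mentioning a keyword and that a facet is a grouping of units for a chosen keyword combination. The function names `split_sentences`, `extract_units`, and `build_facet` are hypothetical.

```python
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_units(text, keywords):
    """Assumed definition of a semantic unit: the sentences that mention
    a keyword as a whole word, matched case-insensitively."""
    sentences = split_sentences(text)
    units = {}
    for kw in keywords:
        pattern = re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        units[kw] = [s for s in sentences if pattern.search(s)]
    return units


def build_facet(units, combination):
    """Assumed definition of a facet: the units of a keyword combination,
    grouped by keyword. Keywords without an extracted unit are skipped."""
    return {kw: units[kw] for kw in combination if kw in units}


if __name__ == "__main__":
    text = ("The graph has ten nodes. A summary is short. "
            "Each summary cites the graph.")
    units = extract_units(text, ["graph", "summary"])
    facet = build_facet(units, ["graph", "summary"])
    # Display operation (illustrative): list each unit under its keyword.
    for keyword, sentences in facet.items():
        print(keyword)
        for sentence in sentences:
            print("  -", sentence)
```

A real system would likely replace the exact-word match with a semantic relatedness measure, so this sketch shows only the overall data flow from keywords to units to facets.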
Cite this article
Xu, B., Zhuge, H. Faceted navigation through keyword interaction. World Wide Web 17, 671–689 (2014). https://doi.org/10.1007/s11280-012-0192-2
DOI: https://doi.org/10.1007/s11280-012-0192-2