A Hierarchical Framework for Multi-document Summarization of Dissertation Abstracts

  • Christopher S.G. Khoo
  • Shiyan Ou
  • Dion Hoe-Lian Goh
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2555)


This paper reports initial work on developing methods for automatic generation of multi-document summaries of dissertation abstracts in a digital library. The focus is on automatically generating a summary of a set of dissertation abstracts retrieved in response to user query, and presenting the summary using a visualization method. A hierarchical variable-based framework for multi-document summarization of dissertation abstracts in sociology and psychology is presented. The framework makes use of macro-level and micro-level discourse structure of dissertation abstracts as well as cross-document structure. The micro-level structure of problem statements found in a sample of 50 dissertation abstracts was analyzed, and the common features found are described in the paper. A demonstration prototype with a tree-view interface for presenting multi-document abstracts has been implemented.


Domestic Violence Discourse Structure Dissertation Abstract School Violence Hierarchical Framework 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bell, A., & Garrett, P. (19 98). Media and discourse: A critical overview. Approaches to Media Discourse, 65–103. Oxford: Malden.Google Scholar
  2. 2.
    Jansen, B., Spink, A., Bateman, J., & Saracevic, T. (1998). Real life information retrieval: A study of user queries on the web. SIGIR Forum, 32(1), 5–17.CrossRefGoogle Scholar
  3. 3.
    Mann, W., & Thompson, S. (1988). Rhetorical structure theory: Toward a functional theory of text organization. Text, 8(3), 243–281.Google Scholar
  4. 4.
    Marcu, D. (1997). The rhetorical parsing, summarization, and generation of natural language texts. Ph.D. Dissertation, Department of Computer Science, University of Toronto.Google Scholar
  5. 5.
    Radev, D. (2000). A common theory of information fusion from multiple text sources step one: cross-document structure. In Proceedings of the 1st SIGdial Workshop on Discourse and Dialogue. Available at:
  6. 6.
    Sowa, J. (1984). Conceptual structures: Information processing in mind and machine. Reading, MA: Addison-Wesley.Google Scholar
  7. 7.
    Spink, A., & Xu, J.L. (2000). Selected results from a large study of Web searching: the Excite study. Information Research [Online], 6(1). Available at:
  8. 8.
    Trochim, W. (1999). The research methods knowledge base. Cincinnati, OH: Atomic Dog Publishing.Google Scholar
  9. 9.
    Zamir, O., & Etzioni, O. (1999). Grouper: A dynamic clustering interface to web search results. In Proceedings of the Eighth International World Wide Web Conference. Available at:

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Christopher S.G. Khoo
    • 1
  • Shiyan Ou
    • 1
  • Dion Hoe-Lian Goh
    • 1
  1. 1.Division of Information Studies School of Communication & InformationNanyang Technological UniversitySingapore

Personalised recommendations