Structure-Aware Query for Digital Libraries: Use Cases and Challenges for the Humanities

  • Christopher York
  • Clifford Wulfman
  • Greg Crane
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2769)


Much recent research in database design focuses on persistence models for semistructured data similar to the SGML and XML that humanities digital libraries have long used to encode digital editions of texts. Structure-aware querying promises to simplify the design of such digital repositories by allowing them to store and query texts using a single, unified information model. Using content the Perseus Project has acquired over the past ten years as a test case, we describe the advantages and delimit the problems in managing structure-aware queries over multiple or ambiguous schemas, evaluate the place of markup in digital libraries where much content is automatically generated, and examine the uses for structure-aware query in a system that stores both semistructured content and graph-structured metadata.





Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Christopher York (1)
  • Clifford Wulfman (1)
  • Greg Crane (1)
  1. Perseus Project, Tufts University, Medford, U.S.A.
