Skip to main content
Log in

HyperFile: A data and query model for documents

  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

Non-quantitative information such as documents and pictures pose interesting new problems in the database world. Traditional data models and query languages do not provide appropriate support for this information. Such data are typically stored in file systems, which do not provide the security, integrity, or query features of database management systems. The hypertext model has emerged as a good interface to this information; however,finding information using hypertext browsing does not scale well. We developed a query interface that serves as an extension of the browsing model of hypertext systems. These queries minimize the repeated user interactions required to locate data in a standard hypertext system. HyperFile is a prototype data server interface. In this article, we describe HyperFile, including a number of issues such as query generation, query processing, and indexing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Anderson, T.L., Berre, A.J., Mallison, M., Porter, H., and Schneider, B. The Tektronix HyperModel benchmark specification. Technical Report No. 89-05, Tektronix Computer Research Laboratory, Beaverton, OR, August 3, 1989.

    Google Scholar 

  • Aoki, P.M. Implementation of extended indexes in POSTGRES.SIGIR Forum, 25(1):2–9, 1991.

    Google Scholar 

  • Berners-Lee, T.J., Cailliau, R., Groff, J.-F., and Pollermann, B. World-wide web: The information universe.Electronic Networking: Research, Applications, and Policy, 2(1):52–58, 1992.

    Google Scholar 

  • Christophides, V., Abiteboul, S., Cluet, S., and Scholl, M. From structured documents to novel query facilities.Proceedings of the ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, 1994.

  • Clifton, C., Garcia-Molina, H., and Hagmann, R. The design of a document database.Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, NM, 1988.

  • Clifton, C. and Garcia-Molina, H. Indexing in a Hypertext Database.Proceedings of the International Conference on Very Large Databases, Brisbane, Australia, 1990.

  • Clifton, C. and Garcia-Molina, H. Distributed processing of filtering queries in HyperFile.Proceedings of the IEEE International Conference on Distributed Computing Systems, Arlington, TX, 1991.

  • Conklin, J. Hypertext: An introduction and survey.IEEE Computer, 20(9):17–41, 1987.

    Google Scholar 

  • Croft, W.B. and Lewis, D.D. An approach to natural language processing for document retrieval.Proceedings of the Tenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 1987.

  • Cruz, I.F., Mendelzon, A.O., and Wood, P.T. A graphical query language supporting recursion.Proceedings of the ACM SIGMOD International Conference on Management of Data, San Francisco, CA, 1987.

  • Dadam, P., Kuespert, K., Andersen, F., Blanken, H., Erbe, R., Guenauer, J., Lum, V., Pistor, P., and Walch, G. A DBMS prototype to support extended NF2 relations: An integrated view on flat tables and hierarchies.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1986.

  • Data Base Task Group. CODASYL Data Description Language. National Bureau of Standards Handbook 113, US Department of Commerce, Washington, DC, January, 1974.

    Google Scholar 

  • Deux, O. The O2 system.Communications of the ACM, 34(10):34–48, 1991.

    Google Scholar 

  • Frisse, M.E. and Cousins, S.B. Information retrieval from Hypertext: Update on the dynamic medical handbook project.ACM Hypertext Proceedings, Pittsburgh, PA, 1989.

  • Grønbaek, K., Hem, J.A., Madsen, O.L., and Sloth, L. Cooperative hypermedia systems: A dexter-based architecture.Communications of the ACM, 37(2):64–74, 1994.

    Google Scholar 

  • Halasz, F.G., Moran, T.P., and Trigg, R.H. NoteCards in A nutshell.Proceedings of the ACM CHI+GI Conference, Toronto, Canada, 1987.

  • Halasz, F. Reflections on NoteCards: Seven issues for the next generation of hypermedia systems.Communications of the ACM 31(7):836–852, 1988.

    Google Scholar 

  • Jagadish, H.V. A compression technique to materialize transitive closure.Transactions on Database Systems, 15(4):558–598, 1990.

    Google Scholar 

  • Kapidakis, S. Average-case analysis of graph-searching algorithms. Ph.D. Thesis, Princeton University, Princeton, NJ, 1990.

    Google Scholar 

  • Lange, D.B., Østerbye, K., and Schütt, H. Hypermedia storage. Technical Report R-92-2009, The University of Aalborg, Institute for Electronic Systems, 1992.

  • Lange, D.B. Object-oriented hypermodeling of Hypertext supported information systems.Proceedings of the Twenty-sixth IEEE International Conference on System Sciences, Hawaii, 1993.

  • Lum, V.Y. Multiple-attribute retrieval with combined indexes.Communications of the ACM, 13(11):660–665, 1970.

    Google Scholar 

  • Maier, D., Stein, J., Otis, A., and Purdy, A. Development of an object-oriented DBMS.Proceedings of the ACM Object-Oriented Programming Systems, Langauges, and Applications Conference, Portland, OR, 1986.

  • Mendelzon, A.O. and Wood, P.T. Finding regular simple paths in graph databases.Proceedings of the Fifteenth International Conference on Very Large Data Bases, Amsterdam, 1989.

  • Sacks-Davis, R., Kent, A., Ramamohanarao, K., Thom, J., and Zobel, J. Atlas: A nested relational database system for text applications.IEEE Knowledge and Data Engineering, to appear.

  • Salton, G. Automatic text indexing using complex identifiers.Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, NM, 1988.

  • Salton, G.Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Reading, MA: Addison-Wesley, 1989.

    Google Scholar 

  • Salton, G., Allan, J., and Buckley, C. Automatic structuring and retrieval of large text files.Communications of the ACM, 37(2): 97–108, 1994.

    Google Scholar 

  • Schnase, J.L., Leggett, J.J., Hicks, D.L., Nuernberg P.J., and Sanchez, J.A. Design and implementation of the HB1 hyperbase management system.Electronic Publishing: Origination, Dissemination, and Design, 6(1):35–63, 1993.

    Google Scholar 

  • Schwarz, P., Chang, W., Freytag, J., Lohman, G., McPherson, J., Mohan, C., and Pirahesh, H. Extensibility in the Starburst database system.Proceedings of the International Workshop on Object Oriented Database Systems, Pacific Grove, CA 1986.

  • Smith, K.E. and Zdonik, S.B. Intermedia: A case study of the differences between relational and object-oriented database systems.Proceedings of the ACM Conference on Object Oriented Programming Systems, Languages, and Applications, Orlando, FL, 1986.

  • Stonebraker, M. Inclusion of new types in relational database systems.Proceedings of the Fourth IEEE International Conference on Data Engineering, Washington, DC, 1986.

  • Stonebraker, M. The Miro DBMS.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1993.

  • Stonebraker, M., Stettner, A., Lynn, N., Kalash, J., and Guttman, N. Document processing in a relational database system.Transactions on Office Information Systems, 1 (2):143–158, 1983.

    Google Scholar 

  • Stonebraker, M. and Rowe, L. The design of POSTGRES.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1986.

  • Ubell, M. The Montage extensible DataBladeTM architecture.Proceedings of the ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, 1994.

  • Weinreb, D., Feinberg, N., Gerson, D., and Lamb, C. An object-oriented database system to support an integrated programming environment.IEEE Data Engineering, 11(2), 1988.

  • Wiederhold, G.File Organization for Database Design. New York, NY: McGraw-Hill, 1987.

    Google Scholar 

  • Wiil, U.K. and Leggett, J.J. Hyperform: An extensible hyperbase management system. Department of Computer Science Technical Report No. TAMU-HRL 92-003, Texas A&M University, College Station, TX, 1992.

    Google Scholar 

  • Woelk, D., Kim, W., and Luther, W. An object-oriented approach to multimedia databases.Proceedings of the ACM SIGMOD International Conference on the Management of Data, Washington, DC, 1986.

  • Zdonik, S.B. Incremental database systems: Databases from the ground up.Proceedings of the ACM SIGMOD International Conference on the Management of Data, Washington, DC, 1993.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Clifton, C., Garcia-Molina, H. & Bloom, D. HyperFile: A data and query model for documents. VLDB Journal 4, 45–86 (1995). https://doi.org/10.1007/BF01232472

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01232472

Key Words

Navigation