A library application on top of an RDBMS: Performance aspects
Applications which require a combination of structured data with unstructured text fields are becoming of increasing practical interest. But whereas structured data are usually stored in a relational database, large text collections are maintained by proprietary text or information retrieval systems. The synthesis of both areas is still a topic of intensive research. We describe one such application, namely maintaining library catalogues, and study the efficiency of two implementation alternatives both based on RDBMS technology. In the first alternative word occurrence information is encoded using bitlists. The other chooses a direct implementation within the relational model. Performance tests are done which are based on real world data and real world user transactions. They demonstrate that the problem of the bitlist implementation is caused by conversions which are necessary to combine them with structured data. In contrast, our direct implementation benefits from today's sophisticated RDBMS technology and performs promisingly well.
Unable to display preview. Download preview PDF.
- 1.O. Balownew, T. Bode, A.B. Cremers, J. Kalinski, J.E. Wolff, and H. Rottmann. Maintaing Library Catalogues with an RDBMS — A Performance Study —. Technical Report IAI-TR-96-13, University of Bonn, November 1996.Google Scholar
- 2.S. DeFazio, A. Daoud, L.A. Smith, and J. Srinivasan. Integrating IR and RDBMS using cooperative indexing. In Proc. of the 18th Annual Int. SIGIR Conf. on Research and Development in Information Retrieval, pages 84–92, 1995.Google Scholar
- 3.Deutsche Forschungsgemeinschaft — Bibliotheksausschuß. Empfehlungen zur Migration der deutschen Bibliotheksverbiinde. ZfBB, 42(2):105–136, 1995.Google Scholar
- 4.W.B. Frakes and R. Baeza-Yates, editors. Information Retrieval — Data Structures and Algorithms. Prentice Hall, 1992.Google Scholar
- 5.Jürgen Freitag, Horst-Dieter Werner, and Wolfgang Wilkes. Strukturierte Attribute in Relationen zur Unterstützung von IR-Anwendungen. In GI 12. Jahrestagung, Informatik-Fachberichte 57, pages 623–647. Springer, 1982.Google Scholar
- 6.Graham Hoare. Oracle TextServer3 — Workbench C Guide. Oracle Corp., 500 Oracle Parkway, Redwood City, CA 94065, 1995. Version 1.0, Part No. A22190-1.Google Scholar
- 7.H. Kaufmann and H.-J. Schek. Text Search Using Database Systems Revisited — Some Experiments —. In C.A. Goble and J.A. Keane, editors, Proc. of the 13th British National Conference on Databases (BNCOD 13), pages 204–225. Springer, LNCS 940, 1995.Google Scholar
- 8.Ian A. Macleod. Text retrieval and the relational model. Journal of the American Society for Information Science, 42(3):155–165, 1991.Google Scholar
- 9.National Institute of Standards and Technology. Proceedings of the Fourth Text REtrieval Conference (TREC-4), Gaithersburg, Md., November 1995.Google Scholar
- 10.J. Newton and D.Y. Brenman. Oracle TextServer3 — Administrator's Guide. Oracle Corp., 500 Oracle Parkway, Redwood City, CA 94065, 1995. Version 3.0, Part No. A22191-1.Google Scholar
- 11.Hans-Jörg Schek. Methods for the administration of textual data in database systems. In R.N. Oddy, S.E. Robertson, C.J. van Rijsbergen, and P.W. Williams, editors, Information Retrieval Research, pages 218–235. Butterworths, 1981.Google Scholar