The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives
Publications are essential for scientific communication. Access to publications is provided by conventional libraries, digital libraries operated by learned societies or commercial publishers, and a huge number of web sites maintained by the scientists themselves or their institutions. Comprehensive meta-indices for this increasing number of information sources are missing for most areas of science. The DBLP Computer Science Bibliography of the University of Trier has grown from a very specialized small collection of bibliographic information to a major part of the infrastructure used by thousands of computer scientists. This short paper first reports the history of DBLP and sketches the very simple software behind the service. The most time-consuming task for the maintainers of DBLP may be viewed as a special instance of the authority control problem: how to normalize different spellings of person names. The third section of the paper discusses some details of this problem which might be an interesting research issue for the information retrieval community.
KeywordsDigital Library Huffman Code Bibliographic Record Query String Path Element
Unable to display preview. Download preview PDF.
- [L]Michael Ley: Die Trierer Informatik-Bibliographie DBLP. GI-Jahrestagung 1997: 257–266.Google Scholar
- [LS]Hartmut Liefke, Dan Suciu: XMill: an Efficient Compressor for XML Data. SIGMOD Conf. 2000, 153–164.Google Scholar
- [PG]Neoklis Polyzotis, Minos N. Garofalakis: Statistical Synopses for Graph-Structured XML Databases. SIGMOD Conf. 2002.Google Scholar
- [WMB]Ian H. Witten, Alistair Moffat, Timothy C. Bell: Managing Gigabytes, Compressing and Indexing Documents and Images, 2nd Ed., Morgan Kaufmann, 1999.Google Scholar