Stress-Testing General Purpose Digital Library Software
DSpace, Fedora, and Greenstone are three widely used open source digital library systems. In this paper we report on scalability tests performed on these tools by ourselves and others. These range from repositories populated with synthetically produced data to real world deployment with content measured in millions of items. A case study is presented that details how one of the systems performed when used to produce fully-searchable newspaper collections containing in excess of 20 GB of raw text (2 billion words, with 60 million unique terms), 50 GB of metadata, and 570 GB of images.
KeywordsDigital Library Unique Term Newspaper Page Server Response Time Digital Library System
Unable to display preview. Download preview PDF.
- 2.Littman, J.: Technical approach and distributed model for validation of digital objects. D-Lib Magazine 12(5) (2006)Google Scholar
- 3.Misr, D., Seamans, J., Thoma, G.R.: Testing the scalability of a DSpace-based archive. Technical report, National Library of Medicine, Bethesda, Maryland, USA (2007)Google Scholar
- 6.Smith, M., Bass, M., McClella, G., Tansley, R., Barton, M., Branschofsky, M., Stuve, D., Walker, J.: DSpace: An open source dynamic digital repository. D-Lib Magazine 9(1) (2003), doi:10.1045/january2003-smithGoogle Scholar