References
Baker P, Hardie A, McEnery A, Cunningham H, Gaizauskas R (2002) EMILLE, a 67-million word corpus of Indic languages: data collection, mark-up and harmonisation. In: Proceedings of the conference on language resources and evaluation, Gran Canaria, Spain, 29–31 May 2002, pp 819–825
Cunningham H (2002) GATE, a general architecture for text engineering. Comput Humanit 36:223–254
Declerck T, Wittenberg P, Cunningham H (2001) The automatic generation of formal annotations in a multimedia indexing and searching environment. In: Proceedings of the ACL/EACL workshop on human language technology and knowledge management, Toulouse, France, 6–7 July 2001, pp 129–136
Hearst MA (1999) Untangling text mining. In: Proceedings of the annual meeting of the Association for Computational Linguistics, University of Maryland, Baltimore, June 1999
Paynter GW, Witten IH (2001) A combined phrase and thesaurus browser for large document collections. In: Proceedings of the European conference on digital libraries, Darmstadt, Germany, 4–9 September 2001, pp 25–36
Tablan V, Ursu C, Bontcheva K, Cunningham H, Maynard D, Hamza O, McEnery A, Baker P, Leisher M (2002) A Unicode-based environment for creation and use of language resources. In: Proceedings of the conference on language resources and evaluation, Gran Canaria, Spain, 29–31 May 2002, pp 66–71
Witten IH, Bainbridge D (2003) How to build a digital library. Morgan Kaufmann, San Francisco
Witten IH (in press) Text mining. In: Singh MP (ed) Practical handbook of Internet computing. CRC Press, Boca Raton, FL
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Witten, I., Don, K., Dewsnip, M. et al. Text mining in a digital library. Int J Digit Libr 4, 56–59 (2004). https://doi.org/10.1007/s00799-003-0066-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-003-0066-4