COLLAGE: An NLP Toolset to Support Boolean Retrieval


Part of the book series: Text, Speech and Language Technology (TLTB, volume 7)

Abstract

COLLAGE is a collection of processes and methods that automatically analyze topics expressed in natural language. The results of this analysis determine which NLP resources are applied to convert each part of a topic into a set of Boolean queries, and how the document lists returned by each query are combined into a final ranked list of documents.
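
The abstract does not say how the per-query document lists are combined, so the following is only a minimal sketch of one plausible scheme, not COLLAGE's actual method. It assumes each Boolean query is a conjunction of terms with a hand-assigned weight, and that a document's score is the sum of the weights of the queries it satisfies. The corpus, the weights, and the helper names `build_index`, `run_and_query`, and `combine` are all hypothetical.

```python
from collections import defaultdict

# Toy corpus (hypothetical, for illustration only).
DOCS = {
    "d1": "oil spill damages coastal wildlife",
    "d2": "tanker accident causes oil spill near port",
    "d3": "port authority reports record shipping traffic",
}


def build_index(docs):
    """Map each term to the set of documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def run_and_query(index, terms):
    """Boolean AND: return the documents that contain every term."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


def combine(index, weighted_queries):
    """Assumed combination rule: sum the weight of each satisfied query,
    then rank by descending score (ties broken by document id)."""
    scores = defaultdict(float)
    for terms, weight in weighted_queries:
        for doc_id in run_and_query(index, terms):
            scores[doc_id] += weight
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


index = build_index(DOCS)
# Each part of a topic yields one Boolean query with an assumed weight.
queries = [(["oil", "spill"], 2.0), (["tanker"], 1.0), (["port"], 0.5)]
ranked = combine(index, queries)
print(ranked)
```

In this sketch a document that matches several of the queries derived from a topic rises above one that matches only a single query, which is one simple way to turn unranked Boolean result sets into a ranked list.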



Copyright information

© 1999 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Cowie, J. (1999). COLLAGE: An NLP Toolset to Support Boolean Retrieval. In: Strzalkowski, T. (ed.) Natural Language Information Retrieval. Text, Speech and Language Technology, vol 7. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-2388-6_11

  • DOI: https://doi.org/10.1007/978-94-017-2388-6_11

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-5209-4

  • Online ISBN: 978-94-017-2388-6

  • eBook Packages: Springer Book Archive
