Relevance Feedback Document Retrieval Using Support Vector Machines

  • Conference paper
Active Mining

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3430)

Abstract

We investigate the following data mining problem in document retrieval: given a large set of documents, find the documents that relate to a user's interest in as few iterations of human testing or checking as possible. In each iteration, a comparatively small batch of documents is evaluated for relevance to that interest. We apply active learning techniques based on Support Vector Machines for evaluating successive batches, a process called relevance feedback. In our experiments, the proposed approach has been very useful for document retrieval with relevance feedback. In this paper, we adopt several representations of the Vector Space Model and several rules for selecting the documents displayed at each iteration, and compare the effectiveness of document retrieval across these settings.
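The loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy corpus, the TF-IDF weighting, the subgradient-descent trainer (standing in for a real SVM solver), and all function names are assumptions chosen for illustration. It shows one Vector Space Model representation and two possible selection rules, one picking the documents nearest the decision boundary and one picking the top-ranked documents.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Unit-length TF-IDF dicts: one assumed Vector Space Model variant."""
    n = len(docs)
    df = Counter(t for doc in docs for t in set(doc))
    vectors = []
    for doc in docs:
        raw = {t: c * (math.log(n / df[t]) + 1.0) for t, c in Counter(doc).items()}
        norm = math.sqrt(sum(v * v for v in raw.values())) or 1.0
        vectors.append({t: v / norm for t, v in raw.items()})
    return vectors

def dot(w, x):
    return sum(w.get(t, 0.0) * v for t, v in x.items())

def train_linear_svm(xs, ys, lam=0.01, eta=0.1, epochs=200):
    """Hinge-loss linear classifier trained by plain subgradient descent.
    A simple stand-in for an SVM solver, not the solver used in the paper."""
    w, b = {}, 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            margin = y * (dot(w, x) + b)
            for t in w:                      # L2 regularization shrinks weights
                w[t] *= 1.0 - eta * lam
            if margin < 1.0:                 # hinge loss is active for this sample
                for t, v in x.items():
                    w[t] = w.get(t, 0.0) + eta * y * v
                b += eta * y
    return w, b

def select_near_boundary(scores, k):
    """Selection rule A: the k unlabeled documents closest to the hyperplane."""
    return sorted(scores, key=lambda i: (abs(scores[i]), i))[:k]

def select_top_ranked(scores, k):
    """Selection rule B: the k unlabeled documents with the highest score."""
    return sorted(scores, key=lambda i: (-scores[i], i))[:k]

def relevance_feedback(vectors, judge, seed, rounds, k, select):
    """Each round: train, score unlabeled documents, show k of them, collect judgments."""
    labeled = dict(seed)
    for _ in range(rounds):
        ids = sorted(labeled)
        w, b = train_linear_svm([vectors[i] for i in ids], [labeled[i] for i in ids])
        scores = {i: dot(w, v) + b for i, v in enumerate(vectors) if i not in labeled}
        for i in select(scores, k):
            labeled[i] = judge(i)            # human relevance judgment (+1 or -1)
    ids = sorted(labeled)
    w, b = train_linear_svm([vectors[i] for i in ids], [labeled[i] for i in ids])
    return labeled, w, b

# Hypothetical toy corpus; documents 0, 2 and 4 play the role of relevant ones.
corpus = [
    "svm kernel margin classifier",
    "soccer goal league match",
    "kernel margin support vector",
    "league match referee penalty",
    "classifier svm vector training",
    "goal penalty striker soccer",
]
relevant = {0, 2, 4}
vectors = tfidf_vectors([text.split() for text in corpus])
judge = lambda i: 1 if i in relevant else -1   # simulated user feedback
labeled, w, b = relevance_feedback(
    vectors, judge, seed={0: 1, 1: -1}, rounds=2, k=1, select=select_near_boundary
)
ranking = sorted(range(len(corpus)), key=lambda i: -(dot(w, vectors[i]) + b))
print("judged documents:", sorted(labeled))
print("final ranking:", ranking)
```

Passing `select_top_ranked` instead of `select_near_boundary` changes only which documents are shown to the user in each round, which is the kind of comparison the abstract describes.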


Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Onoda, T., Murata, H., Yamada, S. (2005). Relevance Feedback Document Retrieval Using Support Vector Machines. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds) Active Mining. Lecture Notes in Computer Science, vol 3430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423270_4

  • DOI: https://doi.org/10.1007/11423270_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26157-5

  • Online ISBN: 978-3-540-31933-7

  • eBook Packages: Computer Science, Computer Science (R0)
