Skip to main content

Patent Retrieval Based on Multiple Information Resources

  • Conference paper
  • First Online:
Information Retrieval Technology (AIRS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9994))

Included in the following conference series:

Abstract

Query expansion methods have been proven to be effective to improve the average performance of patent retrieval, and most of query expansion methods use single source of information for query expansion term selection. In this paper, we propose a method which exploits external resources for improving patent retrieval. Google search engine and Derwent World Patents Index were used as external resources to enhance the performance of query expansion methods. LambdaRank was employed to improve patent retrieval performance by combining different query expansion methods with different text fields weighting strategies of different resources. Experiments on TREC data sets showed that our combination of multiple information sources for query formulation was more effective than using any single source to improve patent retrieval performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Xu, J., Croft, W.B.: Query expansion using local and global document analysis. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 4–11. ACM (1996)

    Google Scholar 

  2. Cronen-Townsend, S., Zhou, Y., Croft, W.B.: A framework for selective query expansion. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 236–237. ACM (2004)

    Google Scholar 

  3. Metzler, D., Croft, W.B.: Latent concept expansion using Markov random fields. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 311–318. ACM (2007)

    Google Scholar 

  4. Tao, T., Zhai, C.: Regularized estimation of mixture models for robust pseudo-relevance feedback. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 162–169. ACM (2006)

    Google Scholar 

  5. Lee, K.S., Croft, W.B., Allan, J.: A cluster-based resampling method for pseudo-relevance feedback. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 235–242. ACM (2008)

    Google Scholar 

  6. Magdy, W., Jones, G.J.: An efficient method for using machine translation technologies in cross-language patent search. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1925–1928. ACM (2011)

    Google Scholar 

  7. Ganguly, D., Leveling, J., Magdy, W., et al.: Patent query reduction using pseudo relevance feedback. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1953–1956. ACM (2011)

    Google Scholar 

  8. Leveling, J., Magdy, W., Jones, G.J.: An investigation of decompounding for cross-language patent search. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1169–1170. ACM (2011)

    Google Scholar 

  9. Xu, K., Liu, W., Lin, H., et al.: Patent query expansion using text fields. J. Comput. Inf. Syst. 8(13), 5607–5614 (2012)

    Google Scholar 

  10. Xu, K., Lin, H., Liu, W., et al.: Learning to rank based query expansion for patent retrieval. J. Comput. Inf. Syst. 9(13), 5387–5394 (2013)

    Google Scholar 

  11. Diaz, F., Metzler, D.: Improving the estimation of relevance models using large external corpora. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 154–161. ACM (2006)

    Google Scholar 

  12. Lin, Y., Lin, H., Jin, S., et al.: Social annotation in query expansion: a machine learning approach. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 405–414. ACM (2011)

    Google Scholar 

  13. Xu, Y., Jones, G.J., Wang, B.: Query dependent pseudo-relevance feedback based on wikipedia. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 59–66. ACM (2009)

    Google Scholar 

  14. Liu, T.-Y.: Learning to rank for information retrieval. Found. Trends Inf. Retrieval 3(3), 225–331 (2009)

    Article  Google Scholar 

  15. Konishi, K.: Query terms extraction from patent document for invalidity search

    Google Scholar 

  16. Itoh, H., Mano, H., Ogawa, Y.: Term distillation in patent retrieval. In: Proceedings of the ACL-2003 Workshop on Patent Corpus Processing, vol. 20, pp. 41–45. Association for Computational Linguistics (2003)

    Google Scholar 

  17. Xue, X., Croft, W.B.: Transforming patents into prior-art queries. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 808–809. ACM (2009)

    Google Scholar 

  18. Liu, S.-H., Liao, H.-L., Pi, S.-M., et al.: Development of a patent retrieval and analysis platform–a hybrid approach. Expert Syst. Appl. 38(6), 7864–7868 (2011)

    Article  Google Scholar 

  19. Mahdabi, P., Crestani, F.: The effect of citation analysis on query expansion for patent retrieval. Inf. Retrieval 17(5–6), 412–429 (2014)

    Article  Google Scholar 

  20. Chen, Y.-L., Chiu, Y.-T.: An IPC-based vector space model for patent retrieval. Inf. Process. Manag. 47(3), 309–322 (2011)

    Article  Google Scholar 

  21. Rusiñol, M., de las Heras, L.-P., Terrades, O.R.: Flowchart recognition for non-textual information retrieval in patent search. Inf. Retrieval 17(5–6), 545–562 (2014)

    Article  Google Scholar 

  22. Wanagiri, M.Z., Adriani, M.: Prior art retrieval using various patent document fields contents. In: CLEF (Notebook Papers/LABs/Workshops) (2010)

    Google Scholar 

  23. Rocchio, J.J.: Relevance feedback in information retrieval. In: Proceedings of the Smart Retrieval System, pp. 313–323 (1971)

    Google Scholar 

  24. Robertson, S.E., Walker, S., Beaulieu, M., et al.: Okapi at TREC-4. In: Proceedings of the Fourth Text Retrieval Conference, pp. 73–97. NIST Special Publication (1996)

    Google Scholar 

  25. Lavrenko, V., Croft, W.B.: Relevance based language models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 120–127. ACM (2001)

    Google Scholar 

  26. Zhai, C., Lafferty, J.: Model-based feedback in the language modeling approach to information retrieval. In: Proceedings of the Tenth International Conference on Information and Knowledge Management, pp. 403–410. ACM (2001)

    Google Scholar 

  27. Yeung, D.L., Clarke, C.L., Cormack, G.V., et al.: Task-specific query expansion (MultiText Experiments for TREC 2003). In: TREC, pp. 810–819 (2003)

    Google Scholar 

  28. Wu, Q., Burges, C.J., Svore, K.M., et al.: Adapting boosting for information retrieval measures. Inf. Retrieval 13(3), 254–270 (2010)

    Article  Google Scholar 

  29. Salton, G., Wong, A., Yang, C.-S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)

    Article  MATH  Google Scholar 

  30. Robertson, S., Zaragoza, H., Taylor, M.: Simple BM25 extension to multiple weighted fields. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 42–49. ACM (2004)

    Google Scholar 

  31. Amati, G.: Probability Models for Information Retrieval Based on Divergence from Randomness. University of Glasgow (2003)

    Google Scholar 

Download references

Acknowledgement

This work is partially supported by grant from the Natural Science Foundation of China (No. 61272370, 61402075, 61572102, 61572098, 61272373), Natural Science Foundation of Liaoning Province, China (No. 201202031, 2014020003), State Education Ministry and The Research Fund for the Doctoral Program of Higher Education (No. 20090041110002), the Fundamental Research Funds for the Central Universities.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongfei Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Xu, K., Lin, H., Lin, Y., Xu, B., Yang, L., Zhang, S. (2016). Patent Retrieval Based on Multiple Information Resources. In: Ma, S., et al. Information Retrieval Technology. AIRS 2016. Lecture Notes in Computer Science(), vol 9994. Springer, Cham. https://doi.org/10.1007/978-3-319-48051-0_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48051-0_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48050-3

  • Online ISBN: 978-3-319-48051-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics