Skip to main content

Query Enhancement for Patent Prior-Art-Search Based on Keyterm Dependency Relations and Semantic Tags

  • Conference paper
Multidisciplinary Information Retrieval (IRFC 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7356))

Included in the following conference series:

Abstract

Prior art search is one of the most common forms of patent search, whose goal is to find patent documents that constitute prior art for a given patent being examined. Current patent search systems are mostly keyword-based, and due to the unique characteristics of patents and their usage, such as embedded structure and the length of patent documents, there are rooms for further improvements. In this paper, we propose a new query formulation method by using keyword dependency relations and semantic tags, which have not been used for prior art search. The key idea of this paper is to make use of patent structure, linguistic clues and use word relations to identify important terms. Moreover, to formulate better queries we attempt to identify what technology area a patent belongs to and what problems/solutions it addresses. Based on our experiments where IPC codes are used for relevance judgments, we show that keyword dependency relation approach achieved 13~18% improvement in MAP over the traditional tf-idf based term weighting method when a single field is used for query formulation. Furthermore, we obtain 42~46% improvement in MAP when additional terms are used through pattern-based semantic tagging.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Iwayama, M., Fujii, A., Kando, N., Takano, A.: Overview of patent retrieval task at NTCIR-3. In: Proceedings of NTCIR Workshop (2009)

    Google Scholar 

  2. Fujii, A., Iwayama, M., Kando, N.: Overview of Patent Retrieval Task at NTCIR- 4. In: Proceedings of NTCIR-4 Workshop (2004)

    Google Scholar 

  3. Kim, Y., et al.: Automatic Discovery of Technology Trends from Patent. In: Proceedings of the 2009 ACM Symposium on Applied Computing, pp. 1480–1487 (2009)

    Google Scholar 

  4. Kazuya, K.: Query Term Extraction from patent documents for invalidity search. In: Proceedings of NTCIR-5 Workshop Meeting, Tokyo, Japan, December 6-9 (2005)

    Google Scholar 

  5. Roda, G., Tait, J., Piroi, F., Zenz, V.: CLEF-IP 2009: Retrieval experiments in the Intellectual Property domain. In: CLEF-IP (2009)

    Google Scholar 

  6. Susan, V., Eva, D.: Prior Art retrieval using the claims section as a bag of words. In: CLEF-IP (2010)

    Google Scholar 

  7. Toucedo, J.C., Losada, D.E.: University of Santiago de Compostela at CLEF-IP09. In: 1st CLEF-IP, Corfu, Greece (2009)

    Google Scholar 

  8. Xiaobing, X., Bruce Croft, W.: Transforming Patents into Prior Art Queries. In: SIGIR 2009 (2010)

    Google Scholar 

  9. Metti, Z., et al.: Prior art retrieval using various patent document fields contents. In: CLEF-IP (2010)

    Google Scholar 

  10. Mai, F.-D., Hwang, F., Chien, K.-M., Wang, Y.-M., Chen, C.-Y.: Patent map and analysis of carbon nanotube. Science and Technology Information Center, National Science Council, ROC (2002)

    Google Scholar 

  11. Young Gil, K., et al.: Visualization of patent analysis for emerging technology. Expert Systems with Applications: An International Journal archive 34(3) (April 2008)

    Google Scholar 

  12. Lent, B., et al.: Discovering trends in text databases. In: Proc. 3rd Int. Conf. Knowledge Discovery and Data Mining, KDD, pp. 227–230 (1997)

    Google Scholar 

  13. The Lemur Toolkit, http://www.lemurproject.org

  14. Takaki, et al.: Associative Document Retrieval by Query Subtopic Analysis and its Application to Invalidity Patent Search. In: Proceedings of CIKM (2004)

    Google Scholar 

  15. Zheng, W., Zhang, Y., Hong, Y., Fan, J., Liu, T.: Topic Tracking Based on Keywords Dependency Profile. In: Li, H., Liu, T., Ma, W.-Y., Sakai, T., Wong, K.-F., Zhou, G. (eds.) AIRS 2008. LNCS, vol. 4993, pp. 129–140. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  16. The USPTO databased, http://www.uspto.gov/

  17. Kim, J.-H., et al.: Patent document categorization based on semantic structural information. Information Processing and Management (2007)

    Google Scholar 

  18. Xue, X., Bruce Croft, W.: Automatic Query Generation for Patent Search. In: Proceeding of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009 (2009)

    Google Scholar 

  19. Lupu, M., Mayer, K., Tait, J., Trippe, A.J.: Current Challenges in Patent Information Retrieval. The Information Retrieval Series 29 (2011)

    Google Scholar 

  20. Hunt, D., Nguyen, L., Rodgers, M.: Patent searching: tools & techniques (2007)

    Google Scholar 

  21. trect_eval program at TRECT website, trec.nist.gov/trec_eval

  22. Open NLP POStagger, http://opennlp.sourceforge.net/

  23. van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979), http://www.dcs.gla.ac.uk/Keith/Preface.html

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nguyen, KL., Myaeng, SH. (2012). Query Enhancement for Patent Prior-Art-Search Based on Keyterm Dependency Relations and Semantic Tags. In: Salampasis, M., Larsen, B. (eds) Multidisciplinary Information Retrieval. IRFC 2012. Lecture Notes in Computer Science, vol 7356. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31274-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31274-8_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31273-1

  • Online ISBN: 978-3-642-31274-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics