Abstract
This work belongs to the domain of technical information retrieval (IR) and, more specifically, patent retrieval. We show that the recorded history of patent examiner’s search queries can be used to create a more effective method of finding prior art patents than search methods based on titles and claims. We verify the performance of the proposed method experimentally. Our experiments show that we can almost double the recall measure, compared to classical techniques based on titles and claims. The other contribution of our work is the creation of a database of over half a million patent examiners queries (recorded search activity over the patents prosecution process). The paper also discusses the limitations of the current work and the ongoing research to further improve the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agatonovic, M., Aswani, N., Bontcheva, K., Cunningham, H., Heitz, T., Li, Y., Roberts, I., Tablan, V.: Large-scale, parallel automatic patent annotation. In: Proceedings of 1st International ACM Workshop on Patent Information Retrieval—PaIR’08, pp. 1–8. California (2008)
Apache Lucene—Query Parser Syntax : Boosting a Term (2013). Accessed from http://lucene.apache.org/core/2_9_4/queryparsersyntax.html
Apache Lucene—Scoring (2013). Accessed from http://lucene.apache.org/core/3_5_0/scoring.html
Azzopardi, L., Vanderbauwhede, W., Joho, H.: Search system requirements of patent analysts. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 775–776. ACM, New York (2010)
Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval —The Concepts and Technology Behind Search, 2nd edn. Pearson Education Ltd., Harlow (2011)
Bashir, S., Rauber, A.: Improving retrievability of patents in prior-art search. In: Advances in Information Retrieval, pp. 457–470. Springer, Heidelberg (2010)
Bravo-Marquez, F., L‘Huillier, G., Ros, S.A., Velsquez, J.D.: A text similarity meta-search engine based on document fingerprints and search results records. In: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology-, vol. 01, pp. 146–153. IEEE Computer Society, Washington (2011)
Callaert, J., Van Looy, B., Verbeek, A., Debackere, K., Thijs, B.: Traces of prior art: an analysis of non-patent references found in patent documents. Scientometrics 69(1), 3–20 (2006)
Carpineto, C., Romano, G.: A survey of automatic query expansion in information retrieval. ACM. Comput. Surv. 44(1), 1–56 (2012)
Fujii, A.: Enhancing patent retrieval by citation analysis. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 793–794. The Netherlands, Amsterdam (2007)
Ganguly, D., Leveling, J., Jones, G.J.F.: United we fall, divided we stand: a study of query segmentation and PRF for patent prior art search. In: Proceedings of the 4th Workshop on Patent Information Retrieval, pp. 13–18. ACM, New York (2011)
Jansen, B.J., Booth, D.L., Spink, A.: Determining the user intent of web search engine queries. In: Proceedings of the 16th International Conference on World Wide Web, pp. 1149–1150. Banff, Alberta (2007)
Jiang, D., Pei, J., Li, H.: (2013). Mining search and browse logs for web search: a survey. ACM Trans. Intell. Syst. Technol. 4(4), 57:1–57:37
Joho, H., Azzopardi, L.A., Vanderbauwhede, W.: A survey of patent users: an analysis of tasks, behavior, search functionality and system requirements. In: Proceedings of the Third Symposium on Information Interaction in Context, pp. 13–24 (2010)
Jürgens, J., Hansen, P., Womser-Hacker, Ch.: Going beyond CLEF-IP: The reality for patent searchers? In: Catarci T., Forner P., Hiemstra D., PeAas A., Santucci G. (eds.) Information Access valuation. Multilinguality, Multimodality, and Visual Analytics, pp. 30–35 Springer, Heidelberg (2012)
Kim, Y., Seo, J., Croft, W.B.: Automatic boolean query suggestion for professional search. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 825–834. Beijing, China (2011)
Lupu, M.: Patent information retrieval: an instance of domain-specific search. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1189–1190 (2012)
Magdy, W., Jones, G.J.F.: A study on query expansion methods for patent retrieval. In: Proceedings of the 4th Workshop on Patent Information Retrieval, pp. 19–24 (2011)
Mahdabi, P., Andersson, L., Keikha, M., Crestani, F.: Automatic refinement of patent queries using concept importance predictors. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 505–514 (2012)
Nguyen, K.L., Myaeng, S.H.: Query enhancement for patent prior-art-search based on keyterm dependency relations and semantic tags. In: Larsen, Salampasis B. (ed.) Multidisciplinary Information Retrieval, pp. 28–42. Springer, Heidelberg (2012)
Oh, S., Lei, Z., Lee, W.C., Mitra, P., Yen, J.: CV-PCR: A context-guided value-driven framework for patent citation recommendation. In: Proceedings of the 22nd ACM International Conference on Conference on Information; Knowledge Management, pp. 2291–2296 (2013)
Prez-Iglesias, J., Prez-Agüera, J.R. Fresno, V., Feinstein, Y.Z.: Integrating the Probabilistic Models BM25/BM25F into Lucene. CoRR (2009)
Potey, M.A., Patel, D.A., Sinha, P.K.: A survey of query log processing techniques and evaluation of web query intent identification. In: Advance Computing Conference (IACC), 2013 IEEE 3rd International, pp. 1330–1335 (2013)
Sormunen, E.: A novel method for the evaluation of boolean query effectiveness across a wide operational range. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Special Issue of SIGIR Forum 34, pp. 25–32 (2000)
Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. J. Doc. 28, 11–21 (1972)
Takaki, T., Fujii, A., Ishikawa, T.: Associative document retrieval by query subtopic analysis and Its application to invalidity patent search. In: Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, pp. 399–405 (2004)
Tannebaum, W., Rauber, A.: Mining query logs of USPTO patent examiners. In: Forner P., Mller H., Paredes R., Rosso P., Stein B. (eds.) Information Access Evaluation. Multilinguality, Multimodality, and Visualization. Springer, Heidelberg pp. 136–142 (2013)
Tannebaum, W., Rauber, A.: Analyzing query logs of USPTO examiners to identify useful query terms in patent documents for query expansion in patent searching: a preliminary study. In: Proceedings of the 5th Conference on Multidisciplinary Information Retrieval, 127–136 (2012)
Tiwana, S., Horowitz, E.: Findcite: automatically finding prior art patents. In: Proceedings of the 2nd International Workshop on Patent Information Retrieval (2009)
Von Wartburg, I., Teichert, T., Rost, K.: Inventive progress measured by multi-stage patent citation analysis. Res. Policy. 34(10), 1591–1607 (2005)
Wu, H.C., Chen, H.Y., Lee, K.Y., Liu, Y.C.: A method for assessing patent similarity using direct and indirect citation links. In: Proceedings of the 2010 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), pp. 149–152 (2010)
Xue, X., Croft, W.B.: Automatic query generation for patent search. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 2037–2040 (2009)
Xue, X., Croft, W.B.: Transforming patents into prior-art queries. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 808–809 (2009)
Zhao, X., Xiao, C., Lin, X., Wang, W., Ishikawa, Y.: Efficient processing of graph similarity queries with edit distance constraints. VLDB J. 22(6), 727–752 (2013)
Acknowledgments
The first author contribution is supported by the Foundation for Polish Science under International PhD Projects in Intelligent Computing. Project financed from The European Union within the Innovative Economy Operational Programme (2007–2013) and European Regional Development Fund.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Wajda, J., Zadrozny, W. (2016). Prior-Art Relevance Ranking Based on the Examiner’s Query Log Content. In: Trė, G., Grzegorzewski, P., Kacprzyk, J., Owsiński, J., Penczek, W., Zadrożny, S. (eds) Challenging Problems and Solutions in Intelligent Systems. Studies in Computational Intelligence, vol 634. Springer, Cham. https://doi.org/10.1007/978-3-319-30165-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-30165-5_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30164-8
Online ISBN: 978-3-319-30165-5
eBook Packages: EngineeringEngineering (R0)