Extraction of Cyber-Physical Systems Inventions’ Structural Elements of Russian-Language Patents
The chapter presents software for extracting predicate-argument constructions that characterizing the composition of the structural elements of the inventions from cyber-physics domain and the relationships between them. The extracted structures reconstruct the component structure of the invention in the form of a net. Such data is further converted into a domain ontology and used in the field of information support of automated invention. A new method for extracting structured data from patents has been proposed taking into account the specificity of the text of patents and is based on the shallow parsing and segmentation of sentences. The ontology scheme includes the structural elements of technical objects as the concepts and the relationship between them, as well as supporting information on the invention. The results suggest that the proposed approach is promising. A further direction of research is seen by the authors in improving the existing method for extracting data and expanding ontology.
KeywordsPatents Information extraction SAO CAI-systems Ontology
The reported study was funded by RFBR according to the research projects 18-07-01086, 19-07-01200; and was funded by RFBR and Administration of the Volgograd region according to the research projects 19-47-340007, 19-41-340016.
- 6.Tomita-parser: Developer guide. https://tech.yandex.ru/tomita/doc/dg/concept/about-docpage/ (2019)
- 7.UFAL UDPipe: http://ufal.mff.cuni.cz/udpipe (2019)
- 9.Asiryan, A.K.: Morphological tagging tools comparison. Chapter presented at the Intellectual Potential of the XXI Century ’2017, November. https://www.sworld.com.ua/konferu7-317/27.pdf (2017)
- 10.Blazhievskaya, A., et al.: Morphological Analysis for Russian: Integration and Comparison of Taggers. Analysis of Images, Social Networks and Texts, vol. 661, pp. 162–171. Springer, Berlin (2016)Google Scholar
- 11.Dereza, O.V., et al.: Automatic morphological analysis for Russian: a comparative study. In: Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference «Dialogue» (2016)Google Scholar
- 13.Ulmschneider, K., Glimm, B.: Semantic exploitation of implicit patent information. In: Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI 2016), Athens, Greece, December (2016). https://doi.org/10.1109/ssci.2016.7849943
- 14.Korobkin, D., Fomenkov, S., Golovanchikov, A.: Method of identification of patent trends based on descriptions of technical functions. J. Phys. Conf. Ser. 1015, 7 (2018)Google Scholar
- 15.Korobkin, D., Fomenkov, S., Kolesnikov, S.: The method for detecting the dependencies between technical functions and physical effects. In: Proceedings of the MCCSIS 2018, Madrid, pp. 225–228 (2018)Google Scholar