Extracting Semantic Information from Chinese Language Patent Claims

  • Yong Tang
  • Shihan Yang
  • Jiwen Chai
  • Shanmei Liu
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 180)


A big challenge for automatically analyzing patent claims written in Chinese language is how to obtain semantically information from claims which are usually a non-structured, but free text. In this paper, a set of techniques is been provided to extract some valuable semantically information from claims in Chinese language. The method could automatically discovery some usable semantically information from the patent claim texts by means of regular expression pattern and predefined ontology model. It can extract not only surface semantic information but also deeper semantic information. Furthermore, the extracted semantically information is automatically translated into web ontology language (OWL), a machine readable semantic structure specification. The work proposes a potential semantic solution in Chinese language patent analyzing based on patent claims, such as semantic searching, legal status and invalidation checking, and discovering new technique trends. A case study on electronic engineering domain patent claims is also provided.


Semantic Information Extraction Patent Claim Chinese Language Patent Automatically Analyzing Ontology 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Hobbs, J.R., Appelt, D.E., Bear, J., Israel, D.J., Kameyama, M., Stickel, M.E., Tyson, M.: Fastus: A cascaded finite-state transducer for extracting information from natural-language text. CoRR, vol. cmplg/ 9705013 (1997)Google Scholar
  2. 2.
    Yang, S.-Y., Lin, S.-Y., Lin, S.-N., Lee, C.-F., Cheng, S.-L., Soo, V.-W.: Automatic extraction of semantic relations from patent claims (2008)Google Scholar
  3. 3.
    Qingying, Q., Guomin, Z., Pei’en, F., Jianwei, W.: Extraction approach of patent information based on regular expression. Journal of Chinese Machine Engineering 18(19), 2326–2329 (2007) (in Chinese)Google Scholar
  4. 4.
    Yu-qin, L., Xue-feng, W., Xiao-ping, L.: Quality estimation of patent based on text mining and its empirical research. Computer Engineering and Applications 43(33), 12–14 (2007) (in Chinese)Google Scholar
  5. 5.
    Zhang, H.-P., Yu, H.-K., Xiong, D.-Y., Liu, Q.: Hhmm-based chinese lexical analyzer ictclas. In: Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, SIGHAN 2003, vol. 17, pp. 184–187. Association for Computational Linguistics, Stroudsburg (2003)CrossRefGoogle Scholar
  6. 6.
  7. 7.
    Fine, S., Singer, Y., Tishby, N.: The hierarchical hidden markov model: Analysis and applications. Mach. Learn. 32, 41–62 (1998), MATHCrossRefGoogle Scholar
  8. 8.
    Reinhold, M., et al.: Jsr 51: New i/o apis for the java platform (2002),
  9. 9.
    Motik, B., Grau, B.C., Horrocks, I., Wu, Z., Fokoue, A., Lutz, C.: Owl2 web ontology language profiles, (2009)
  10. 10.
    Yang, S., Wu, J.: Mapping relational databases into ontologies through a graph-based formal model. In: International Conference on Semantics, Knowledge and Grid, pp. 219–226 (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Yong Tang
    • 1
  • Shihan Yang
    • 2
  • Jiwen Chai
    • 1
  • Shanmei Liu
    • 1
  1. 1.Sichuan Electric Power Research InstituteChengduChina
  2. 2.Chengdu Documentation and Information Center, CASChengduChina

Personalised recommendations