Skip to main content

Paraphrase Collocations Extraction Based on Concept Expansion

  • Conference paper
  • First Online:
Knowledge Engineering and Management

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 278))

  • 1125 Accesses

Abstract

This paper proposes a method based on concept expansion to extract paraphrase collocation. Collocations of forms of \( \left\langle {{\text{V,}}\;{\text{OBJ,}}\;{\text{N}}} \right\rangle \) (verb-object collocations) and \( \left\langle {{\text{V,}}\;{\text{SUB,}}\;{\text{N}}} \right\rangle \) (subject-predicate collocations) are extracted after syntactic analysis is done to the sentences. Then the words used in the collocations are expanded based on related words getting from concept semantic to get the candidate of paraphrase collocations. In order to filter these paraphrase collocations, following four features are chosen: part of speech feature, mutual information feature, HowNet-based semantic similarity feature, and context-based semantic similarity feature. Compared to existed method, this method does not restrict the word in paraphrase collocation to synonym. The experiment shows that every feature exploited is useful for improving the performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barzilay R, McKeown KR (2001) Extracting paraphrases from a parallel corpus. In: Proceedings of the 39th annual meeting and the 10th conference of the European chapter of association for computational linguistics (EACL) 2001, pp 50–57

    Google Scholar 

  2. Lin D, Pantel P (2001) Discovery of inference rules for question-answering. Nat Lang Eng 7(4):343–360

    Google Scholar 

  3. Sekine S (2005) Automatic paraphrase discovery based on context and keywords between NE Pairs. In: Proceedings of IWP 2005

    Google Scholar 

  4. Zhang YJ, Yamamoto K (2002) Paraphrasing of Chinese utterances. In: Proceedings of COLING. Morristown: association for computational linguistics, 2002, pp 1163–1169

    Google Scholar 

  5. Wu H, Zhou M (2003) Synonymous collocation extraction using translation information. In: Proceedings of ACL 2003, pp 120–127

    Google Scholar 

  6. Zhao S, Lin Z, Ting L, Sheng L (2010) Paraphrase collocation extraction based on binary classification. J Softw 21(6):1267–1276

    Article  Google Scholar 

  7. Zhang H (2011) Extracting structured information from the Chinese Wikepedia and measuring relatedness between words. Central China Normal University, China

    Google Scholar 

  8. Zhou K (2012) Related words of concept study based on Chinese Wikipedia. Central China Normal University, China

    Google Scholar 

  9. Zhang M, Liu M (2009) The retrieval system based on concept extending. In: 2009 Second Asia-Pacific Conference on computational intelligence and industrial applications, pp 365–368

    Google Scholar 

  10. Liu Q, Li S (2002) Lexical semantic similarity computation based on Hownet. Comput Linguist Chin Lang Process 7(2):59–76

    Google Scholar 

  11. Bannard C, Callison-Burch C (2003) Paraphrasing with bilingual parallel corpora. In: Proceedings of ACL 2003, pp 120–127

    Google Scholar 

Download references

Acknowledgment

This work was supported by the National Natural Science Foundation of China (No. 61003192), the self-determined research funds of CCNU from the colleges’ basic research and operation of MOE(No. CCNU13A05014, No. CCNU13C01001), the Major Project of State Language Commission in the Twelfth Five-year Plan Period (No. ZDI125-1), the Project in the National Science & Technology Pillar Program in the Twelfth Five-year Plan Period (No. 2012BAK24B01), the Program of Introducing Talents of Discipline to Universities (No. B07042) and the NSF of Hubei Province (No. 2011CDA034).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wang Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, M., Li, W., Zhang, H. (2014). Paraphrase Collocations Extraction Based on Concept Expansion. In: Wen, Z., Li, T. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54930-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-54930-4_19

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54929-8

  • Online ISBN: 978-3-642-54930-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics