Abstract
This paper proposes a method based on concept expansion to extract paraphrase collocation. Collocations of forms of \( \left\langle {{\text{V,}}\;{\text{OBJ,}}\;{\text{N}}} \right\rangle \) (verb-object collocations) and \( \left\langle {{\text{V,}}\;{\text{SUB,}}\;{\text{N}}} \right\rangle \) (subject-predicate collocations) are extracted after syntactic analysis is done to the sentences. Then the words used in the collocations are expanded based on related words getting from concept semantic to get the candidate of paraphrase collocations. In order to filter these paraphrase collocations, following four features are chosen: part of speech feature, mutual information feature, HowNet-based semantic similarity feature, and context-based semantic similarity feature. Compared to existed method, this method does not restrict the word in paraphrase collocation to synonym. The experiment shows that every feature exploited is useful for improving the performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barzilay R, McKeown KR (2001) Extracting paraphrases from a parallel corpus. In: Proceedings of the 39th annual meeting and the 10th conference of the European chapter of association for computational linguistics (EACL) 2001, pp 50–57
Lin D, Pantel P (2001) Discovery of inference rules for question-answering. Nat Lang Eng 7(4):343–360
Sekine S (2005) Automatic paraphrase discovery based on context and keywords between NE Pairs. In: Proceedings of IWP 2005
Zhang YJ, Yamamoto K (2002) Paraphrasing of Chinese utterances. In: Proceedings of COLING. Morristown: association for computational linguistics, 2002, pp 1163–1169
Wu H, Zhou M (2003) Synonymous collocation extraction using translation information. In: Proceedings of ACL 2003, pp 120–127
Zhao S, Lin Z, Ting L, Sheng L (2010) Paraphrase collocation extraction based on binary classification. J Softw 21(6):1267–1276
Zhang H (2011) Extracting structured information from the Chinese Wikepedia and measuring relatedness between words. Central China Normal University, China
Zhou K (2012) Related words of concept study based on Chinese Wikipedia. Central China Normal University, China
Zhang M, Liu M (2009) The retrieval system based on concept extending. In: 2009 Second Asia-Pacific Conference on computational intelligence and industrial applications, pp 365–368
Liu Q, Li S (2002) Lexical semantic similarity computation based on Hownet. Comput Linguist Chin Lang Process 7(2):59–76
Bannard C, Callison-Burch C (2003) Paraphrasing with bilingual parallel corpora. In: Proceedings of ACL 2003, pp 120–127
Acknowledgment
This work was supported by the National Natural Science Foundation of China (No. 61003192), the self-determined research funds of CCNU from the colleges’ basic research and operation of MOE(No. CCNU13A05014, No. CCNU13C01001), the Major Project of State Language Commission in the Twelfth Five-year Plan Period (No. ZDI125-1), the Project in the National Science & Technology Pillar Program in the Twelfth Five-year Plan Period (No. 2012BAK24B01), the Program of Introducing Talents of Discipline to Universities (No. B07042) and the NSF of Hubei Province (No. 2011CDA034).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, M., Li, W., Zhang, H. (2014). Paraphrase Collocations Extraction Based on Concept Expansion. In: Wen, Z., Li, T. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54930-4_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-54930-4_19
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54929-8
Online ISBN: 978-3-642-54930-4
eBook Packages: EngineeringEngineering (R0)