Abstract
Paraphrasing techniques aim to recognize, generate, or extract linguistic expressions that express the same meaning. These techniques affects positively or negatively the performance of many natural language-processing systems such as Question Answering, Summarization, Text Generation, and Machine Translation.... In this paper, we propose an efficient Arabic paraphrase recognizer based on kernel function and the specificity of terms, which is computed by term co-occurrence and term frequency - inverse document frequency. The experimental results show that our method outperforms the exiting methods based on similarity measures using a standard Arabic paraphrase database PPDB.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Smadi, M., Jaradat, Z., Al-Ayyoub, M., Jararweh, Y.: Paraphrase identification and semantic text similarity analysis in arabic news tweets using lexical, syntactic, and semantic features. Inf. Process. Manage. Int. J. 53(3), 640–652 (2017)
Bhagat, R., Hovy, E.: What is a paraphrase? Comput. Linguist. 39(3), 463–472 (2013)
Bhagat, R., Hovy, E., Patwardhan, S.: Acquiring paraphrases from text corpora. In: Proceedings of the Fifth International Conference on Knowledge Capture, pp. 161–168. ACM (2009)
Dennis, S., Landauer, T., Kintsch, W., Quesada, J.: Introduction to latent semantic analysis. In: Slides from the Tutorial given at the 25th Annual Meeting of the Cognitive Science Society, Boston (2003)
Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 138–145. Morgan Kaufmann Publishers Inc. (2002)
Dolan, B., Quirk, C., Brockett, C.: Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 350. Association for Computational Linguistics (2004)
Eyecioglu, A., Keller, B.: Twitter paraphrase identification with simple overlap features and svms. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 64–69 (2015)
Fellbaum, C.: A semantic network of english verbs. WordNet: Electron. Lexical database 3, 153–178 (1998)
Fernando, S., Stevenson, M.: A semantic similarity approach to paraphrase detection. In: Proceedings of the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics, pp. 45–52 (2008)
Finch, A., Hwang, Y.S., Sumita, E.: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. In: Proceedings of the Third International Workshop on Paraphrasing (IWP2005) (2005)
Hassan, S., Mihalcea, R.: Semantic Relatedness using Salient Semantic Analysis. AAAI press, San Francisco (2011)
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. arXiv preprint cmp-lg/9709008 (1997)
Kozareva, Z., Montoyo, A.: Paraphrase identification on the basis of supervised machine learning techniques. In: Advances in Natural Language Processing, pp. 524–533. Springer (2006)
Manning, C.D., Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT press (1999)
Mihalcea, R., Corley, C., Strapparava, C., et al.: Corpus-based and knowledge-based measures of text semantic similarity. In: AAAI, vol. 6, pp. 775–780 (2006)
Milajevs, D., Kartsaklis, D., Sadrzadeh, M., Purver, M.: Evaluating neural word representations in tensor-based compositional settings. arXiv preprint arXiv:1408.6179 (2014)
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics (2002)
Spärck Jones, K.: IDF term weighting and IR research lessons. J. Documentation 60(5), 521–523 (2004)
Su, K.Y., Wu, M.W., Chang, J.S.: A new quantitative quality measure for machine translation systems. In: Proceedings of the 14th Conference on Computational Linguistics, vol. 2, pp. 433–439. Association for Computational Linguistics (1992)
Tillmann, C., Vogel, S., Ney, H., Zubiaga, A., Sawaf, H.: Accelerated DP based search for statistical translation. In: Fifth European Conference on Speech Communication and Technology (1997)
Ul-Qayyum, Z., Altaf, W.: Paraphrase identification using semantic heuristic features. Res. J. Appl. Sci. Eng. Technol. 4(22), 4894–4904 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Elfaik, H., Bekkali, M., Brahim, H., Lachkar, A. (2019). Arabic Paraphrasing Recognition Based Kernel Function for Measuring the Similarity of Pairs. In: Khoukhi, F., Bahaj, M., Ezziyyani, M. (eds) Smart Data and Computational Intelligence. AIT2S 2018. Lecture Notes in Networks and Systems, vol 66. Springer, Cham. https://doi.org/10.1007/978-3-030-11914-0_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-11914-0_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11913-3
Online ISBN: 978-3-030-11914-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)