Arabic Paraphrasing Recognition Based Kernel Function for Measuring the Similarity of Pairs

Elfaik, Hanane; Bekkali, Mohammed; Brahim, Habibi; Lachkar, Abdelmonaime

doi:10.1007/978-3-030-11914-0_20

Hanane Elfaik⁵,
Mohammed Bekkali⁵,
Habibi Brahim⁶ &
…
Abdelmonaime Lachkar⁵

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 66))

Included in the following conference series:

International Conference on Advanced Information Technology, Services and Systems

330 Accesses

Abstract

Paraphrasing techniques aim to recognize, generate, or extract linguistic expressions that express the same meaning. These techniques affects positively or negatively the performance of many natural language-processing systems such as Question Answering, Summarization, Text Generation, and Machine Translation.... In this paper, we propose an efficient Arabic paraphrase recognizer based on kernel function and the specificity of terms, which is computed by term co-occurrence and term frequency - inverse document frequency. The experimental results show that our method outperforms the exiting methods based on similarity measures using a standard Arabic paraphrase database PPDB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Al-Smadi, M., Jaradat, Z., Al-Ayyoub, M., Jararweh, Y.: Paraphrase identification and semantic text similarity analysis in arabic news tweets using lexical, syntactic, and semantic features. Inf. Process. Manage. Int. J. 53(3), 640–652 (2017)
Article Google Scholar
Bhagat, R., Hovy, E.: What is a paraphrase? Comput. Linguist. 39(3), 463–472 (2013)
Article Google Scholar
Bhagat, R., Hovy, E., Patwardhan, S.: Acquiring paraphrases from text corpora. In: Proceedings of the Fifth International Conference on Knowledge Capture, pp. 161–168. ACM (2009)
Google Scholar
Dennis, S., Landauer, T., Kintsch, W., Quesada, J.: Introduction to latent semantic analysis. In: Slides from the Tutorial given at the 25th Annual Meeting of the Cognitive Science Society, Boston (2003)
Google Scholar
Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 138–145. Morgan Kaufmann Publishers Inc. (2002)
Google Scholar
Dolan, B., Quirk, C., Brockett, C.: Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 350. Association for Computational Linguistics (2004)
Google Scholar
Eyecioglu, A., Keller, B.: Twitter paraphrase identification with simple overlap features and svms. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 64–69 (2015)
Google Scholar
Fellbaum, C.: A semantic network of english verbs. WordNet: Electron. Lexical database 3, 153–178 (1998)
Google Scholar
Fernando, S., Stevenson, M.: A semantic similarity approach to paraphrase detection. In: Proceedings of the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics, pp. 45–52 (2008)
Google Scholar
Finch, A., Hwang, Y.S., Sumita, E.: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. In: Proceedings of the Third International Workshop on Paraphrasing (IWP2005) (2005)
Google Scholar
Hassan, S., Mihalcea, R.: Semantic Relatedness using Salient Semantic Analysis. AAAI press, San Francisco (2011)
Google Scholar
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. arXiv preprint cmp-lg/9709008 (1997)
Google Scholar
Kozareva, Z., Montoyo, A.: Paraphrase identification on the basis of supervised machine learning techniques. In: Advances in Natural Language Processing, pp. 524–533. Springer (2006)
Google Scholar
Manning, C.D., Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT press (1999)
Google Scholar
Mihalcea, R., Corley, C., Strapparava, C., et al.: Corpus-based and knowledge-based measures of text semantic similarity. In: AAAI, vol. 6, pp. 775–780 (2006)
Google Scholar
Milajevs, D., Kartsaklis, D., Sadrzadeh, M., Purver, M.: Evaluating neural word representations in tensor-based compositional settings. arXiv preprint arXiv:1408.6179 (2014)
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics (2002)
Google Scholar
Spärck Jones, K.: IDF term weighting and IR research lessons. J. Documentation 60(5), 521–523 (2004)
Article Google Scholar
Su, K.Y., Wu, M.W., Chang, J.S.: A new quantitative quality measure for machine translation systems. In: Proceedings of the 14th Conference on Computational Linguistics, vol. 2, pp. 433–439. Association for Computational Linguistics (1992)
Google Scholar
Tillmann, C., Vogel, S., Ney, H., Zubiaga, A., Sawaf, H.: Accelerated DP based search for statistical translation. In: Fifth European Conference on Speech Communication and Technology (1997)
Google Scholar
Ul-Qayyum, Z., Altaf, W.: Paraphrase identification using semantic heuristic features. Res. J. Appl. Sci. Eng. Technol. 4(22), 4894–4904 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Engineering Systems and Applications, Sidi Mohammed Ben Abdellah University, Fez, Morocco
Hanane Elfaik, Mohammed Bekkali & Abdelmonaime Lachkar
Institute of Studies and Research for Arabization, Mohammed V University, Rabat, Morocco
Habibi Brahim

Authors

Hanane Elfaik
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Bekkali
View author publications
You can also search for this author in PubMed Google Scholar
Habibi Brahim
View author publications
You can also search for this author in PubMed Google Scholar
Abdelmonaime Lachkar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hanane Elfaik .

Editor information

Editors and Affiliations

Faculty of Sciences and Technologies, Mohammedia, Morocco
Faddoul Khoukhi
Faculty of Sciences and Technologies, Settat, Morocco
Mohamed Bahaj
Faculty of Sciences and Technologies, Boukhalef Tangier, Morocco
Mostafa Ezziyyani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Elfaik, H., Bekkali, M., Brahim, H., Lachkar, A. (2019). Arabic Paraphrasing Recognition Based Kernel Function for Measuring the Similarity of Pairs. In: Khoukhi, F., Bahaj, M., Ezziyyani, M. (eds) Smart Data and Computational Intelligence. AIT2S 2018. Lecture Notes in Networks and Systems, vol 66. Springer, Cham. https://doi.org/10.1007/978-3-030-11914-0_20

Download citation

DOI: https://doi.org/10.1007/978-3-030-11914-0_20
Published: 01 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11913-3
Online ISBN: 978-3-030-11914-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics