Skip to main content

Arabic Paraphrasing Recognition Based Kernel Function for Measuring the Similarity of Pairs

  • Conference paper
  • First Online:
Smart Data and Computational Intelligence (AIT2S 2018)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 66))

  • 330 Accesses

Abstract

Paraphrasing techniques aim to recognize, generate, or extract linguistic expressions that express the same meaning. These techniques affects positively or negatively the performance of many natural language-processing systems such as Question Answering, Summarization, Text Generation, and Machine Translation.... In this paper, we propose an efficient Arabic paraphrase recognizer based on kernel function and the specificity of terms, which is computed by term co-occurrence and term frequency - inverse document frequency. The experimental results show that our method outperforms the exiting methods based on similarity measures using a standard Arabic paraphrase database PPDB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Al-Smadi, M., Jaradat, Z., Al-Ayyoub, M., Jararweh, Y.: Paraphrase identification and semantic text similarity analysis in arabic news tweets using lexical, syntactic, and semantic features. Inf. Process. Manage. Int. J. 53(3), 640–652 (2017)

    Article  Google Scholar 

  2. Bhagat, R., Hovy, E.: What is a paraphrase? Comput. Linguist. 39(3), 463–472 (2013)

    Article  Google Scholar 

  3. Bhagat, R., Hovy, E., Patwardhan, S.: Acquiring paraphrases from text corpora. In: Proceedings of the Fifth International Conference on Knowledge Capture, pp. 161–168. ACM (2009)

    Google Scholar 

  4. Dennis, S., Landauer, T., Kintsch, W., Quesada, J.: Introduction to latent semantic analysis. In: Slides from the Tutorial given at the 25th Annual Meeting of the Cognitive Science Society, Boston (2003)

    Google Scholar 

  5. Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 138–145. Morgan Kaufmann Publishers Inc. (2002)

    Google Scholar 

  6. Dolan, B., Quirk, C., Brockett, C.: Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 350. Association for Computational Linguistics (2004)

    Google Scholar 

  7. Eyecioglu, A., Keller, B.: Twitter paraphrase identification with simple overlap features and svms. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 64–69 (2015)

    Google Scholar 

  8. Fellbaum, C.: A semantic network of english verbs. WordNet: Electron. Lexical database 3, 153–178 (1998)

    Google Scholar 

  9. Fernando, S., Stevenson, M.: A semantic similarity approach to paraphrase detection. In: Proceedings of the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics, pp. 45–52 (2008)

    Google Scholar 

  10. Finch, A., Hwang, Y.S., Sumita, E.: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. In: Proceedings of the Third International Workshop on Paraphrasing (IWP2005) (2005)

    Google Scholar 

  11. Hassan, S., Mihalcea, R.: Semantic Relatedness using Salient Semantic Analysis. AAAI press, San Francisco (2011)

    Google Scholar 

  12. Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. arXiv preprint cmp-lg/9709008 (1997)

    Google Scholar 

  13. Kozareva, Z., Montoyo, A.: Paraphrase identification on the basis of supervised machine learning techniques. In: Advances in Natural Language Processing, pp. 524–533. Springer (2006)

    Google Scholar 

  14. Manning, C.D., Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT press (1999)

    Google Scholar 

  15. Mihalcea, R., Corley, C., Strapparava, C., et al.: Corpus-based and knowledge-based measures of text semantic similarity. In: AAAI, vol. 6, pp. 775–780 (2006)

    Google Scholar 

  16. Milajevs, D., Kartsaklis, D., Sadrzadeh, M., Purver, M.: Evaluating neural word representations in tensor-based compositional settings. arXiv preprint arXiv:1408.6179 (2014)

  17. Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics (2002)

    Google Scholar 

  18. Spärck Jones, K.: IDF term weighting and IR research lessons. J. Documentation 60(5), 521–523 (2004)

    Article  Google Scholar 

  19. Su, K.Y., Wu, M.W., Chang, J.S.: A new quantitative quality measure for machine translation systems. In: Proceedings of the 14th Conference on Computational Linguistics, vol. 2, pp. 433–439. Association for Computational Linguistics (1992)

    Google Scholar 

  20. Tillmann, C., Vogel, S., Ney, H., Zubiaga, A., Sawaf, H.: Accelerated DP based search for statistical translation. In: Fifth European Conference on Speech Communication and Technology (1997)

    Google Scholar 

  21. Ul-Qayyum, Z., Altaf, W.: Paraphrase identification using semantic heuristic features. Res. J. Appl. Sci. Eng. Technol. 4(22), 4894–4904 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hanane Elfaik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Elfaik, H., Bekkali, M., Brahim, H., Lachkar, A. (2019). Arabic Paraphrasing Recognition Based Kernel Function for Measuring the Similarity of Pairs. In: Khoukhi, F., Bahaj, M., Ezziyyani, M. (eds) Smart Data and Computational Intelligence. AIT2S 2018. Lecture Notes in Networks and Systems, vol 66. Springer, Cham. https://doi.org/10.1007/978-3-030-11914-0_20

Download citation

Publish with us

Policies and ethics