Towards a New Standard Arabic Test Collection for Mono- and Cross-Language Information Retrieval

  • Oussama Ben Khiroun
  • Raja Ayed
  • Bilel Elayeb
  • Ibrahim Bounhas
  • Narjès Bellamine Ben Saoud
  • Fabrice Evrard
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8455)

Abstract

We propose in this paper a new standard Arabic test collection for mono- and cross-language Information Retrieval (CLIR). To do this, we exploit the “Hadith” texts and we provide a portal for sampling and evaluation of Hadiths’ results listed in both Arabic and English versions. The new called “Kunuz” standard Arabic test collection will promote and restart the development of Arabic mono retrieval and CLIR systems blocked since the earlier TREC-2001 and TREC-2002 editions.

Keywords

Mono- and Cross-Language Information Retrieval Arabic Language Standard Test Collection Sampling Evaluation 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abu El-Khair, I.: Arabic information retrieval. Annu. Rev. Inf. Sci. Technol. 41, 505–533 (2007)CrossRefGoogle Scholar
  2. 2.
    Beseiso, M., Ahmad, A.R., Ismail, R.: A Survey of Arabic language Support in Semantic web. Int. J. Comput. Appl. 9, 35–40 (2010)Google Scholar
  3. 3.
    Zayed, O., El-Beltagy, S., Haggag, O.: An Approach for Extracting and Disambiguating Arabic Persons’ Names Using Clustered Dictionaries and Scored Patterns. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds.) NLDB 2013. LNCS, vol. 7934, pp. 201–212. Springer, Heidelberg (2013)CrossRefGoogle Scholar
  4. 4.
    Gey, F.C., Oard, D.W.: The TREC-2001 Cross-Language Information Retrieval Track: Searching Arabic Using English, French or Arabic Queries. In: The Tenth Text REtrieval Conference (TREC), pp. 16–25 (2002)Google Scholar
  5. 5.
    Bounhas, I., Elayeb, B., Evrard, F., Slimani, Y.: Toward a Computer Study of the Reliability of Arabic Stories. J. Am. Soc. Inf. Sci. Technol. 61, 1686–1705 (2010)Google Scholar
  6. 6.
    Clarke, C.L.A., Craswell, N., Soboroff, I., Cormack, G.V.: Overview of the TREC 2010 Web Track. In: The 19th Text REtrieval Conference (TREC) (2011)Google Scholar
  7. 7.
    Ayed, R., Bounhas, I., Elayeb, B., Evrard, F., Bellamine Ben Saoud, N.: Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier. In: Huang, D.-S., Ma, J., Jo, K.-H., Gromiha, M.M. (eds.) ICIC 2012. LNCS, vol. 7390, pp. 274–279. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  8. 8.
    Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C., Lioma, C.: Terrier: A High Performance and Scalable Information Retrieval Platform. In: Proceedings of ACM SIGIR 2006 Workshop on Open Source Information Retrieval (OSIR), pp. 18–25 (2006)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Oussama Ben Khiroun
    • 1
  • Raja Ayed
    • 1
  • Bilel Elayeb
    • 1
    • 3
  • Ibrahim Bounhas
    • 2
  • Narjès Bellamine Ben Saoud
    • 1
    • 4
  • Fabrice Evrard
    • 5
  1. 1.RIADI Research LaboratoryENSI Manouba UniversityTunisia
  2. 2.LISI Lab. of Computer Science for Industrial SystemsISD Manouba UniversityTunisia
  3. 3.Emirates College of TechnologyAbu DhabiUnited Arab Emirates
  4. 4.Higher Institute of Informatics (ISI)Tunis El Manar UniversityTunisia
  5. 5.Informatics Research Institute of Toulouse (IRIT)ToulouseFrance

Personalised recommendations