Abstract
Arabic is understudied in second-language research (L2) and lacks rapid and adequate tools for measuring proficiency. Drawing inspiration from LexTALE and its extensions, this study created and validated a quick receptive vocabulary size test to estimate L2 Arabic proficiency. In Experiment 1, the initial version of LexArabic was designed and evaluated with 192 L1 and L2 speakers. Item analysis using point-biserial correlations and item response theory (IRT) was conducted to refine the test, resulting in a final version with 90 items (60 words, 30 nonwords). Experiment 2 assessed LexArabic reliability, validity, and accuracy using a new group of 260 L1 and L2 speakers. The validity of LexArabic was evaluated through objective tasks (a general Arabic proficiency test, L1 to L2 translation task, L2 to L1 translation task) as well as subjective tasks (self-rated proficiency). Results showed that LexArabic demonstrated good reliability (α > 90), validity (correlation with two objective tasks and one subjective task), as well as accuracy (AUC value = .88). The introduction of LexArabic has the potential to facilitate experimental research on both L1 and L2 Arabic speakers and contributes to the development of standardized L2 proficiency assessment across languages.
Similar content being viewed by others
Data Availability
Data and analysis codes are available upon request.
References
Abou-Ghazaleh, A., Khateb, A., & Nevat, M. (2018). Lexical competition between spoken and literary Arabic: A new look into the neural basis of diglossia using fMRI. Neuroscience, 393, 83–96.
Abu-Rabia, S., & Siegel, L. S. (2003). Reading skills in three orthographies: The case of trilingual Arabic–Hebrew–English-speaking Arab children. Reading and Writing, 16(7), 611–34.
Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing and interpreting interactions. Sage.
Al-Gahtani, S., & Roever, C. (2014). Preference structure in L2 Arabic requests. Intercultural Pragmatics, 11(4). https://doi.org/10.1515/ip-2014-0027
Alamry, A. (2014). The acquisition of gender agreement in adult learners of Arabic. Carleton University. Doctoral dissertation.
Alavi, S. M., & Akbarian, I. H. (2012). The role of vocabulary size in predicting performance on TOEFL reading item types. System, 40(3), 376–385. https://doi.org/10.1016/j.system.2012.07.002
Albirini, A. (2016). Modern Arabic sociolinguistics: Diglossia, variation, codeswitching, attitudes and identity. Routledge.
Alhawary, M. T. (2007). Null subjects use by english and spanish learners of arabic as an L2. In E. Benmamoun (Ed.), Perspectives on Arabic Linguistics XIX: Papers from the Nineteenth Annual Symposium on Arabic Linguistics (pp. 217–45). John Benjamins.
Alhawary, M. T. (2009). Arabic Second Language Acquisition of Morphosyntax. Yale University Press.
Alhawary, M. T. (2019). Arabic second language learning and effects of input, transfer, and typology. Georgetown University Press.
Allal-Sumoto, T. K., Miyoshi, K., & Mizuhara, H. (2023). The effect of productive vocabulary knowledge on second language comprehension. Frontiers in Psychology, 14. https://doi.org/10.3389/fpsyg.2023.1049885
Alshargi, F., Dibas, S., Alkhereyf, S., Faraj, R., Abdulkareem, B., Yagi, S., & Rambow, O. (2019). Morphologically annotated corpora for seven Arabic dialects: Taizi, Sanaani, Najdi, Jordanian, Syrian, Iraqi and Moroccan. In El-Hajj, W., Belguith, L. H., Bougares, F., Magdy, W., Zitouni, I., Tomeh, N., El-Haj, M., & Zaghouani, W. (Eds.), Proceedings of the Fourth Arabic Natural Language Processing Workshop (137–147). https://doi.org/10.18653/v1/W19-4615
Amenta, S., Badan, L., & Brysbaert, M. (2021). LexITA: A quick and reliable assessment tool for Italian L2 receptive vocabulary size. Applied Linguistics, 42(2), 292–314. https://doi.org/10.1093/applin/amaa020
Anderson, R. C., & Freebody, P. (1983). Reading comprehension and the assessment and acquisition of word knowledge. In B. A. Hutson (Ed.), Advances in Reading/Language Research (pp. 132–255). JAI Press.
Andria, S., Madi-Tarabya, B., & Khateb, A. (2022). Behavioural and electrophysiological analyses of written word processing in spoken and literary Arabic: New insights into the diglossia question. European Journal of Neuroscience, 56(6), 4819–4836. https://doi.org/10.1111/ejn.15781
Andringa, S., & Godfroid, A. (2020). Sampling bias and the problem of generalizability in applied linguistics. Annual Review of Applied Linguistics, 40, 134–142. https://doi.org/10.1017/S0267190520000033
Anwyl-Irvine, A. L., Massonnié, J., Flitton, A., Kirkham, N., & Evershed, J. K. (2020). Gorilla in our midst: An online behavioral experiment builder. Behavior Research Methods, 52(1), 388–407. https://doi.org/10.3758/s13428-019-01237-x
Arts, T., Belinkov, Y., Habash, N., Kilgarriff, A., & Suchomel, V. (2014). arTenTen: Arabic corpus and word sketches. Journal of King Saud University-Computer and Information Sciences, 26(4), 357–371. https://doi.org/10.1016/j.jksuci.2014.06.009
Asadi, I. A., & Asli-Badarneh, A. (2023). The impact of Diglossia-Effect on reading acquisition among Arabic-speaking children: A longitudinal study. Journal of Psycholinguistic Research, 1–19. https://doi.org/10.1007/s10936-023-09969-w
Asadi, I. A., Khateb, A., & Shany, M. (2017). How simple is reading in Arabic? A cross‐sectional investigation of reading comprehension from first to sixth grade. Journal of Research in Reading, 40. https://doi.org/10.1111/1467-9817.12093
Asli-Badarneh, A., & Asadi, I. (2023). The impact of lexical and phonological distance on reading acquisition: The diglossic context of Arabic. Journal of Research in Reading. https://doi.org/10.1111/1467-9817.12433
Attanal Alarabi. (n.d.). Attanal Alarabi. https://attanalalarabi.com/. Accessed Oct 2023.
Baayen, R. H., Piepenbrock, R., & Gulikers, L. (1995). CELEX2 LDC96L14. Linguistic Data Consortium.
Baker, F. B., & Kim, S. H. (2017). The basics of item response theory using R. Springer.
Barrow, J., Nakanishi, Y., & Ishino, H. (1999). Assessing Japanese college students’ vocabulary knowledge with a self-checking familiarity survey. System, 27(2), 223–247. https://doi.org/10.1016/S0346-251X(99)00018-4
Beeckmans, R., Eyckmans, J., Janssens, V., Dufranne, M., & Van de Velde, H. (2001). Examining the yes/no vocabulary test: Some methodological issues in theory and practice. Language Testing, 18(3), 235–274.
Beglar, D. (2010). A rasch-based validation of the vocabulary size test. Language Testing, 27(1), 101–118. https://doi.org/10.1177/0265532209340194
Bidaoui, A., Foote, R., & Abunasser, M. (2016). Relative clause attachment in native and L2 Arabic. International Journal of Arabic Linguistics, 2(2), 75–95.
Blasi, D. E., Henrich, J., Adamou, E., Kemmerer, D., & Majid, A. (2022). Over-reliance on English hinders cognitive science. Trends in Cognitive Sciences, 26(12), 1153–1170. https://doi.org/10.1016/j.tics.2022.09.015
Brysbaert, M. (2013). A fast, free, and efficient test to measure language proficiency in French. Psychologica Belgica, 53(1), 23–37. https://doi.org/10.5334/pb-53-1-23
Bunyan, N. (2022). Evaluation of the placement test in Arabic Linguistics Institute at King Saud University. [Doctoral dissertation, Imam Mohammad Ibn Saud Islamic University].
Bylund, E., Khafif, Z., & Berghoff, R. (2023). Linguistic and geographic diversity in research on second language acquisition and multilingualism: An analysis of selected journals. Applied Linguistics.https://doi.org/10.1093/applin/amad022
Cameron, L. (2002). Measuring vocabulary size in English as an additional language. Language Teaching Research, 6(2), 145–173.
Canning, D. N., McLean, S., & Vitta, J. P. (2022). Rater judgments and word difficulty: Conceptualizing the substantive validity of the VST. Vocabulary Learning and Instruction, 11(2), 30–37.
Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. https://doi.org/10.18637/jss.v048.i06
Chan, I. L., & Chang, C. B. (2018). LEXTALE_CH: A quick, character-based proficiency test for Mandarin Chinese. In: Proceedings of the 42nd Annual Boston University Conference on Language Development, pp. 114–130.
Chen, C., & Liu, Y. (2020). The role of vocabulary breadth and depth in IELTS academic reading tests. Reading in a Foreign Language, 32(1), 1–27. 10125/66574.
Cheng, J., & Matthews, J. (2018). The relationship between three measures of L2 vocabulary knowledge and L2 listening and reading. Language Testing, 35, 3–25. The relationship between three measures of L2 vocabulary knowledge and L2 listening and reading.
Clahsen, H., & Felser, C. (2017). Some notes on the shallow structure hypothesis. Studies in Second Language Acquisition, 1–14. https://doi.org/10.1017/s0272263117000250
Clark, J. M., & Paivio, A. (2004). Extensions of the Paivio, Yuille, and Madigan (1968) norms. Behavior Research Methods, Instruments, & Computers, 36(3), 371–383.
Davies, M. (2008). The corpus of contemporary american english (COCA): 1.0 billion, US, 1990–2019. https://www.english-corpora.org/coca/. Accessed June 2023.
DeKeyser, R. (2007). Skill acquisition theory. In B. VanPatten & J. Williams (Eds.), Theories in second language acquisition: An introduction (pp. 97–113). Lawrence Erlbaum Associates Publishers.
Dong, Y., Tang, Y., Chow, B. W. Y., Wang, W., & Dong, W. Y. (2020). Contribution of vocabulary knowledge to reading comprehension among Chinese students: A meta-analysis. Frontiers in Psychology. https://doi.org/10.3389/fpsyg.2020.525369
Embretson, S. E., & Reise, S. P. (2013). Item response theory. Psychology Press.
Enayat, M. J., & Derakhshan, A. (2021). Vocabulary size and depth as predictors of second language speaking ability. System, 99.
Eyckmans, J. (2004). Measuring receptive vocabulary size. Reliability and validity of the Yes/No vocabulary test for French-speaking learners of Dutch. LOT.
Fan, J., Upadhye, S., & Worster, A. (2006). Understanding receiver operating characteristic (ROC) curves. Canadian Journal of Emergency Medicine, 8(1), 19–20. https://doi.org/10.1017/s1481803500013336
Ferguson, C. (1991). Diglossia revisited. Southwest Journal of Linguistics, 10, 214–234.
Fitzpatrick, T., & Clenton, J. (2010). The challenge of validation: Assessing the performance of a test of productive vocabulary. Language Testing, 27(4), 537–554.
Foote, R., Qasem, M., & Trentman, E. (2020). Morphological decomposition in L2 Arabic: A masked priming study. Journal of Psycholinguistic Research, 49(2), 291–317. https://doi.org/10.1007/s10936-020-09688-6
Garcia, R., Roeser, J., & Kidd, E. (2022). Online data collection to address language sampling bias: Lessons from the COVID-19 pandemic. Linguistics Vanguard.
González, R. A., & Píriz, A. M. P. (2016). Measuring the productive vocabulary of secondary school CLIL students: Is Lex30 a valid test for low-level school learners? Vigo International Journal of Applied Linguistics, 13, 31–54.
Gupta, R., Day, C. N., Tobin, W. O., & Crowson, C. S. (2022). Understanding the effect of categorization of a continuous predictor with application to neuro-oncology. Neuro-Oncology Practice, 9(2), 87–90.
Gyllstad, H., Vilkaitė, L., & Schmitt, N. (2015). Assessing vocabulary size through multiple-choice formats. International Journal of Applied Linguistics, 166(2), 278–306. https://doi.org/10.1075/itl.166.2.04gyl
Ha, H. T. (2021). Exploring the relationships between various dimensions of receptive vocabulary knowledge and L2 listening and reading comprehension. Language Testing in Asia, 11(1), 1–20.
Hacking, J. F., & Tschirner, E. (2017). The contribution of vocabulary knowledge to reading proficiency: The case of college Russian. Foreign Language Annals, 50, 500–518.
Hajian-Tilaki, K. (2013). Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian Journal of Internal Medicine, 4(2), 627–635.
Harkio, N., & Pietilä, P. (2016). The role of vocabulary breadth and depth in reading comprehension: a quantitative study of finnish EFL learners. Journal of Language Teaching and Research, 7(9), 1079–1088. https://doi.org/10.17507/jltr.0706.03
Harrington, M., & Carey, M. (2009). The on-line yes/no test as a placement tool. System, 37, 614–626.
Hartsuiker, R., & Bernolet, S. (2017). Syntactic representations in late learners of a second language: A learning trajectory. In D. Miller, F. Bayram, J. Rothman, & L. Serratrice (Eds.), Bilingual cognition and language : The state of the science across its subfields (pp. 205–224). John Benjamins. https://doi.org/10.1075/sibil.54.10ber
Hashimoto, B. J., & Egbert, J. (2019). More than frequency? Exploring predictors of word difficulty for second language learners. Language Learning, 69(4), 839–872. https://doi.org/10.1111/lang.12353
Henrich, J., Heine, S. J., & Norenzayan, A. (2010). Most people are not WEIRD. Nature, 466(7302), 26–26.
Henriksen, B. (1999). Three dimensions of vocabulary development. Studies in Second Language Acquisition, 21(2), 303–317.
Hermena, E. W., Liversedge, S. P., Bouamama, S., & Drieghe, D. (2019). Orthographic and root frequency effects in Arabic: Evidence from eye movements and lexical decision. Journal of Experimental Psychology: Learning, Memory, and Cognition, 45(5), 934–954. https://doi.org/10.1037/xlm0000626
Huibregtse, I., Admiraal, W., & Meara, P. (2002). Scores on a yes-no vocabulary test: Correction for guessing and response style. Language Testing, 19(3), 227–245.
Humayoun, M., Hammarström, H., & Ranta, A. (2022). Urdu morphology, orthography and lexicon extraction. ArXiv Preprint. 2204.03071.
Izura, C., Cuetos, F., & Brysbaert, M. (2014). Lextale-Esp: A test to rapidly and efficiently assess the Spanish vocabulary size. Piscológica, 35(1), 49–66.
Jabbar, A., Iqbal, S., Khan, M. U. G., & Hussain, S. (2018). A survey on urdu and urdu-like language stemmers and stemming techniques. Artificial Intelligence Review, 49, 339–373.
Jaeger, T. F., & Norcliffe, E. J. (2009). The cross-linguistic study of sentence production. Language and Linguistics Compass, 3(4), 866–887. https://doi.org/10.1111/j.1749-818X.2009.00147.x
Jeon, E. H., & Yamashita, J. (2014). L2 reading comprehension and its correlates: A meta-analysis. Language Learning, 64(1), 160–212.
Kachru, Y. (2008). Hindi–Urdu–Hindustani. In B. Kachru, Y. Kachru, & S. Sridhar (Eds.), Language in South Asia (pp. 81–102). Cambridge University Press. https://doi.org/10.1017/CBO9780511619069.006
Kidd, E., & Garcia, R. (2022). How diverse is child language acquisition research? First Language, 42(6), 703–735. https://doi.org/10.1177/01427237211066405
Kremmel, B., & Schmitt, N. (2016). Interpreting vocabulary test scores: What do various item formats tell us about learners’ ability to employ words? Language Assessment Quarterly, 13(4), 377–392.
Kruschke, J. (2013). Bayesian estimation supersedes the t test. Journal of Experimental Psychology: General, 142(2), 573–603. https://doi.org/10.1037/a0029146
Kruschke, J., & Meredith, M. (2021). BEST: Bayesian estimation supersedes the t-test. R package. (0.5.4). https://cran.r-project.org/package=BEST. Accessed June 2023.
Kupisch, T., Arona, S., Besler, A., Cruschina, S., Ferin, M., Gyllstad, H., & Venagli, I. (2023). LexSIC: A quick vocabulary test for Sicilian. Isogloss, 9(1), 1–24. https://doi.org/10.5565/rev/isogloss.302
Kwaik, K. A., Saad, M., Chatzikyriakidis, S., & Dobnik, S. (2018). A lexical distance study of Arabic dialects. Procedia Computer Science, 142, 2–13. https://doi.org/10.1016/j.procs.2018.10.456
Laufer, B., & Aviad-Levitzky, T. A. M. I. (2017). What type of vocabulary knowledge predicts reading comprehension: Word meaning recall or word meaning recognition? The Modern Language Journal, 101(4), 729–741.
Leal, T. (2018). Data analysis and sampling: Methodological issues concerning proficiency in SLA research. In A. Gudmestad & A. Edmonds (Eds.), Critical reflections on data in second language acquisition (pp. 63–88). John Benjamins Publishing Company. https://doi.org/10.1075/lllt.51.04lea
Lee, S. T., van Heuven, W. J., Price, J. M., & Leong, C. X. R. (2023). LexMAL: A quick and reliable lexical test for Malay speakers. Behavior Research Methods, 1–19. https://doi.org/10.3758/s13428-023-02202-5
Lemhöfer, K., & Broersma, M. (2012). Introducing LexTALE: A quick and valid lexical test for advanced learners of English. Behavior Research Methods, 44, 325–343. https://doi.org/10.3758/s13428-011-0146-0
Li, Y., & Zhang, X. (2019). L2 vocabulary knowledge and L2 listening comprehension: A structural equation model. Canadian Journal of Applied Linguistics, 22(1).
Mandrekar, J. N. (2010). Receiver operating characteristic curve in diagnostic test assessment. Journal of Thoracic Oncology, 5(9), 1315–1316.
Marian, V., Blumenfeld, H. K., & Kaushanskaya, M. (2007). The language experience and proficiency questionnaire (LEAP-Q): Assessing language profiles in bilinguals and multilinguals. Journal of Speech Language and Hearing Research, 50(4), 940–967. https://doi.org/10.1044/1092-4388(2007/067)
Marian, V., Blumenfeld, H. K., & Kaushanskaya, M. (2020). The LEAP-Q language experience and proficiency questionnaire: Ten years later. Bilingualism: Language and Cognition, 23(5), 945–950. https://doi.org/10.1017/S1366728919000038
Masrai, A. (2022a). The development and validation of a lemma-based yes/no vocabulary size test. SAGE Open, 12(1). https://doi.org/10.1177/21582440221074355
Masrai, A. (2022). The relationship between two measures of L2 phonological vocabulary knowledge and L2 listening comprehension. TESOL Journal, 13(1), 1–16. https://doi.org/10.1002/tesj.612
Masrai, A., & Milton, J. (2019). How many words do you need to speak Arabic? An Arabic vocabulary size test. The Language Learning Journal, 47(5), 519–536.
McLean, S., Stewart, J., & Batty, A. O. (2020). Predicting L2 reading proficiency with modalities of vocabulary knowledge: A bootstrapping approach. Language Testing, 37(3), 389–411.
Meara, P. (1996a). English vocabulary tests: 10 k. Unpublished manuscript. Swansea: Center for applied language studies.
Meara, P. (1996). The dimensions of lexical competence. In G. Brown, K. Malmkjaer, & J. Williams (Eds.), Performance and competence in second language acquisition (pp. 35–53). Cambridge University Press.
Meara, P., & Buxton, B. (1987). An alternative to multiple choice vocabulary tests. Language Testing, 4(2), 142–154. https://doi.org/10.1177/026553228700400202
Meara, P., & Jones, G. (1990). Eurocentre’s vocabulary size test. Eurocentres.
Meara, P., & Milton, J. (2003). The Swansea levels test. Newbury Express.
Melka, F. (1997). Receptive vs. productive aspects of vocabulary. In N. Schmitt & M. McCarthy (Eds.), Vocabulary: Description, acquisition, and pedagogy (pp. 84–102). Cambridge University Press.
Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50(9), 741–749. https://doi.org/10.1037/0003-066X.50.9.741
Messick, S. (2005). Standards of validity and the validity of standards in performance assessment. Educational Measurement: Issues and Practice, 14(4), 5–8. https://doi.org/10.1111/j.1745-3992.1995.tb00881.x
Milton, J. (2009). Measuring second language vocabulary acquisition. Multilingual Matters.
Milton, J., & Fitzpatrick, T. (2017). Introduction: Deconstructing Vocabulary Knowledge. In J. Milton & T. Fitzpatrick (Eds.), Dimensions of vocabulary knowledge (pp. 1–12). Bloomsbury Publishing. https://doi.org/10.1007/978-1-137-36831-7_1
Miralpeix, I., & Muñoz, C. (2018). Receptive vocabulary size and its relationship to EFL language skills. International Review of Applied Linguistics in Language Teaching, 56(1), 1–24. https://doi.org/10.1515/iral-2017-0016
Mochida, K., & Harrington, M. (2006). The yes/no test as a measure of receptive vocabulary knowledge. Language Testing, 23(1), 73–98.
Moret-Tatay, C., & Perea, M. (2011). Is the go/no-go lexical decision task preferable to the yes/no task with developing readers? Journal of Experimental Child Psychology, 125–132.
Mustafawi, E. (2017). Arabic phonology. In Elabbas Benmamoun & R. Bassiouney (Eds.), The Routledge handbook of Arabic linguistics (pp. 11–31). Routledge.
Nakata, T., Tamura, Y., & Aubrey, S. (2020). Examining the validity of the LexTALE test for Japanese college students. The Journal of Asia TEFL, 17(2), 335–348. https://doi.org/10.18823/asiatefl.2020.17.2.2.335
Nation, P., & Beglar, D. (2007). A vocabulary size test. The Language Teacher, 31(7), 9–13.
Nevat, M., Khateb, A., & Prior, A. (2014). When first language is not first: an functional magnetic resonance imaging investigation of the neural basis of diglossia in Arabic. European Journal of Neuroscience, 40(9), 3387–3395.
Noor, H. H. (2007). Competency in first language: Does it affect the quality of second language writing? Dirasat: Human and Social Sciences, 34(2), 412–24.
Olson, D. J. (2023). A systematic review of proficiency assessment methods in bilingualism research. International Journal of Bilingualism, 1–36. https://doi.org/10.1177/13670069231153720
Park, H. I., Solon, M., Dehghan-Chaleshtori, M., & Ghanbar, H. (2022). Proficiency reporting practices in research on second language acquisition: Have we made any progress? Language Learning, 72(1), 198–236. https://doi.org/10.1111/lang.12475
Pavlović, T. (2013). Exploring directionality in translation studies. ExELL (Explorations in English Language and Linguistics), 1(2), 149–165.
Pellicer-Sánchez, A., & Schmitt, N. (2012). Scoring Yes–No vocabulary tests: Reaction time vs. nonword approaches. Language Testing, 29(4), 489–509.
Perea, M., Mallouh, R. A., & Carreiras, M. (2014). Are root letters compulsory for lexical access in semitic languages? The case of masked form-priming in arabic. Cognition, 132(3), 491–500.
Plonsky, L. (2023). Sampling and generalizability in Lx research: A second-order synthesis. Languages, 8(1). https://doi.org/10.3390/languages8010075
Pokorn, N. K., Blake, J., Reindl, D., & Pisanski Peterlin, A. (2020). The influence of directionality on the quality of translation output in educational settings. The Interpreter and Translator Trainer, 14(1), 58–78. https://doi.org/10.1080/1750399X.2019.1594563
Puig-Mayenco, E., Chaouch-Orozco, A., Liu, H., & Martín-Villena, F. (2023). The LexTALE as a measure of L2 global proficiency: A cautionary tale based on a partial replication of Lemhöfer and Broersma (2012). Linguistic Approaches to Bilingualism, 13(3), 299–314. https://doi.org/10.1075/lab.22048.pui
Qi, S., Teng, M. F., & Fu, A. (2022). LexCH: a quick and reliable receptive vocabulary size test for Chinese Learners. Applied Linguistics Review.
Quiles, C., Kūriákī, K., & López-Menchero, F. (2011). A grammar of modern Indo-European. Indo-European Association.
R Core Team. (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing https://www.r-project.org/
Raish, M. (2017). The measurement of the complexity, accuracy, and fluency of written Arabic. Georgetown University.
Raish, M. (2021). Issues in Arabic Language Testing and Assessment. In K. Ryding & D. Wilmsen (Eds.), The Cambridge Handbook of Arabic Linguistics (pp. 83–105). Cambridge University Press. https://doi.org/10.1017/97811082
Revelle, W. (2023). Psych: Procedures for psychological, psychometric, and personality research. R package. (2.3.6). https://cran.r-project.org/package=psych. Accessed June 2023.
Ricks, R. S. (2015). The development of frequency-based assessments of vocabulary breadth and depth for L2 Arabic. Unpublished doctoral dissertation. Georgetown University.
Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J. C., & Müller, M. (2011). pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics, 12(1), 1–8. https://doi.org/10.1186/1471-2105-12-77
Saiegh-Haddad, E., & Spolsky, B. (2014). Acquiring literacy in a diglossic context: Problems and prospects. In E. Saiegh-Haddad & M. Joshi (Eds.), Handbook of Arabic literacy: Insights and perspectives (pp. 225–240). Springer.
Salmela, R., Lehtonen, M., Garusi, S., & Bertram, R. (2021). Lexize: A test to quickly assess vocabulary knowledge in finnish. Scandinavian Journal of Psychology, 62(6), 806–819.
Schmitt, N. (2014). Size and depth of vocabulary knowledge: What the research shows. Language Learning, 64(4), 913–951. https://doi.org/10.1111/lang.12077
Sharoff, S. (2006). Open-source corpora: Using the net to fish for linguistic data. International Journal of Corpus Linguistics, 11(4), 435–62.
Signorell, A. (2023).DescTools: Tools for descriptive statistics. R package. (version 0.99.49). https://cran.r-project.org/package=DescTools. Accessed June 2023.
Silva, R., & Clahsen, H. (2008). Morphologically complex words in L1 and L2 processing: Evidence from masked priming experiments in English. Bilingualism Language and Cognition, 11(2), 245–260. https://doi.org/10.1017/S1366728908003404
Skalicky, S., Crossley, S. A., & Berger, C. M. (2019). Predictors of second language english lexical recognition. The Mental Lexicon, 14(3), 333–356. https://doi.org/10.1075/ml.19028.ska
Sketch Engine. (n.d.). urTenTen. https://www.sketchengine.eu/urtenten-urdu-corpus/. Accessed May 2023.
Stansfield, C. W., Gao, J., & Rivers, W. P. (2010). A concurrent validity study of self-assessments and the federal interagency language roundtable oral proficiency interview. Russian Language Journal, 60, 299–315.
Stewart, J. (2014). Do multiple-choice options inflate estimates of vocabulary size on the VST? Language Assessment Quarterly, 11(3), 271–282.
Streiner, D. L. (2002). Breaking up is hard to do: The heartbreak of dichotomizing continuous data. The Canadian Journal of Psychiatry, 47(3), 262–266.
Streiner, D. L., & Cairney, J. (2007). What’s under the ROC? An introduction to receiver operating characteristics curves. The Canadian Journal of Psychiatry, 52(2), 121–128. https://doi.org/10.1177/070674370705200210
Stubbe, R. (2012). Do pseudoword false alarm rates and overestimation rates in yes/no vocabulary tests change with Japanese university students’ English ability levels? Language Testing, 29(4), 471–488. https://doi.org/10.1177/0265532211433033
Summers, M. M., Cox, T. L., McMurry, B. L., & Dewey, D. P. (2019). Investigating the use of the ACTFL can-do statements in a self-assessment for student placement in an intensive english program. System, 80, 269–287. https://doi.org/10.1016/j.system.2018.12.012
Surrain, S., & Luk, G. (2019). Describing bilinguals: A systematic review of labels and descriptions used in the literature between 2005–2015. Bilingualism Language and Cognition, 22(2), 401–415. https://doi.org/10.1017/S1366728917000682
Suzuki, Y., & DeKeyser, R. (2017). The interface of explicit and implicit knowledge in a second language: Insights from individual differences in cognitive aptitudes. Language Learning, 67(4), 747–790. https://doi.org/10.1111/lang.12241
Tachicart, R., Bouzoubaa, K., Harrat, S., & Smaïli, K. (2022). Morphological Analyzers of Arabic Dialects: A survey. In M. Al-Emran & K. Shaalan (Eds.), Recent Innovations in Artificial Intelligence and Smart Applications (pp. 189–203).
Taha, H., Taha, H., & Shaheen, H. (2023). The effect of the linguistic status of text previewing in Arabic on the reading comprehension outcomes among second and sixth grade native Arabs readers: A cross-sectional view. Journal of Psycholinguistic Research.https://doi.org/10.1007/s10936-023-10013-0
Tahir, N. (2014). Impact of Arabic language on Urdu language. VFAST Transactions on Islamic Research, 5(1).
The University of Jordan. (n.d.). Arabic exam for graduate studies. https://graduatestudies.ju.edu.jo/Lists/ArabicExam/List.aspx. Accessed Oct 2023.
Thomas, M. (2006). Research synthesis and historiography: the case of assessment of second language proficiency. In J. M. Norris & L. Ortega (Eds.), Synthesizing research on language learning and teaching (pp. 279–298). John Benjamins Publishing Company.
Thomure, H., Tamim, M., & Griffiths, M. (2021). The effect of Arabic language diglossia on teaching and learning. https://doi.org/10.13140/RG.2.2.31501.46565
Tomoschuk, B., Ferreira, V. S., & Gollan, T. H. (2019). When a seven is not a seven: Self-ratings of bilingual language proficiency differ between and within language populations. Bilingualism Language and Cognition, 22(3), 516–536. https://doi.org/10.1017/S1366728918000421
Tremblay, A. (2011). Proficiency assessment standards in second language acquisition research: “Clozing” the gap. Studies in Second Language Acquisition, 33(3), 339–372. https://doi.org/10.1017/S0272263111000015
United Arab Emirates University. (n.d.). Alain Arabic proficiency test. https://www.arabic-cec.ae/copy-of-alreysyh-1. Accessed Oct 2023.
Walters, J. (2012). Aspects of validity of a test of productive vocabulary: Lex30. Language Assessment Quarterly, 9(2), 172–185.
Webb, S. (2008). Receptive and productive vocabulary sizes of L2 learners. Studies in Second Language Acquisition, 30(1), 79–95.
Wen, Y., Qiu, Y., Leong, C. X. R., & Van Heuven, W. J. (2023). LexCHI: A quick lexical test for estimating language proficiency in chinese. Behavior R. https://doi.org/10.3758/s13428-023-02151-z
Winke, P., Zhang, X., & Pierce, S. J. (2023). A closer look at a marginalized test method: Self-assessment as a measure of speaking proficiency. Studies in Second Language Acquisition, 45(2), 416–441.
Youssef, I. (2019). The phonology and micro-typology of Arabic R. Glossa: A Journal of General Linguistics, 4(1). https://doi.org/10.5334/gjgl.1002
Yusoff, N. M. R. N., Sulong, S., & Zakaria, W. N. W. (2016). The level of Cloze reading comprehension and its relationship with Arabic language achievement. Islamiyyat The International Journal of Islamic Studies, 38(1), 71–5.
Zanon, C., Hutz, C. S., Yoo, H. H., & Hambleton, R. K. (2016). An application of item response theory to psychological test development. Psicologia: Reflexão e Crítica, 29(18). https://doi.org/10.1186/s41155-016-0040-x
Zhang, S., & Zhang, X. (2022). The relationship between vocabulary knowledge and L2 reading/listening comprehension: A meta-analysis. Language Teaching Research, 26(4), 696–725. https://doi.org/10.1177/1362168820913998
Zhang, X., Liu, J., & Ai, H. (2020). Pseudowords and guessing in the yes/no format vocabulary test. Language Testing, 37(1), 6–30. https://doi.org/10.1177/0265532219862265
Zhou, C., & Li, X. (2022). LextPT: A reliable and efficient vocabulary size test for L2 portuguese proficiency. Behavior Research Methods, 54(6), 2625–2639.
Zughoul, M. R. (1980). Diglossia in arabic: Investigating solutions. Anthropological Linguistics, 22(5), 201–217.
Acknowledgements
I would like to thank the very kind and generous people who helped me in data collection: Layan Alrasheed, Bushra Alzahrani, Fatima Aldijani, and Aljowhara for L1 data collection, as well as Hassan Abu Almukrim and Ibrahim Alansari for L2 data collection. I thank the reviewers for their very helpful and insightful comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Open Practices Statement
Data or materials for the experiments are available upon request.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
LexArabic: test instructions and items
التعليمات
ستقرأ تسعين كلمة في الاختبار. بعض هذه الكلمات هي كلمات عربية حقيقية والبعض الآخر كلمات تم تركيبها لتبدو وكأنها
كلمات حقيقية. فضلاً اختر الكلمات التي تعرفها أو تستطيع استخدامها. قم بالاختيار بوضع علامة صح بجانب الكلمة.
-
الاجابة الخاطئة في الاختبار ٌستنقص من درجتك ولهذا فضلاً لا تختر الكلمات التي لا تعرفها. ضع علامة صح بجانب الكلمات التي تعرفها فحسب.
-
الرجاء إكمال الاختبار بمفردك.
-
الرجاء عدم استخدام القاموس.
-
مدة الاختبار غير محددة
n | Word | Word_type |
---|---|---|
1 | تقوية | Word |
2 | متعمد | Word |
3 | ارتشف | Word |
4 | انتكاسة | Word |
5 | مؤجل | Word |
6 | متسق | Word |
7 | انعقاد | Word |
8 | تكاتف | Word |
9 | منزوي | Word |
10 | مفخخ | Word |
11 | تدنيس | Word |
12 | تكييف | Word |
13 | صحوق | Nonword |
14 | مواذد | Nonword |
15 | جثدية | Nonword |
16 | خزاكين | Nonword |
17 | تسويع | Nonword |
18 | كتاسية | Nonword |
19 | صديد | Word |
20 | خصام | Word |
21 | تفريغ | Word |
22 | عتيد | Word |
23 | ممانعة | Word |
24 | تقصي | Word |
25 | سامق | Word |
26 | لاعب | Word |
27 | جلجلة | Word |
28 | متناحرة | Word |
29 | تزحزح | Word |
30 | مستعر | Word |
31 | تغفيش | Nonword |
32 | تزذير | Nonword |
33 | فتراف | Nonword |
34 | ثموح | Nonword |
35 | فسغالي | Nonword |
36 | سالضين | Nonword |
37 | استهل | Word |
38 | جعجع | Word |
39 | كبح | Word |
40 | مقاضاة | Word |
41 | أعفر | Word |
42 | خشونة | Word |
43 | طغى | Word |
44 | دبيب | Word |
45 | زورق | Word |
46 | سؤدد | Word |
47 | واقي | Word |
48 | كفكف | Word |
49 | طعاكة | Nonword |
50 | متغقتة | Nonword |
51 | هارخة | Nonword |
52 | خسم | Nonword |
53 | ممتخثين | Nonword |
54 | خادثي | Nonword |
55 | استوضح | Word |
56 | صياغة | Word |
57 | عصف | Word |
58 | احتجاج | Word |
59 | أمة | Word |
60 | وهج | Word |
61 | مخلب | Word |
62 | مهشم | Word |
63 | جثمان | Word |
64 | طعم | Word |
65 | اعتناق | Word |
66 | صيرورة | Word |
67 | تهاعب | Nonword |
68 | تمغيل | Nonword |
69 | إشطاقات | Nonword |
70 | مسضون | Nonword |
71 | سكيم | Nonword |
72 | ضماع | Nonword |
73 | تنويه | Word |
74 | تزييف | Word |
75 | مزاولة | Word |
76 | متزمت | Word |
77 | عقاب | Word |
78 | اجتثاث | Word |
79 | مسجى | Word |
80 | فظاظة | Word |
81 | وجنة | Word |
82 | ترهل | Word |
83 | لاذع | Word |
84 | تناغم | Word |
85 | مؤشغات | Nonword |
86 | مشجكة | Nonword |
87 | جاشوزة | Nonword |
88 | أعفام | Nonword |
89 | أرضلة | Nonword |
90 | متحغر | Nonword |
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Alzahrani, A. LexArabic: A receptive vocabulary size test to estimate Arabic proficiency. Behav Res (2023). https://doi.org/10.3758/s13428-023-02286-z
Accepted:
Published:
DOI: https://doi.org/10.3758/s13428-023-02286-z