The Accountability Culture: a Systematic Review of High-Stakes Testing and English Learners in the United States During No Child Left Behind

Acosta, Sandra; Garza, Tiberio; Hsu, Hsien-Yuan; Goodson, Patricia; Padrón, Yolanda; Goltz, Heather H.; Johnston, Anna

doi:10.1007/s10648-019-09511-2

The Accountability Culture: a Systematic Review of High-Stakes Testing and English Learners in the United States During No Child Left Behind

Review Article
Published: 18 December 2019

Volume 32, pages 327–352, (2020)
Cite this article

Educational Psychology Review Aims and scope Submit manuscript

3474 Accesses
12 Citations
6 Altmetric
1 Mention
Explore all metrics

Abstract

The purpose of our study was twofold: (1) to review the literature on high-stakes accountability testing and English learners (ELs) in the USA, applying qualitative systematic review methodology, and (2) to draw substantive conclusions about the impact of high-stakes accountability testing on ELs, reported in studies published between 2001 and 2016. Thirty-seven studies (n = 37) meeting the eligibility criteria were evaluated using the Methodological Quality Questionnaire. Findings indicate construct irrelevant variance (non-random factors that systematically affect ELs’ test performance) challenges the appropriateness of inferences drawn about ELs’ content knowledge based on assessment performance. Further, language policy in schools, driven by high-stakes tests, promotes increased instructional time for basic skills tutorials with fewer opportunities for problem solving and other higher-order learning activities. Moreover, theory utilization was rare (n = 8, 22%). Among studies that utilized theory, few explicitly referenced it to discuss study findings.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Consequential research of accountability testing: the case of the CET

Article Open access 11 July 2022

Interconnection between constructs and consequences: a key validity consideration in K–12 English language proficiency assessments

Article Open access 07 October 2022

The Validity and Social Legitimacy of the Chilean National Assessment of Learning Outcomes (SIMCE): The Role of Evidence and Deliberation

Notes

Language programs for ELs comprise the following: (1) one-language programs (English as a second language (ESL) without mother tongue or native language instruction) or (2) two-language programs: ESL and native language instruction (see Baker 2001).
Although the EL students recruited for the study of Noble et al. (2012) were classified as formerly limited English proficient (FLEP), the authors of the study (Noble et al. 2012) considered these students to be ELs. The authors justified the inclusion of the FLEP students as follows: “We recruited 12 ELLs who were classified as Formerly Limited English Proficient (FLEP) by their schools because they had been considered Limited English Proficient (LEP) within the school year or the previous 2 years (MA DOE, 2004b). The FLEP students we interviewed demonstrated that they were still learning the English language skills needed to interpret science assessments, and thus we considered this group of 12 students to be ELLs” (Noble et al. 2012, p. 784). The authors of the current review agreed and included the study of Noble et al. (2012) in the sample of eligible studies.

References

^*Studies included in the systematic review

*Abedi, J. (2002). Standardized achievement tests and English language learners: psychometrics issues. Educational Assessment, 8, 231–257. https://doi.org/10.1207/S15326977EA0803_02.
Article Google Scholar
Acosta, S., & Garza, T. (2011). The podcasting playbook: A typology of evidence-based podagogy for prek-12 Classrooms with English language learners. Research in the Schools, 18(2), 39–56 Retrieved from https://search.proquest.com/results/2AEE762C76DE4B35PQ/1?accountid=7082.
Acosta, S., & Goltz, H. (2014). Transforming practices: A primer on action research. Health Promotion Practice, 15(4), 465–470. https://doi.org/10.1177/1524839914527591.
Article Google Scholar
Acosta, S., Goltz, H. H., & Goodson, P. (2015). Autoethnography in action research for health education practitioners. Action Research, 13(4), 411–431. https://doi.org/10.1177/1476750315573589.
Article Google Scholar
Acosta, S., Garza, T., & Hsu, H-Y. (2017, April). Assessing Study Quality Using the Methodological Quality Questionnaire: A Case Study of Novice Rater Training. American Educational Research Association (AERA) 2017 Annual Conference, San Antonio, TX. AERA Online Paper Repository http://www.aera.net/repository.
Acosta, S., Chen, X., Goltz, H., Goodson, P., & Padrón, Y. (2018). A case study of novice bilingual education teachers conducting action research and diffusing teaching innovations. Urban Education. Advance online publication. https://doi.org/10.1177/0042085918805148.
Acosta, S., Garza, T., Hsu, H-Y., & Goodson, P. (under review). Assessing Quality in Systematic Literature Reviews: A Study of Novice Rater Training.
American Educational Research Association. (2006). Standards for reporting on empirical social science research in AERA publications. Educational Researcher, 35(6), 33–40. https://doi.org/10.3102/0013189X035006033.
Article Google Scholar
American Psychological Association. (2010). Publication manual of the American Psychological Association (6th). Washington, DC: Author.
Andrews, R. (2005). The place of systematic reviews in education research. British Journal of Educational Studies, 53(4), 399–416. https://doi.org/10.1111/j.1467-8527.2005.00303.x.
Article Google Scholar
Au, W. (2007). High-stakes testing and curricular control: a qualitative metasynthesis. Educational Researcher, 36(5), 258–267. https://doi.org/10.3102/0013189X07306523.
Article Google Scholar
Bachman, L. F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press.
Google Scholar
Baker, C. (2001). Foundations of bilingual education and bilingualism (3rd ed.). Clevedon: Multilingual Matters.
Google Scholar
*Betts, J., Bolt, S., Decker, D., Muyskens, P., & Marston, D. (2009). Examining the role of time and language type in reading development for English language learners. Journal of School Psychology, 47(3), 143–166. https://doi.org/10.1016/j.jsp.2008.12.002.
Article Google Scholar
*Brown, C. L. (2005). Equity of literacy-based math performance assessments for English language learners. Bilingual Research Journal, 29, 337–363. https://doi.org/10.1080/15235882.2005.10162839.
Article Google Scholar
*Callahan, R. M. (2006). The intersection of accountability and language: can reading intervention replace English language development? Bilingual Research Journal, 30, 1–21. https://doi.org/10.1080/15235882.2006.10162863.
Article Google Scholar
*Capraro, M. M., Capraro, R. M., Yetkiner, Z. E., Rangel-Chavez, A. F., & Lewis, C. W. (2010). Examining Hispanic student mathematics performance on high-stakes tests: an examination of one urban school district in Colorado. The Urban Review, 42(3), 193–209. https://doi.org/10.1007/s11256-009-0127-0.
Article Google Scholar
*Cawthon, S., Leppo, R., Carr, T., & Kopriva, R. (2013). Toward accessible assessments: the promises and limitations of test item adaptations for students with disabilities and English language learners. Educational Assessment, 18, 73–98. https://doi.org/10.1080/10627197.2013.789294.
Article Google Scholar
Chiu, M. M., & Khoo, L. (2005). Effects of resources, inequality, and privilege bias on achievement: country, school, and student level analyses. American Educational Research Journal, 42(4), 575–603. https://doi.org/10.3102/00028312042004575.
Article Google Scholar
Collier, V. P., & Thomas, W. P. (2007). Predicting second language academic success in English using the Prism model. In J. Cummins & C. Davison (Eds.), International handbook of English language teacher, part 1 (pp. 333–348). New York: Springer.
Google Scholar
Colquhoun, H. L., Brehaut, J. C., Sales, A., Ivers, N., Grimshaw, J., Michie, S., Carroll, K., Chalifoux, M., & Eva, K. W. (2013). A systematic review of the use of theory in randomized controlled trials of audit and feedback. Implementation Science, 8(66), 2–8 Retrieved from http://www.implementationscience.com/content/8/1/66.
Google Scholar
Cooper, H. (2010). Research syntheses and meta-analysis: a step-by-step approach (4th ed.). Thousand Oaks: Sage.
Google Scholar
Cummins, J. (1984). Bilingualism and special education: issues in assessment and pedagogy. Clevedon: Multilingual Matters.
Google Scholar
Cummins, J. (2000). Language, power, and pedagogy: bilingual children in the crossfire. Clevedon: Multilingual Matters.
Book Google Scholar
Daly, A. J. (2010). Studying the terrain ahead: social networking theory and educational change. In A. J. Daly (Ed.), Social network theory (pp. 259–274). Cambridge: Harvard Education Press.
Google Scholar
Dorans, N. J., & Kulick, E. (1986). Demonstrating the utility of the standardization approach to assessing unexpected differential item performance on the Scholastic Aptitude Test. Journal of Educational Measurement, 23(4), 355–368 Retrieved from https://www.jstor.org/stable/1434554?seq=1#page_scan_tab_contents. Accessed 15 Dec 2019..
Durán, R. P. (2011). Ensuring valid educational assessment. In M. d. R. Basterra, E. Trumbull, & G. Solano-Flores (Eds.), Cultural validity in assessment: addressing linguistic and cultural diversity (pp. 115–142). New York: Routledge.
Google Scholar
*Escamilla, K., Mahon, E., Riley-Bernal, H., & Rutledge, D. (2003). High-stakes testing, Latinos, and English language learners: lessons from Colorado. Bilingual Research Journal, 27, 25–49. https://doi.org/10.1080/15235882.2003.10162590.
Article Google Scholar
*Escamilla, K., Chávez, L., & Vigil, P. (2005). Rethinking the “gap” high-stakes testing and Spanish-speaking students in Colorado. Journal of Teacher Education, 56, 132–144. https://doi.org/10.1177/0022487104273791.
Article Google Scholar
Every Student Succeeds Act, S.1177, 114th Cong. (2015). Retrieved from https://www.gpo.gov/fdsys/pkg/BILLS-114s1177enr/pdf/BILLS-114s1177enr.pdf
Federal Interagency Forum on Child and Family Statistics. (2013). America’s children: key national indicators of well-being, 2013. Washington: U.S. Government Printing Office Retrieved from https://www.childstats.gov/pdf/ac2013/ac_13.pdf. Accessed 15 Dec 2019.
Feliciano, C., & Lanuza, Y. R. (2017). An immigrant paradox? Contextual attainment and intergenerational educational mobility. American Sociological Review, 82(1), 211–241. https://doi.org/10.1177/0003122416684777.
Article Google Scholar
Fry, R. (2007). The changing racial and ethnic composition of U.S. public schools. Washington: Pew Hispanic Center Retrieved from http://www.pewhispanic.org/2007/08/30/the-changing-racial-and-ethnic-composition-of-us-public-schools/. Accessed 15 Dec 2019.
Gándara, P., Losen, D., August, D., Uriarte, M., Gómez, M. C., & Hopkins, M. (2010). Forbidden language: a brief history of U.S. language policy. In P. Gándara & M. Hopkins (Eds.), Forbidden language: English learners and restrictive language policies (pp. 20–33). New York: Teacher College Press.
Google Scholar
Gardner, H. (1983). Frames of mind: the theory of multiple intelligences. New York: Basic Books.
Google Scholar
Garrard, J. (2011). Health sciences literature review made easy: the matrix method (3rd ed.). Boston: Jones and Bartlett.
Google Scholar
Garza, T., Acosta, S., & Hsu, H-Y. (2013, July). Evaluating a systematic rating review scale: The Methodology Quality Questionnaire. American Psychological Association (APA), 121st Annual Convention, Honolulu, HI. AERA Online Paper Repository http://www.aera.net/repository.
*Goldstein, D., & Alibrandi, M. (2013). Integrating GIS in the middle school curriculum: impacts on diverse students’ standardized test scores. Journal of Geography, 112(2), 68–74. https://doi.org/10.1080/00221341.2012.692703.
Article Google Scholar
Gonzalez, J. E., Acosta, S., Davis, H., Durodola, S. P., Saenz, L., Soares, D., Resendez, N., & Zhu, L. (2017). Latino maternal literacy beliefs and practices mediating socioeconomic status and maternal education effects in predicting child receptive vocabulary. Early Education and Development, 28(1), 78–95. https://doi.org/10.1080/10409289.2016.1185885.
Article Google Scholar
Goodson, P., Buhi, E., & Dunsmore, S. (2006). Self-esteem and adolescent sexual behaviors, attitudes, and intentions: a systematic review. Journal of Adolescent Health, 38(3), 310–319. https://doi.org/10.1016/j.jadohealth.2005.05.026.
Article Google Scholar
Greenhalgh, T., & Peacock, R. (2005). Effectiveness and efficiency of search methods in systematic reviews of complex evidence: audit of primary sources. British Medical Journal, 331(7524), 1064–1065. https://doi.org/10.1136/bmj.38636.593461.68.
Article Google Scholar
*Guerrero, M., & Sloan, K. (2001). When exemplary gets blurry: a descriptive analysis of four exemplary K-3 Spanish reading programs. Bilingual Research Journal, 25, 173–201. https://doi.org/10.1080/15235882.2001.10162789.
Article Google Scholar
Gutiérrez, K. D., & Penuel, W. R. (2014). Relevance to practice as a criterion for rigor. Educational Researcher, 43(1), 19–23. https://doi.org/10.3102/0013189X13520289.
Article Google Scholar
Hamilton, L. S., Stecher, B. M., & Yuan, K. (2008). Standards-based reform in the United States: history, research, and future directions. Santa Monica: RAND Corporation Retrieved from http://www.rand.org/pubs/reprints/RP1384. Accessed 15 Dec 2019.
Hannes, K., Raes, E., Vangencheten, K., Hayvaert, M., & Dochy, F. (2013). Experiences from employees with team learning in a vocational learning or work setting: a systematic review of qualitative evidence. Educational Research Review, 10, 116–132. https://doi.org/10.1016/j.edurev.2013.10.002.
Article Google Scholar
*Heilig, J. V. (2011). Understanding the interaction between high-stakes graduation tests and English learners. Teachers College Record, 113(12), 2633–2669 Retrieved from http://www.tcrecord.org/Content.asp?ContentId=16201. Accessed 15 Dec 2019.
*Hosp, J. L., Hosp, M. A., & Dole, J. K. (2011). Potential bias in predictive validity of universal screening measures across disaggregation subgroups. School Psychology Review, 40(1), 108–131.
Google Scholar
*Irby, B. J., Tong, F., Lara-Alecio, R., Meyer, D. J., & Rodriguez, L. (2007). The critical nature of language of instruction compared to observed practices and high-stakes tests in transitional bilingual classrooms. Research in the Schools, 14(2), 27–36 Retrieved from http://search.ebscohost.com.srvproxy2.library.tamu.edu/login.aspx?direct=true&db=eue&AN=32428702&site=ehost-live. Accessed 15 Dec 2019.
Jadad, A. R., Moore, R. A., Carroll, D., Jenkinson, C., Reynolds, D. J. M., Gavaghan, D. J., & McQuay, H. J. (1996). Assessing the quality of reports of randomized clinical trials: is blinding necessary? Controlled Clinical Trials, 17(1), 1–12. https://doi.org/10.1016/0197-2456(95)00134-4.
Article Google Scholar
*Johnson, E., & Monroe, B. (2004). Simplified language as an accommodation on math tests. Assessment for Effective Intervention, 29, 35–45. https://doi.org/10.1177/073724770402900303.
Article Google Scholar
Kieffer, M. J., Lesaux, N. K., Rivera, M., & Francis, D. J. (2009). Accommodations for English language learners taking large-scale assessments: a meta-analysis on effectiveness and validity. Review of Educational Research, 79(3), 1168–1201. https://doi.org/10.3102/0034654309332490.
Article Google Scholar
Koretz, D. (2017). The testing charade: pretending to make schools better. Chicago: University of Chicago Press.
Book Google Scholar
Lazear, E. P. (2006). Speeding, terrorism, and teaching to the test. The Quarterly Journal of Economics, 121(3), 1029–1061. https://doi.org/10.1162/qjec.121.3.1029.
Article Google Scholar
Lee, J. (2008). Is test-driven external accountability effective? Synthesizing the evidence form cross-state causal-comparative and correlational studies. Review of Educational Research, 78(3), 608–644. https://doi.org/10.3102/0034654308324427.
Article Google Scholar
*Lee, O., Maerten-Rivera, J., Penfield, R. D., LeRoy, K., & Secada, W. G. (2008). Science achievement of English language learners in urban elementary schools: results of a first-year professional development intervention. Journal of Research in Science Teaching, 45, 31–52. https://doi.org/10.1002/tea.20209.
Article Google Scholar
Li, H., & Suen, H. K. (2012). The effects of test accommodations for English language learners: a meta-analysis. Applied Measurement in Education, 25, 327–346. https://doi.org/10.1080/08957347.2012.714690.
Article Google Scholar
Littell, J., Corcoran, J., & Pillai, V. (2008). Systematic reviews and meta-analysis. New York: Oxford University Press.
Book Google Scholar
*López, O. S. (2010). The digital learning classroom: improving English language learners’ academic success in mathematics and reading using interactive whiteboard technology. Computers & Education, 54, 901–915. https://doi.org/10.1016/j.compedu.2009.09.019.
Article Google Scholar
*Maerten-Rivera, J., Lee, O., & Penfield, R. (2010). Student and school predictors of high-stakes assessment in science. Science Education, 94, 937–962. https://doi.org/10.1002/sce.20408.
Article Google Scholar
*Maerten-Rivera, J., Ahn, S., Lanier, K., Diaz, J., & Lee, O. (2016). Effect of a multiyear intervention on science achievement of all students including English language learners. The Elementary School Journal, 116(4), 600–624. https://doi.org/10.1086/686250.
Article Google Scholar
*Mahon, E. A. (2006). High-stakes testing and English language learners: questions of validity. Bilingual Research Journal, 30, 479–497. https://doi.org/10.1080/15235882.2006.10162886.
Article Google Scholar
*Martiniello, M. (2008). Language and the performance of English-language learners in math word problems. Harvard Educational Review, 78, 333–368 429. Retrieved from https://www.hepg.org/her-home/issues/harvard-educational-review-volume-78-issue-2/herarticle/_652. Accessed 15 Dec 2019.
Article Google Scholar
*Menken, K. (2006). Teaching to the test: how No Child Left Behind impacts language policy, curriculum, and instruction for English language learners. Bilingual Research Journal, 30, 521–546. https://doi.org/10.1080/15235882.2006.10162888.
Article Google Scholar
Menken, K. (2008). English learners left behind: standardized testing as language policy. Clevedon: Multilingual Matters.
Book Google Scholar
Menken, K., & Solorza, C. (2014). No Child Left Behind: accountability and the elimination of bilingual education programs in New York City Schools. Educational Policy, 28(1), 96–125. https://doi.org/10.1177/0895904812468228.
Article Google Scholar
Moja, L. P., Telaro, E., D’Amico, R., Moschetti, I., Coe, L., & Liberati, A. (2005). Assessment of methodological quality of primary studies by systematic reviews: results of the metaquality study cross sectional study. BMJ [British Medical Journal], 330(7499), 1053–1055 Retrieved from http://www.jstor.org/stable/25459599.
Article Google Scholar
*Muyskens, P., Betts, J., Lau, M. Y., & Marston, D. (2009). Predictive validity of curriculum-based measures in the reading assessment of students who are English language learners. The California School Psychologist, 14, 11–21 Retrieved from http://www.casponline.org.
Article Google Scholar
No Child Left Behind Act of 2001. (2001). 20 U.S.C. §1.
*Noble, T., Suarez, C., Rosebery, A., O’Connor, M. C., Warren, B., & Hudicourt-Barnes, J. (2012). “I never thought of it as freezing”: how students answer questions on large-scale science tests and what they know about science. Journal of Research in Science Teaching, 49(6), 778–803. https://doi.org/10.1002/tea.21026.
Article Google Scholar
Office of English Language Acquisition, Language Enhancement, and Academic Achievement for Limited English Proficient Students. (2012). Biennial report to Congress on the implementation of the Title III State Formula Grant Program, school years 2006–07 and 2007–08. Washington, D. C. Retrieved from http://www.ncela.gwu.edu/files/uploads/3/Biennial_Report_0608.pdf. Accessed 15 Dec 2019.
*Olson, K. (2007). Lost opportunities to learn: the effects of education policy on primary language instruction for English learners. Linguistics and Education, 18, 121–141. https://doi.org/10.1016/j.linged.2007.07.001.
Article Google Scholar
*Pacheco, M. (2010). English-language learners’ reading achievement: dialectical relationships between policy and practices in meaning-making opportunities. Reading Research Quarterly, 45, 292–317. https://doi.org/10.1598/RRQ.45.3.2.
Article Google Scholar
Paivio, A. (1986). Mental representations. New York: Oxford University Press.
Google Scholar
Petticrew, M., & Roberts, H. (2006). Systematic reviews in the social sciences: a practical guide. Oxford: Blackwell.
Book Google Scholar
Polanin, J. R., Maynard, B. R., & Dell, N. A. (2017). Overviews in education research: a systematic review and analysis. Review of Educational Research, 87(1), 172–203. https://doi.org/10.3102/0034654316631117.
Article Google Scholar
Popham, W. J. (2008). The role of assessment in federal education programs. Washington: Center on Education Policy Retrieved from http://files.eric.ed.gov/fulltext/ED503908.pdf. Accessed 15 Dec 2019.
Reljić, G., Ferring, D., & Martin, R. (2015). A meta-analysis on the effectiveness of bilingual programs in Europe. Review of Educational Research, 85(1), 92–128. https://doi.org/10.3102/0034654314548514.
Article Google Scholar
*Richardson, R. D., Hawken, L. S., & Kircher, J. (2012). Bias using maze to predict high-stakes test performance among Hispanic and Spanish-speaking students. Assessment for Effective Intervention, 37(3), 159–170. https://doi.org/10.1177/1534508411430320.
Article Google Scholar
*Rivera, C., & Stansfield, C. W. (2004). The effect of linguistic simplification of science test items on score comparability. Educational Assessment, 9, 79–105. https://doi.org/10.1080/10627197.2004.96.
Article Google Scholar
*Roehrig, A. D., Petscher, Y., Nettles, S. M., Hudson, R. F., & Torgesen, J. K. (2008). Accuracy of the DIBELS oral reading fluency measure for predicting third grade reading comprehension outcomes. Journal of School Psychology, 46(3), 343–366. https://doi.org/10.1016/j.jsp.2007.06.006.
Article Google Scholar
Ruiz, R. (1984). Orientations in language planning. NABE Journal, 9(2), 15–34. https://doi.org/10.1080/08855072.1984.10668464.
Article Google Scholar
*Shaftel, J., Belton-Kocher, E., Glasnapp, D., & Poggio, J. (2006). The impact of language characteristics in mathematics test items on the performance of English language learners and students with disabilities. Educational Assessment, 11, 105–126. https://doi.org/10.1207/s15326977ea1102_2.
Article Google Scholar
*Shyyan, V., Thurlow, M. L., & Liu, K. K. (2008). Instructional strategies for improving achievement in reading, mathematics, and science for English language learners with disabilities. Assessment for Effective Intervention, 33, 145–155. https://doi.org/10.1177/1534508407313239.
Article Google Scholar
Slavin, R. E., & Cheung, A. (2005). A synthesis of research on language of reading instruction for English language learners. Review of Educational Research, 75(2), 247–284 Retrieved from http://www.jstor.org/stable/3516050. Accessed 15 Dec 2019.
Solano-Flores, G. (2011). The cultural validity of assessment practices. In M. D. R. Basterra, E. Trumbull, & G. Solano-Flores (Eds.), Cultural validity in assessment: addressing linguistic and cultural diversity (pp. 3–21). New York: Routledge.
Google Scholar
Solano-Flores, G., Wang, C., Kachchaf, R., Soltero-Gonzalez, L., & Nguyen-Le, K. (2014). Developing testing accommodations for English language learners: illustrations as visual supports for item accessibility. Educational Assessment, 19, 267–283. https://doi.org/10.1080/10627197.2014.964116.
Article Google Scholar
Solórzano, R. (2008). High stakes testing: issues, implications, and remedies for English language learners. Review of Educational Research, 78(2), 260–329. https://doi.org/10.3102/0034654308317845.
Article Google Scholar
Spolsky, B. (2012). What is language policy? In B. Spolsky (Ed.), The Cambridge handbook of language policy (pp. 3–15). New York: Cambridge University Press.
Chapter Google Scholar
*Strunk, K. O., & McEachin, A. (2014). More than sanctions: closing achievement gaps through California’s use of intensive technical assistance. Educational Evaluation and Policy Analysis, 36(3), 281–306. https://doi.org/10.3102/0162373713510967.
Article Google Scholar
Sunderman, G. L., Kim, J. S., & Orfield, G. (2005). NCLB meets school realities: lessons from the field. Thousand Oaks: Corwin.
Google Scholar
Supovitz, J. (2009). Can high stakes testing leverage educational improvement? Prospects from the last decade of testing and accountability reform. Journal of Educational Change, 10(2–3), 211–227. https://doi.org/10.1007/s10833-009-9105-2.
Article Google Scholar
Thompson, B. (2006). The foundations of behavioral statistics: an insight-based approach. New York: Guilford.
Google Scholar
Torgerson, C. (2003). Systematic reviews. London: Continuum.
Google Scholar
*Tsang, S.-L., Katz, A., & Stack, J. (2008). Achievement testing for English language learners, ready or not? Education Policy Analysis Archives, 16(1), 1–29 Retrieved from https://epaa.asu.edu/ojs/issue/view/vol16. Accessed 15 Dec 2019.
U.S. Department of Education. (2012). National evaluation of Title III Implementation—a survey of states’ English language proficiency standards. Washington, DC: Author. Retrieved from http://www2.ed.gov/rschstat/eval/title-iii/national-implementation-report.pdf. Accessed 15 Dec 2019.
Umansky, I. M., & Reardon, S. F. (2014). Reclassification patterns among Latino English learner students in bilingual, dual immersion, and English immersion classrooms. American Educational Research Journal, 51(5), 879–912. https://doi.org/10.3102/0162373716664802.
Article Google Scholar
Valentino, R. A., & Reardon, S. F. (2015). Effectiveness of four instructional programs designed to serve English learners: variation by ethnicity and initial English proficiency. Educational Evaluation and Policy Analysis, 37(4), 612–637. https://doi.org/10.3102/0162373715573310.
Article Google Scholar
Valenzuela, A. (2005). Introduction: The accountability debate in Texas: continuing the conversation. In A. Valenzuela (Ed.), Leaving children behind: how “Texas-style” accountability fails Latino youth (pp. 1–32). Albany: State University of New York Press.
Google Scholar
Valenzuela, A. (Ed.). (2016). Growing critically conscious teachers: a social justice curriculum for educators of Latino/a youth. New York: Teachers College.
Google Scholar
*Vanderwood, M. L., Tung, C. Y., & Checca, C. J. (2014). Predictive validity and accuracy of oral reading fluency for English learners. Journal of Psychoeducational Assessment, 32(3), 249–258. https://doi.org/10.1177/0734282913502937.
Article Google Scholar
*Wiley, H. I., & Deno, S. L. (2005). Oral reading and maze measures as predictors of success for English learners on a state standards assessment. Remedial and Special Education, 26(4), 207–214. https://doi.org/10.1177/07419325050260040301.
Article Google Scholar
Willig, A. C. (1985). A meta-analysis of selected studies on the effectiveness of bilingual education. Review of Educational Research, 55(3), 269–317 Retrieved from www.jstor.org/stable/1170389.
Article Google Scholar
*Wolf, M. K., & Faulkner-Bond, M. (2016). Validating English language proficiency assessment uses for English learners: academic language proficiency and content assessment performance. Educational Measurement: Issues and Practice, 35(2), 6–18. https://doi.org/10.1111/emip.12105.
Article Google Scholar
*Wolf, M. K., & Leon, S. (2009). An investigation of the language demands in content assessments for English language learners. Educational Assessment, 14, 139–159. https://doi.org/10.1080/10627190903425883.
Article Google Scholar
*Young, J. W., Cho, Y., Ling, G., Cline, F., Steinberg, J., & Stone, E. (2008). Validity and fairness of state standards-based assessments for English language learners. Educational Assessment, 13, 170–192. https://doi.org/10.1080/10627190802394388.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Educational Psychology, Texas A&M University, College Station, TX, USA
Sandra Acosta & Yolanda Padrón
Department of Educational Psychology and Higher Education, University of Nevada, Las Vegas, NV, USA
Tiberio Garza
Department of Curriculum and Instruction, University of Massachusetts, Lowell, MA, USA
Hsien-Yuan Hsu
Department of Health and Kinesiology, Texas A&M University, College Station, TX, USA
Patricia Goodson
Department of Criminal Justice and Social Work, University of Houston-Downtown, Houston, TX, USA
Heather H. Goltz
Gilbert, AZ, USA
Anna Johnston

Authors

Sandra Acosta
View author publications
You can also search for this author in PubMed Google Scholar
Tiberio Garza
View author publications
You can also search for this author in PubMed Google Scholar
Hsien-Yuan Hsu
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Goodson
View author publications
You can also search for this author in PubMed Google Scholar
Yolanda Padrón
View author publications
You can also search for this author in PubMed Google Scholar
Heather H. Goltz
View author publications
You can also search for this author in PubMed Google Scholar
Anna Johnston
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sandra Acosta.

Ethics declarations

Conflict of interest

The authors declare they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Our task in the present systematic review is to capture the impact on English learners (ELs) of the accountability culture as framed by the No Child Left Behind Act of 2001 (NCLB). Specifically, we aim to identify and review what researchers considered important in relation to high-stakes achievement testing and ELs [also known as limited English proficient (LEP) students]. To accomplish this, we examine original studies published between 2001 and 2016. This time span begins with the passage of NCLB and ends with the reauthorization of the Elementary and Secondary Education Act (ESEA) in 2015: the Every Student Succeeds Act (ESSA) in 2015, which took effect during the 2016–2017 school year.

Electronic supplementary material

ESM 1

(DOCX 870 kb)

ESM 2

(DOCX 832 kb)

ESM 3

(DOCX 44 kb)

ESM 4

(DOCX 66 kb)

ESM 5

(DOCX 44 kb)

ESM 6

(DOCX 45 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Acosta, S., Garza, T., Hsu, HY. et al. The Accountability Culture: a Systematic Review of High-Stakes Testing and English Learners in the United States During No Child Left Behind. Educ Psychol Rev 32, 327–352 (2020). https://doi.org/10.1007/s10648-019-09511-2

Download citation

Published: 18 December 2019
Issue Date: June 2020
DOI: https://doi.org/10.1007/s10648-019-09511-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Accountability Culture: a Systematic Review of High-Stakes Testing and English Learners in the United States During No Child Left Behind

Abstract

Access this article

Similar content being viewed by others

Consequential research of accountability testing: the case of the CET

Interconnection between constructs and consequences: a key validity consideration in K–12 English language proficiency assessments

The Validity and Social Legitimacy of the Chilean National Assessment of Learning Outcomes (SIMCE): The Role of Evidence and Deliberation

Notes

References

^*Studies included in the systematic review

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Electronic supplementary material

ESM 1

ESM 2

ESM 3

ESM 4

ESM 5

ESM 6

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The Accountability Culture: a Systematic Review of High-Stakes Testing and English Learners in the United States During No Child Left Behind

Abstract

Access this article

Similar content being viewed by others

Notes

References

*Studies included in the systematic review

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

^*Studies included in the systematic review