Previous research studies indicate that developing writing skills is a challenging process particularly for EFL learners. Academic writing puts an additional burden on the learners’ shoulders as it requires some further advanced skills, such as genre awareness, lexical flexibility, and complex syntactic knowledge, to name but a few. Corpus Linguistic Approaches to language analysis (i.e., Data-Driven Learning) has the potential to guide L2 writers in their attempt to follow the academic genre and learn the required writing skills inductively. Corpora can be exploited in three stages: observation of concordance evidence, classification of salient features and generalization of rules. Learners as the discoverers of language in this approach can benefit from the versatile features of corpora and learn from the patterns they observe through the concordance lines. In the light of the given approach and its potential to create more autonomous EFL learners, this chapter attempts to (a) explain what data-driven learning is and how it may shape the learning experience in an EFL context, (b) elaborate on how corpora can guide EFL learners in academic writing and (c) provide some hands-on uses of corpora in teaching/learning (academic) writing.
- Academic writing
- Data-driven learning
- Corpus linguistics
This is a preview of subscription content, access via your institution.
Tax calculation will be finalised at checkout
Purchases are for personal use onlyLearn about institutional subscriptions
See the website https://www.coventry.ac.uk/research/research-directories/current-projects/2015/british-academic-written-english-corpus-bawe/ for more details.
See the website https://www.sketchengine.eu/cambridge-learner-corpus/ for more details.
See the website https://www.english-corpora.org/coca/ to reach the interface.
Go to the website http://www.collocaid.uk to reach the interface.
See the website http://www.just-the-word.com for more information.
See the website https://skell.sketchengine.co.uk/run.cgi/skell for more information.
See the website http://pelcra.pl/hask_en/ for more information.
See the website https://www.lextutor.ca/vp/eng/ for more information.
See the website https://writecrow.org/ for more information.
See the website http://corpora.lancs.ac.uk/lancsbox/ for more information.
Aston, G. (2015). Learning phraseology from speech corpora. In A. Lenko-Szymanska & A. Boulton (Eds.), Multiple affordances of language corpora for data-driven learning (pp. 65–84). John Benjamins.
Baisa, V., & Suchomel, V. (2014). SkELL – Web interface for English language learning. In Eighth workshop on recent advances in Slavonic natural language processing (pp. 63–70). Tribun EU.
Boulton, A. (2009). Testing the limits of data-driven learning: Language proficiency and training. ReCALL, 21(1), 37–54.
Boulton, A. (2010a). Data-driven learning: Taking the computer out of the equation. Language Learning, 60/3, 534–572.
Boulton, A. (2010b). Language awareness and medium-term benefits of corpus consultation. In A. Gimeno Sanz (Ed.), New trends in computer-assisted language learning: Working together (pp. 39–46). Macmillan ELT.
Boulton, A., & Cobb, T. (2017). Corpus use in language learning: A meta-analysis. Language Learning, 67(2), 348–393.
Brezina, V., Weill-Tessier, P., & McEnery, A. (2020). #LancsBox v. 5.x. [software]. Available at: http://corpora.lancs.ac.uk/lancsbox
Chang, J. Y. (2014). The use of general and specialized corpora as reference sources for academic English writing: A case study. ReCALL, 26(2), 243–259.
Charles, M. (2014). Getting the corpus habit: EAP learners’ long-term use of personal corpora. English for Specific Purposes, 35, 30–40.
Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34, 213–238.
Davies, M. (2008). The Corpus of Contemporary American English (COCA). Available online at https://www.english-corpora.org/coca/
Durrant, P. (2016). To what extent is the Academic Vocabulary List relevant to university student writing? English for Specific Purposes, 43, 49–61.
Ellis, N. C. (2017). Cognition, corpora, and computing: Triangulating research in usage-based language learning. Language Learning, 67(S1), 40–65.
Flowerdew, L. (2010). Using corpora for writing instruction. In A. O’Keeffe & M. McCarthy (Eds.), (pp. 444–457).
Frankenberg-Garcia, A., Rees, G., Lew, R., Roberts, J., Sharma, N., & Butcher, P. (2019). ColloCaid: A tool to help academic English writers find the words they need. In F. Meunier, J. Van de Vyver, L. Bradley, & S. Thouësny (Eds.).
Friginal, E. (2018). Corpus linguistics for English teachers: New tools, online resources, and classroom activities. Routledge.
Hamdan, N., McKnight, P., McKnight, K., & Arfstrom, K. M. (2013). A review of flipped learning. Retrieved from http://www.flippedlearning.org/review
Heatley, A., Nation, I. S. P., & Coxhead, A. (2002). RANGE and FREQUENCY programs. Available at http://www.victoria.ac.nz/lals/staff/paul-nation.aspx
Hoey, M. (2005). Lexical priming. A new theory of words and language. Routledge.
Hunston, S., & Francis, G. (2000). Pattern Grammar: A corpus-driven approach to the lexical grammar of English. John Benjamins.
Johns, T. (1991). From printout to handout: Grammar and vocabulary teaching in the context of data-driven learning. In T. Johns & P. King (Eds.), Classroom concordancing. English Language Research Journal, 4, 27–45.
Lee, H., Warschauer, M., & Lee, J. H. (2019). The effects of corpus use on second language vocabulary learning: A multilevel meta-analysis. Applied Linguistics, 40(5), 721–753.
Lewis, M. (1993). The lexical approach. The state of ELT and a way forward. Language Teaching Publications.
Luo, Q., & Liao, Y. (2015). Using corpora for error correction in EFL learners’ writing. Journal of Language Teaching and Research, 6(6), 1333–1342.
Luo, Q., & Zhou, J. (2017). Data-driven learning in second language writing class: A survey of empirical studies. International Journal of Emerging Technologies in Learning, 12(3), 182–196.
Mueller, C., & Jacobsen, N. (2016). A comparison of the effectiveness of EFL students’ use of dictionaries and an online corpus for the enhancement of revision skills. ReCALL, 28(1), 3–21.
Nation, P. (2001). Learning vocabulary in another language. Cambridge University Press.
Nesi, H., Gardner, S., Thompson, P., et al. (2008). British academic written English corpus. Oxford Text Archive. http://hdl.handle.net/20.500.12024/2539
Nesselhauf, N. (2004). Learner corpora and their potential for language teaching. In J. Sinclair (Ed.), How to use corpora in language teaching (pp. 125–152). Benjamins.
Nolen, M. (2017). Exploring corpora: Developing exploratory habits with data-driven learning in a study abroad setting [Unpublished Master’s paper]. Department of Applied Linguistics and ESL, Georgia State University.
Paquot, M. (2010). Academic vocabulary in learner writing: From extraction to analysis. Continuum.
Pérez-Paredes, P. (2019). A systematic review of the uses and spread of corpora and data-driven learning in CALL research during 2011–2015. Computer Assisted Language Learning, 35(1–2), 36–61.
Pęzik, P. (2014). Graph-based analysis of collocational profiles. In V. Jesenšek & P. Grzybek (Eds.), Phraseologie Im Wörterbuch Und Korpus (Phraseology in Dictionaries and Corpora) (pp. 227–243. ZORA 97). Filozofska fakuteta.
Römer, U. (2011). Corpus research applications in second language teaching. Annual Review of Applied Linguistics, 31, 205–225.
Schmitt, N. (2010). Researching vocabulary: A vocabulary research manual. Palgrave Macmillan.
Sinclair, J. M. (1991). Corpus concordance collocation. Oxford University Press.
Staples, S., & Dilger, B. (2018). Corpus and repository of writing [Learner corpus articulated with repository]. Available at https://crow.corporaproject.org
The British National Corpus, version 3 (BNC XML Edition). (2007). Distributed by Bodleian Libraries, University of Oxford, on behalf of the BNC Consortium. http://www.natcorp.ox.ac.uk/
Tomasello, M. (2003). Constructing a language: A usage-based theory of language acquisition. Harvard University Press.
Tribble, C. (2015). Teaching and language corpora: Perspectives from a personal journey. In A. Leńko-Szymańska & A. Boulton (Eds.), Multiple affordances of language corpora for data-driven learning (pp. 15–36). John Benjamins.
Yanto, E. S., & Indra Nugraha, S. (2017). The implementation of corpus-aided discovery learning in English grammar pedagogy. Journal of ELT Research, 2, 66–83.
Yoon, H. (2008). More than a linguistic reference: The influence of corpus technology on L2 academic writing. Language, Learning and Technology, 12(2), 31–48.
Yoon, H., & Hirvela, A. (2004). ESL student attitudes toward corpus use in L2 writing. Journal of Second Language Writing, 13, 257–283.
Editors and Affiliations
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Cangır, H. (2022). Using Corpus Tools for Academic Writing in EFL Settings: A Data-Driven Learning Approach. In: Yangın-Ekşi, G., Akayoglu, S., Anyango, L. (eds) New Directions in Technology for Writing Instruction. English Language Education, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-031-13540-8_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13539-2
Online ISBN: 978-3-031-13540-8