1 Overview

The definition commonly attributed to automated evaluation of writing is “the ability of computer technology to evaluate and score written prose” (Shermis & Burstein, 2003, p. xiii). Digital writing environments that provide automated feedback on writing are known as Automated Writing Evaluation (AWE) tools. AWE tools originated from automated essay scoring (AES) (chapter “Automated Scoring of Writing”). It must be noted, however, that AWE is not interchangeable with AES or its sister term, automated essay evaluation (AEE). Unlike AES/AEE, whose focus is on summative assessment, AWE tools support the process of writing by providing formative feedback that is typically displayed on an engaging graphic interface. Moreover, AWE is a more encompassing term, in which writing covers any genre and evaluation extends to uses beyond scoring.

As AES derivatives, AWE tools employ computational engines that rely on natural language processing (NLP), artificial intelligence, and statistical modelling approaches (chapters “Automated Text Generation and Summarization for Academic Writing” to “Analytic Techniques for Automated Analysis of Writing”; also Burstein et al., 2003a) to analyze lexical, syntactic, semantic, and discourse traits in written texts. Therefore, their design, development, and implementation are grounded in multi-disciplinary perspectives including Applied Linguistics, Educational Measurement, Computer and Information Sciences, Psychometrics and Quantitative Psychology, Cognitive Psychology and Psycholinguistics, and Writing Studies in first and second languages.

Noting that a “comprehensive history of AWE has yet to be written,” Hazelton et al. (2021) delineate AWE tools into three generations based on how technological capabilities developed over time (p. 43). The first-generation exemplar, in their view, is Project Essay Grade (PEG), introduced in the 1960s. While PEG is indeed the pioneer that spearheaded AWE, it aimed to address the challenge of time-intensive grading of student writing and thus essentially falls within the purview of AES (chapter “Automated Scoring of Writing”). Second-generation AWE, which emerged in the 1980s, also primarily as an efficiency-driven technology, includes tools that provide immediate individualized feedback, aiming to alleviate the labour-intensive task of responding to student writing in formative ways. The Writer’s Workbench was among the first tools of this kind, providing feedback on aspects of writing such as errors and topic sentences; it was followed by Criterion, MY Access!, WriteToLearn, and others. It is worth noting that, while AWE tools initially hardly accounted for the needs of second and foreign language learners, language learning theories began to gain a steady influence on AWE research and development in the 2000s (Xi, 2010). The third generation of AWE has taken a “left turn,” expanding the ability of this technology to analyze student writing across academic disciplines and writing genres (Burstein et al., 2016a, p. 6). The most recent third-generation tools (e.g., the freely available Writing Mentor app, installed from the Google Docs add-on store) are approaching the functionality of intelligent tutoring systems (ITSs), since they provide guided activities to complement the feedback. The Writing Pal is the only representative ITS that has an AWE component (McCarthy et al., 2022). The Writing Pal is modular, and its AWE component can be used on its own for feedback as well as for instruction (chapter “The Future of Intelligent Tutoring Systems for Writing”).

2 Core Idea of the Technology

AWE tools serve the purpose of formative assessment and provide practice for writing development. They have been promoted and largely implemented as enhancements for process writing instruction, emphasizing the value of multiple drafting fostered by feedback and other forms of scaffolding. Aligned with the move towards individualized teaching and assessment, AWE is deemed to enhance the dynamics of classroom instruction and to also ensure cross-curricular consistency of writing evaluation. For students, automated feedback is intended as a motivational factor that can guide revision and sustain learner autonomy.

Considering that feedback is at the core of AWE, a two-pronged categorization of AWE, alternative to Hazelton et al.’s (2021), can be conceptualized based on the origin of the automated feedback. As mentioned above, most existing AWE tools are descendants of traditional AES used to assess writing performance on constructed-response writing tasks. Such tools can be categorized as assessment-driven. Their feedback is corrective in nature, flagging writing traits that may need to be addressed. Most assessment-driven AWE tools are asynchronous and attempt to address grammatical errors as well as more global discourse traits. There are also a few tools, such as Grammarly and CyWrite, that deliver the feedback synchronously. The second category comprises genre-based AWE, whose design is guided by discourse analysis studies of the target domain, learning theories, and pedagogical principles (Cotos, 2022). The first genre-based automated analysis tool, Mover, was introduced by Anthony and Lashkia (2003); the Research Writing Tutor (RWT) and AcaWriter are more recent. What sets them apart is that their asynchronous feedback is operationalized to reflect the rhetorical conventions of specific genres rather than to facilitate error correction. The development of genre-based AWE requires large-scale corpus-based research on particular genres, which is why there are still very few such tools. This is perhaps the reason why they were not explicitly noted within Hazelton et al.’s (2021) third generation.

Both assessment-driven and genre-based tools have been used by teachers as a complement to instruction and by writers as aids for independent, self-paced, and self-regulated writing and revision. Assessment-driven AWE has been widely implemented at all levels of formal instruction, from elementary to higher education and to non-traditional adult learning environments. Higher education has witnessed the most implementations in English composition courses at the undergraduate level as well as in university academic writing courses for English as a second or foreign language. There is hardly a ‘prescribed’ use. Rather, teachers make decisions regarding the uses of AWE based on instructional needs and learning goals or based on their level of familiarity with the tool. Some teachers encourage students to process and respond to AWE feedback on lower-level concerns and complement that with their own feedback on more global aspects of writing. Others prefer to incentivize students’ revision by directing them to the summative, scoring-based feedback on specific writing traits. Yet others tend to disregard automated formative feedback and resort to scoring capabilities only for assessment or test preparation purposes (Stevenson, 2016).

3 Functional Specifications

AWE tools are user-facing systems powered by back-end engines used to generate feedback. For assessment-driven tools, these are scoring engines; for example, Criterion as well as Turnitin’s Revision Assistant and Draft Coach use e-rater, WriteToLearn uses Intelligent Essay Assessor, and MY Access! uses IntelliMetric (chapter “Automated Scoring of Writing”). For genre-based tools, the engines are analytic, trained to ‘learn’ the rhetorical traits of the genre from a representative annotated corpus and then apply the ‘learned’ information to identify those traits in new texts. These analytic engines use different text classification approaches (chapter “Analytic Techniques for Automated Analysis of Writing”). For example, AntMover uses a Naïve Bayes classifier, RWT uses support vector machine classifiers, and AcaWriter uses a rule-based parser. Distinct from its counterparts, whose classifiers adopt models of consecutive words, AcaWriter’s parser identifies words or expressions and syntactic dependencies that may instantiate rhetorical concepts.
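To make the classifier-based approach more concrete, the following minimal sketch (in Python, using scikit-learn) shows how a Naïve Bayes classifier of the kind mentioned above might be trained on move-annotated sentences and applied to a new draft. The sentences, move labels, and features below are invented for illustration and do not reproduce any tool’s actual training data or engine.

```python
# A minimal, hypothetical sketch of a sentence-level rhetorical-move classifier,
# loosely in the spirit of the Naive Bayes approach mentioned above.
# The training sentences and move labels are invented for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Tiny invented training set: sentences annotated with move labels.
train_sentences = [
    "Recent studies have increasingly focused on automated feedback.",
    "Little research, however, has examined genre-based feedback.",
    "This paper investigates how writers revise with automated feedback.",
]
train_moves = ["Background", "Gap", "Purpose"]

# Bag-of-words and bigram features feed a multinomial Naive Bayes classifier.
model = make_pipeline(CountVectorizer(ngram_range=(1, 2)), MultinomialNB())
model.fit(train_sentences, train_moves)

# Label each sentence of a new draft with its most probable move.
draft = [
    "Prior work has explored scoring engines for student essays.",
    "It remains unclear how such feedback shapes revision.",
]
for sentence, move in zip(draft, model.predict(draft)):
    print(f"{move}: {sentence}")
```

In operational engines, the annotated corpora are far larger and the feature sets richer, but the underlying principle of learning rhetorical categories from labelled sentences and applying them to new texts is the same.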

Given that the scoring engines are trained to detect numerous characteristics of texts, assessment-driven tools’ feedback is manifold, targeting grammatical forms, syntactic complexity, lexical complexity, style, organization, topical content, idea development, redundancy, relevance, deviance, semantic coherence, mechanics, etc. The formative feedback is commonly embedded in the student’s draft. Some tools flag errors and suggest corrections, which are mostly based on how the scoring engine was trained to evaluate writing but can also draw on individual students’ error correction history (e.g., TechWriter). Summative feedback can also be offered as a performance summary containing a holistic score, a quantification of errors based on the analyzed traits of writing, and hyperlinks to detailed descriptive feedback on each error category. While most AWE tools are for writing in English, some generate multilingual feedback for second language writers (e.g., Criterion and MY Access!).
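As a rough illustration of how such feedback might be represented internally, the hypothetical data structures below model inline error flags embedded in the draft alongside a summative performance summary. The class and field names are assumptions made for illustration, not any tool’s actual schema.

```python
# Hypothetical structures for assessment-driven feedback: inline error flags
# plus a summative performance summary. Names are illustrative only.
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class ErrorFlag:
    trait: str                  # e.g., "Grammar", "Usage", "Mechanics", "Style"
    error_type: str             # e.g., "Fragment", "Missing comma"
    start: int                  # character offset of the flagged span in the draft
    end: int
    comment: str                # formative message shown to the student
    suggestion: Optional[str] = None  # optional suggested correction

@dataclass
class PerformanceSummary:
    holistic_score: int                                         # e.g., 1-6
    error_counts: Dict[str, int] = field(default_factory=dict)  # errors per trait
    detail_links: Dict[str, str] = field(default_factory=dict)  # trait -> descriptive feedback page

@dataclass
class FeedbackReport:
    flags: List[ErrorFlag]          # feedback embedded in the draft
    summary: PerformanceSummary     # summative performance summary
```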

Genre-based exemplars address higher-order concerns related to rhetorical effectiveness as expected by target discourse communities. Their feedback is operationalized per Swales’ (1981) theorizing of genre conventions in terms of communicative goals called ‘moves’ and functional strategies called ‘steps’. Swales’ Create-A-Research-Space (CARS) model, comprising three moves (Establishing a Territory, Identifying a Niche, Addressing the Niche) and their respective steps (e.g., Claiming Centrality, Highlighting a Problem, Stating the Value), is to some extent at the core of all existing genre-based tools’ analytic engines. While different tools articulate and present their feedback in different ways, essentially writers receive feedback indicating what the sentences in their text are doing communicatively. AntMover, trained to analyze research article abstracts, displays the text split into sentences that are labeled with CARS categories. IADE’s feedback visualized the rhetorical composition of research article introductions by color-coding all the sentences in a text for moves, and its successor RWT has expanded this feature with step-level, move-level, and discipline-specific comparative feedback on all the sections of research articles—Introduction-Methods-Results-Discussion/Conclusion (IMRD/C). AcaWriter, on the other hand, gives feedback only for sentences where its rule-based parser identifies concepts indicative of moves (e.g., summarizing issues, describing an open question).
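The rule-based strategy can be illustrated with a minimal, hypothetical sketch in which only sentences matching a cue pattern receive a move label, mirroring the behaviour of providing feedback only where indicative concepts are found. The cue patterns and example sentences below are invented for illustration and are not AcaWriter’s actual rules.

```python
# A hypothetical illustration of rule-based move detection: only sentences
# that match a cue pattern receive a rhetorical label. The regular expressions
# below are invented for illustration, not any tool's actual rules.
import re

MOVE_CUES = {
    "Identifying a Niche": [
        r"\bhowever\b.*\b(little|few|no)\b.*\b(research|studies)\b",
        r"\bremains?\s+(unclear|unknown|an open question)\b",
    ],
    "Addressing the Niche": [
        r"\bthis (paper|study|article)\b.*\b(investigates?|examines?|proposes?)\b",
    ],
}

def label_sentences(sentences):
    """Return (sentence, move-or-None) pairs; unmatched sentences get no feedback."""
    labelled = []
    for sentence in sentences:
        move_found = None
        for move, patterns in MOVE_CUES.items():
            if any(re.search(p, sentence, re.IGNORECASE) for p in patterns):
                move_found = move
                break
        labelled.append((sentence, move_found))
    return labelled

draft = [
    "However, little research has examined feedback on rhetorical structure.",
    "This study examines how writers respond to genre-based feedback.",
    "Participants were recruited from three universities.",
]
for sentence, move in label_sentences(draft):
    print(f"[{move or 'no feedback'}] {sentence}")
```

In practice, such rules are derived from discourse analysis of the target genre and are considerably richer, drawing on syntactic dependencies rather than surface patterns alone.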

Regardless of the origin and nature of the feedback, AWE tools incorporate a vast array of additional scaffolding features for students. In the interest of brevity, I will only mention select examples here. First, automated feedback may be accompanied by interface features enabling students to solicit feedback from their instructor, who can point to more subtle and more global issues not identifiable automatically. There are also features designed to facilitate guided practice and to help foster the core activities of pre-writing, drafting, and revision. Criterion, for instance, contains a Make a Plan feature with a number of templates for planning strategies. MY Access! offers graphical pre-writing tools to assist students with the formulation and organization of their ideas, a word bank for appropriate vocabulary use, a checklist of scoring rubrics for self-assessment, a so-called ‘writing coach’ suggesting revision goals and remediation activities, and an ‘editor’ that supplies suggestions for editing. In addition to such features, WriteToLearn uses text-to-speech technologies so that students can hear the text and see the definitions of words in on-demand pop-up windows. MI Write and MI Tutor, the legacy of PEG, offer students graphic organizers, peer review options for giving and receiving peer feedback, and portfolios that allow them to chart their progress toward grade-level proficiency. The WritingRoadmap embeds model sentence diagrams, tutorials on grammar and syntax, a thesaurus, and tips for essay improvement. RWT provides video tutorials for all IMRD/C moves and steps, a move/step annotated multi-disciplinary corpus of published research articles, and a concordancer searchable for examples of all the steps in all the IMRD/C texts in the corpus. Being an ITS, the Writing Pal provides the most tailored scaffolding, focused on writing strategies during the pre-writing, drafting, and revising stages of the writing process.

Apart from this variety of student-focused features, most tools integrate features for teachers. Perhaps most popular are features like chat or electronic sticky notes that bring teachers’ comments into the feedback loop for the student. Writing prompts, whether ready-made or created by teachers based on stimulus reading materials pre-packaged in the system, enable teachers to customize writing assignments for better alignment with learning objectives. Additionally, there are options for monitoring students’ use of available scaffolding features and for tracking student progress, as well as for generating proficiency reports for individual students, for full classes, or across demographic groups.

4 Main Products

While there are a number of AWE tools that can be considered main products, this section reviews one representative assessment-driven tool and one genre-based tool. Among the former, Criterion is perhaps the most researched and widely implemented commercial product, with features similar to most such tools. Genre-based AWE is well represented by RWT. This non-commercial tool can be considered paradigmatic because it is truly genre-specific, with features most comprehensively covering the rhetorical traits characteristic of the research article genre.

4.1 Criterion

The Educational Testing Service developed Criterion, formally called The Criterion Online Writing Evaluation service, for writers of various age groups in primary, secondary, and higher education settings. The developer describes it as an instructor-led system aimed at helping teachers assess student writing performance and progress, and at providing students with self-paced, independent writing practice guided by immediate automated feedback. Criterion’s technical capabilities are based on two complementary applications: e-rater and Critique. The former is a scoring engine that assigns a holistic score based on statistical modelling of how linguistic and text features are related to overall writing quality; the latter contains a suite of programs that generate feedback (Burstein et al., 2003b). The feedback covers five major traits: grammar, usage, mechanics, style, and organization and development, detailing specific types of errors within each trait (see Table 1).
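As a rough, hypothetical illustration of the kind of statistical modelling just described, a scoring engine can be thought of as a regression from automatically extracted text features to a holistic score. The features, training values, and resulting weights below are invented and do not reflect e-rater’s actual feature set or model.

```python
# A hypothetical sketch of feature-based holistic scoring: a regression from
# text features to a score on a 1-6 scale. The features and values are
# invented for illustration and do not reflect e-rater's actual model.
import numpy as np
from sklearn.linear_model import LinearRegression

# Each row holds invented features for one training essay:
# [error rate, mean sentence length, type-token ratio, discourse units]
X_train = np.array([
    [0.12, 11.0, 0.38, 3],
    [0.06, 15.5, 0.45, 5],
    [0.02, 18.2, 0.52, 7],
    [0.09, 13.0, 0.41, 4],
])
y_train = np.array([2, 4, 6, 3])  # human-assigned holistic scores

model = LinearRegression().fit(X_train, y_train)

# Score a new essay and clamp the prediction to the 1-6 scale.
new_essay_features = np.array([[0.04, 16.8, 0.48, 6]])
raw_score = model.predict(new_essay_features)[0]
holistic_score = int(round(min(max(raw_score, 1), 6)))
print(f"Predicted holistic score: {holistic_score}")
```

In operational engines the feature set is far richer and the model is trained on large pools of human-scored essays, but the principle of mapping measurable text traits onto a score scale is the same.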

Table 1 Criterion’s feedback traits and error types

It takes Criterion less than twenty seconds to assess a submitted text and generate a performance summary presenting a holistic score, the number of errors, and feedback comments corresponding to each error. Note that it does not display the errors of all types at the same time; rather, students can view the feedback selectively by clicking on one of the tabs of the Trait Feedback Analysis Menu, which opens a trait-specific feedback screen. Figure 1 is a screenshot of the feedback screen for Style (Repetition of words). A roll-over message appears when moving the cursor over a highlighted word, expression, or stretch of text, presenting formative feedback on the identified type of error; e.g.:

Fig. 1 Example screenshot of the feedback screen for Style (Repetition of words) in Criterion: the right panel, labeled Repetition of Words, highlights a repeated word in the essay, while the left panel lists the summary of style comments and details such as the number of words and sentences (from ETS, 2007)

  • Grammar—Fragment or missing comma: This sentence may be a fragment or may have incorrect punctuation. Proofread the sentence to be sure that it has correct punctuation and that it has an independent clause with a complete subject and predicate.

  • Usage—Missing comma: You may need to place a comma after this word.

  • Style—Passive voice: You have used the passive voice in this sentence. Depending upon what you wish to emphasize in the sentence, you may wish to revise it using the active voice.

Another form of feedback is provided along with the holistic score, summarizing the trait feedback analysis to reflect the overall quality of the text and the number of errors (per trait and per error type). To help students understand the meaning of their score, Criterion makes available a score guide with descriptions for basic, proficient, and advanced levels. According to the First Year 6pt Scale—Criterion Scoring Guide (n.d.), an author whose essay scores 2 out of 6, for instance, would receive feedback specifying the following weaknesses of the essay:

You have work to do to improve your writing skills. You probably have not addressed the topic or communicated your ideas effectively. Your writing may be difficult to understand. In one or more of the following areas, your essay:

  • Misunderstands the topic or neglects important parts of the task

  • Does not coherently focus or communicate your ideas

  • Is organized very weakly or doesn’t develop ideas enough

  • Generalizes and does not provide examples or support to make your points clear

  • Uses sentences and vocabulary without control, which sometimes confuses rather than clarifies your meaning.

Criterion’s feedback, multiple revision, and unlimited resubmission features are meant to support revising and editing. Like other AWE tools, Criterion has additional features to support students’ planning and writing, offering planning templates editable while completing the writing assignment, a catalogue of well-written essays, and a thesaurus. Its online Writer’s Handbook can be tailored to different levels of English language proficiency, to a certain first language (Spanish, Simplified Chinese, Japanese, Korean), and to elementary, middle school, high school, or college educational levels. Students’ communication and access are supported by features that facilitate dialogue and the development of online portfolios. Teachers, in turn, can enable available pre-writing features, designate a particular planning template, adjust assignments to target specific abilities, and select resources appropriate for the development of those abilities. They can also draw on a library of more than 400 essay topics at various skill levels and pertaining to different kinds of essays (narrative, expository, persuasive). When designing a writing assignment, teachers can select options most suitable for the writing task (e.g., time allocated, number of allowed submissions of revised text). Importantly, they can set the type of automated feedback to be displayed and can also comment on their students’ work through different modalities. For a description of how teachers and students can engage with this tool procedurally, see Lim and Kahng (2012).

4.2 Research Writing Tutor (RWT)

RWT was developed for advanced academic writers needing to learn how to produce publishable-quality research articles responsive to the expectations of their socio-disciplinary discourse communities (Cotos, 2014). This tool comprises three standalone yet interconnected modules. ‘Understand Writing Goals’ is a learning module, which contains multimodal content explaining the communicative purposes of the moves and the functions of the steps (see the IMRD/C move-step framework in Cotos et al., 2015), as well as the patterns of language use characteristic of those rhetorical traits. ‘Explore Published Writing’ serves as a demonstration module with IMRD/C Section Structure, Move/Step Examples, and original Research Articles components, which expose students to different forms of a move/step annotated corpus of 960 published articles representative of authentic discourse in 32 disciplines. ‘Analyze My Writing’ is the AWE feedback module, providing different forms of individualized automated feedback designed for scaffolded revision.

A notable strength of this tool is its integrative theoretical grounding in socio-disciplinary and cognitive dimensions of scientific writing that are important for the development of genre knowledge and research writing competence. From a socio-disciplinary standpoint, the features in the feedback module are designed to render the rhetorical composition of research articles (informed by Swalesian genre theory) and the language choices that instantiate functional meaning (informed by systemic functional linguistics). From a cognitive standpoint, it operationalizes tenets from writing, language learning, and skill acquisition theories. With this grounding, RWT’s features depicted in Fig. 2 are designed to create the learning affordances summarized in Table 2. Taken together, its features and affordances create conditions for scaffolded writing practice, during which students are able to detect and address discourse-level shortcomings in their drafts, whether related to rhetorical structure, intended mental representation of ideas, or language choices needed to convey specific functional meanings (Cotos, 2017; Cotos et al., 2017, 2020).

Fig. 2 Screenshot of the features of the ‘Analyze My Writing’ feedback module of RWT: options for editing the Introduction and adding the Methods, Results, and Discussion sections appear at the top, above a textbox containing the draft and an Analyze button used for iterative revision and submission

Table 2 The features and affordances of the ‘Analyze My Writing’ feedback module of RWT

RWT is used in various contexts, including credit-bearing writing courses employing data-driven learning pedagogy, hands-on workshops, peer review group activities, individual tutoring with writing consultants, and independent revision. The feedback and scaffolding features provide writers with exposure to authentic disciplinary discourse, directions for how to discern the writing norms of their discourse community, guided writing practice, and productive interaction.

5 Research

Over the last decade, the fields of AES/AEE and AWE have emerged as distinct areas of scholarship. Both areas still adjoin under the validity argument framework (Kane, 1992), which consists of a chain of inferences that guide research. While describing the framework is beyond the scope of this chapter, it is necessary to highlight it as an increasingly prolific heuristic adopted in AWE studies. It has enabled researchers to consolidate various types of empirically supported claims into a systematic progression of inferences about the effectiveness of AWE tools, thus strengthening the defensibility of decisions regarding their uses. For Criterion and RWT reviewed above, claims systematized under this framework can be found in Chapelle et al. (2015) and in Cotos (forthcoming), respectively. Unlike more recent studies, earlier works, many of which were reviewed in meta-analyses (Graham et al., 2015; Nunes et al., 2022; Stevenson & Phakiti, 2014), are not explicitly positioned within the validity argument framework but still address different inferences. Table 3 synthesizes the findings from example studies to show that there is substantial positive evidence for the successful application of AWE across these key areas.

Table 3 AWE validity argument inferences and claims

As with other educational technologies, some studies unveil rebuttal evidence, or issues that weaken the strength of the claims one would like to make about AWE. For instance, Extrapolation cannot be confidently claimed because AWE feedback may not always be as good as teacher or peer feedback (Dikli & Bleyle, 2014). Impact may be affected because assessment-driven AWE feedback tends to promote surface-level revisions, may have no or low uptake on some writing traits, and can inhibit revising of propositional content (Li et al., 2015; Ranalli, 2021; Ware, 2014).

Such variability in outcomes is not surprising because it depends not so much on the tools themselves as on how they are implemented. Moreover, the research methods adopted stem from different disciplinary paradigms. Mixed methods have gained ground, but there is a clear need for longitudinal studies examining the effects of AWE feedback over an extended period of time. Variability in findings is also due to differing assumptions about what constitutes effectiveness (e.g., engagement, motivation, affect, writing improvement, skill development in first and second language) and how it is measured. Measures like error frequency and error reduction, for instance, are confined to impact on revised texts and do not extrapolate well to new compositions. In future research, revision quantity should be reported along with large-scale analyses of specific qualitative changes in writing performance. To avoid overemphasizing writing products, these should be examined vis-à-vis the process of writing with AWE feedback, and interaction behaviours should be scrutinized to reveal the metacognitive processes activated by writers along with the strategies they develop when drafting and revising.

6 Implications of This Technology for Writing Theory and Practice

Considering the snapshots of the research and the representative tools above, it can be argued that AWE technology appears to have reached significant milestones toward its specific goals of addressing the challenges inherent in writing development and the teaching of writing. However, this does not mean that AWE has arrived at a standard solution. First, assessment-driven and genre-based strands have been developing in parallel. In the future, it is likely that a fourth generation of AWE will emerge, drawing on the features and affordances of both assessment-driven and genre-based tools. The AWE evolution will also leverage the capabilities of ITSs with animated agents (such as those of the Writing Pal, chapter “The Future of Intelligent Tutoring Systems for Writing”) and biometric technology (chapter “Investigating Writing Processes with Keystroke Logging”) to personify the feedback and generate interactive, strategic, and data-driven feedback fit for particular stages of the writing process.

To materialize these envisioned directions, it is of utmost importance for research to enrich existing writing theories. One possible scenario falls under the framework of cognitive writing models, where theoretical understanding could be deepened in terms of whether and how cognitive writing modelling applies to the revision process when it is assisted by AWE tools. Empirical investigations of the effects of cognitive mechanisms activated during AWE-assisted revision will have direct implications for writing theory, as the results will inform an enhanced cognitive model of writing that incorporates the role of technology as the digital writing environment.

This, in turn, will have ramifications for the next generation of AWE, as it will enable developers to efficiently map metacognitive participatory engagement and to design an AWE-assisted writing conceptual ‘corridor’ linkable to different realizations of cognitive activities. In other words, when developing writers appear to drift away from critical cognitive and metacognitive paths, advanced artificial intelligence-enabled features might steer them through successful AWE-interaction trajectories with feedback and scaffolding that would facilitate the activation of appropriate aspects of metacognition at appropriate stages of drafting and revision (see Banawan et al., chapter “The Future of Intelligent Tutoring Systems for Writing”).

Furthermore, research conducted in different instructional settings with different learner characteristics and targeting different genres will address the relationship between the cognitive processes activated during AWE-facilitated writing and the instructional practices brought into play by teachers. This intersection with practice will yield potentially generalizable insights informing principles for creating optimal digital conditions for AWE-supported writing skill development and implementation guidelines for effective broader use and integration. Those principles and guidelines would be developed to support possible variations in enactment and to allow practitioners to create AWE-facilitated instructional ecosystems that would be appropriate for different types of learners, contexts, and writing tasks.

Before this and other theory-research-practice concatenation scenarios become reality, teachers are encouraged to begin developing what Argyris (1997) terms theory-in-use models for the educational effectiveness of an innovation. Hazelton et al. (2021) demonstrate two theory-of-action models based on instructors’ standpoints for using an AWE tool (Writing Mentor) with non-traditional adult learners and with two-year college students. Their models account for the features of the tool (as instances of digital-technology mediation of the writing construct), demonstrated and hypothesized pedagogical actions (as defined teaching objectives), and intended and unintended consequences (positive and negative, intermediate and long-term effects). With all these model components maintaining a constant focus on learners, Hazelton et al. (2021) argue that the pedagogical future for formative AWE “may best be charted by standpoint theory of action” (p. 81).

7 AWE Tools

See Table 4.

Table 4 Select AWE tools