Key message

  • The fall armyworm is a maize pest in the Americas, Africa and Asia.

  • Several methods are inconsistently used to assess the leaf damage it causes.

  • We reviewed the literature and tested different leaf damage scales.

  • Proposed scales in order of increasing detail and workload are:

    • Yes-No damage scale.

    • Simple 1–5 whole plant damage scale.

    • Davis’ 0–9 whorl & furl damage scale

    • Williams’ 0–9 whole plant damage scale.

    • Toepfer & Fallet 0.0–4.0 leaf damage index.

  • Regressions allow comparisons among damage scales.

Introduction

Damage scales are widely used to rate the impact of pests in agriculture, as well as to assess the efficacy of pest management measures. Unfortunately, it is often not easy to precisely quantify damage, for example a leaf area chewed by an insect or rotted by bacteria, or the decline in quality of produce, such as an apple with some, or many leaf spots of different sizes. Therefore, researchers and practitioners often apply scales of different types to rate damage (Velleman and Wilkinson 1993). However, this approach often leads to ordinal data which are difficult to analyse with parametric statistics (Stevens 1951). Other problems are unsure links between frequency and severity information (Blong 2003), nonlinear scale-damage relationships, and sometimes not accounting for the growth stage of a crop. This is particularly true for leaf feeders. For example, caterpillars of many lepidopteran pests of maize can attack leaves at different crop stages, some may also bore into tassels or cobs or stems, and others may destroy the vegetation growth point of the plant. Consequently, damage ratings become complicated and are often inconsistent (e.g. Ampofo 1986; Reddy et al. 2011; Eichenseer et al. 2008).

The fall armyworm (Spodoptera frugiperda, Lepidoptera: Noctuidae) is just such an intensively studied pest of maize, especially since its recent invasion of Africa and Asia from its origins in the Americas (Goergen et al. 2016; Ward and Kim 2019). The caterpillars of this Spodoptera species are considerably more voracious than many other noctuid maize pests (Day et al. 2017). Each of its six larval instars feeds extensively on young maize leaves often destroying the vegetation growth point of the plant. The caterpillars may also feed on tassels, silks and young maize cobs.

Since its detection in Africa in 2016 (Goergen et al. 2016), then a few years later in India (Ganiger et al. 2018) and finally in China (Ward and Kim 2019), it has become a major target of research (Li et al. 2019). Following its wide-spread invasion, about 190 papers have been published between 2016 and 2019, with regard to fall armyworm in Africa, and more than 350 papers in China in 2019 alone (Li et al. 2019). In total, more than 5000 articles have been published on fall armyworm between 1910 and 2019 (Li et al. 2019), covering all aspects ranging from diagnostics via life history, invasion history and ecology, population genetics, to pest management and socio-economic impacts.

Some of these studies use the feeding damage caused by the caterpillars to assess, for instance, the spread of the species, the efficacy of pest management measures, or its economic impact. Growers and agricultural extension workers use ratings of damage symptoms as a way to monitor pest densities and then to take pest management decisions. However, there is a lack of consistency among the different proxies used to infer feeding damage caused by fall armyworm, leading to difficulties in making comparisons. For instance, some researchers only assess whether plants are damaged or not, leading to nominal data (Aguirre et al. 2019; FAO and CABI 2019). Others apply ordinal damage ratings (= ranks) from no damage up to heavy damage, or completely destroyed. The most used (but also misused) scale to estimate fall armyworm damage on maize plants is the so-called “Davis’ 0 to 9 whorl & furl damage scale” (Davis et al. 1992). Assessment of the damage severity (i.e. intensity) and frequency (i.e. incidence) using this scale is only performed on the whorl & furl area of a plant, as this is where fall armyworm caterpillars mostly feed. In other cases, researchers assess the proportion and severity of damage of all leaves of a plant (Williams et al. 1989; Sisay et al. 2019a, b; Chinwada 2018). Again others prefer to assess cob damage instead of leaf damage, as this directly relates to yield loss (Prasanna et al. 2018; CIMMYT pers. comm.). Unfortunately, for many of these scales, the descriptive part of each score can be interpreted differently by different users, potentially leading to observer-biases during the rating process (Tversky and Kahneman 1974). Therefore, we would like to initiate a discussion on how to assess leaf feeding damage of the fall armyworm caterpillars in a less problematic and more comparable manner.

In this paper, we aim at determining which leaf damage rating scales for fall armyworm are most consistently used for which purposes, might provide most possibilities for statistical analyses, and would be an acceptable compromise between detail and workload. We first conducted a literature review and then validated the most common scales under field and laboratory conditions. This also allowed the establishment of relationships among scales, as well as to pest population densities. Taking the lessons learnt into account, we subsequently propose simplifications of some of the scales as well as a novel 0.0 to 4.0 fall armyworm leaf damage index which improves precision and possibilities for parametric data analyses. Our findings are intended to help researchers to more consistently use the damage scale that is best for their purpose, therefore allowing better compatibility and comparability among studies in the future.

Material and methods

Reviewing characteristics of damage rating scales

A literature review was conducted to assess which scales are most often and most consistently used to asses leaf damage of fall armyworm. We also tried to detect misinterpretations or incorrect uses of published scales in other studies, as well as advantages and disadvantages of the most-used scales (see criteria below).

We screened about 5000+ articles on fall armyworm for assessment methods of damage caused by this caterpillar to maize (Search term “fall armyworm” Or “Spodoptera frugiperda” in article title, 1910 to 2019, CAB Abstracts, Web of Knowledge; Scopus) (Li et al. 2019). More than 500 papers contained damage information (Search term “fall armyworm” Or “Spodoptera frugiperda” in title AND “damage” in abstract). The following papers contained detailed descriptions of damage scales: Wiseman et al. (1966); Williams et al. (1989); Ghidiu and Drake (1989); Davis et al. (1992); Ayala et al. (2013); Zibanda et al. (2017); Chinwada (2018); Prasanna et al. (2018); Cruz and Turpin (1983); Figueiredo et al. (2006); dal Pogetto et al. (2012); Grijalba et al. (2018); Fotso Kuate et al. (2019); dos Santos et al. (2020). Of these, only the leaf damage rating scales were reviewed, and not the methods that were used to assess maize cob damage or caterpillar numbers.

Then, the most common scales, including a newly proposed one (see result section), were evaluated with regard to the following characteristics adapted from Bong (2003):

  • Clarity: Is the scale easily understandable and not sensitive to differences in interpretation by the user?

  • Simplicity / distinctiveness: Do the scale intervals describe classes of damage that can be easily distinguished?

  • Practicability: Is the scale accurate enough at an acceptable level of workload, without the need for additional tools?

  • Validity / trustworthiness: Are the scale and supporting data appropriately chosen with regard to damage patterns and behaviour of the pest?

  • Resolution: Is the scale fine enough to allow meaningful interpretation of data?

  • Data quality: Are the data sufficiently quantitative and can the data be used in parametric statistical inference ?

  • Robustness: Do minor differences in plant damage not result in large differences in scale categories ?

  • Reliability/consistency: Does use of the scale consistently produce the same result?

  • Spatial/temporal suitability: Is the scale suitable at a range of spatial and temporal scales, i.e. for young and older plants, for single plant trials as well as field scale trials?

  • Applicability: Is and can the scale be used internationally, nationally, locally; under different cropping systems, as well as in the field, semi-field, greenhouse, and laboratory?

  • Intuitively comprehensible result: Does the final value on the scale have immediate recognition for users?

  • Decision utility: Does the analysis provide a clear basis for action, i.e. are pest populations, damage and yield sufficiently correlated?

In addition, the scales analysed here (Tables 1, 2, 3, 4) were presented to and reviewed by about 100 maize experts from 21 countries at the 27th IOBC IWGO conference in 2019 (Toepfer et al. 2019).

Table 1 Simple 1 to 5 damage scale for the fall armyworm. Usually used as a whole plant assessment, but adaptable to a whorl & furl assessment
Table 2 Original and simplified Davis’ 0 to 9 whorl & furl damage scale for the fall armyworm as a quick top-view visual assessment of a maize plant
Table 3 Original and simplified Williams’ 0 to 9 whole plant leaf damage scale for the fall armyworm
Table 4 Novel 0.0 to 4.0 fall armyworm leaf damage index

Field and laboratory validation

Five leaf damage scales were validated (i) under field conditions in five maize fields in Huye and Nyamagabe districts in southern Rwanda (two in November 2019, three in February 2020), as well as under standardised laboratory conditions with potted and caged plants in Switzerland in January 2020 (LD 12:12; 24 °C). Laboratory experimentation was added, as researchers not only face problems with assessing caterpillar damage in the field, but also in the laboratory, particularly when maize may grow slimmer and less strong than under field conditions.

Artificial infestation with three third instar caterpillars per plant was used in the two fields of 2019 based on 60 potted maize plants placed into each field, and in the laboratory in 2020 using two potted maize plants per each of 15 sleeve cages. Natural infestation was used in farmer fields in 2020. Rwandan maize grain hybrids were used in all experiments.

Each scale was used to assess the damage on 60 to 80 individual plants per experiment, and this at three intervals of five days in 2019, as well as once in January and once in February 2020. The number of caterpillars per plant were also recorded when making each damage assessment. The assessments covered the maize growth stages from 4 to 14 leaves. The assessments were individually done by three researchers and two technical assistants, regularly switching the person`s responsibility for one of the scales.

Regressions were applied to estimate how scores of the damage scales corresponded to each other, as well as to the pest`s population density of caterpillars (Kinnear and Gray 2000) (Figs. 1, 2, 3).

Fig. 1
figure 1

Relations between the numbers of fall armyworm caterpillars on maize plants and damage, assessed through different leaf damage scales. Data from three naturally infested fields of vegetative-stage maize in southern Rwanda. Linear regression models plotted with adjusted R2 and standard errors (SE) of estimates. Univariate GLMs’ p values shown

Fig. 2
figure 2

Relations between scales of leaf damage caused by fall armyworm caterpillars. Data from fields with natural fall armyworm infestations of vegetative-stage maize in southern Rwanda (two locations in 2019, three locations in 2020), and from artificially infested maize plants in two laboratory experiments. Linear regression models plotted with adjusted R2 and standard errors (SE) of estimates. Univariate GLMs’ p values shown

Fig. 3
figure 3

Relations between percent damage in experimental plots and the medians of leaf damage scales from the same plots. Data from natural fall armyworm infestations of vegetative-stage maize in southern Rwanda (two locations in 2019, three locations in 2020), and from artificially infested maize plants in two laboratory experiments. Linear regression models plotted with adjusted R2 and standard errors (SE) of estimates. Univariate GLMs’ p values shown

Moreover, the time needed for assessment of each plant was recorded. Univariate GLM was used to analyse the influence of damage level, i.e. the damage score values and plant size on the time needed to carry out an assessment.

Results

Characteristics of fall armyworm damage rating scales

Our review revealed that four scales are most widely applied to visually assess leaf damage of fall armyworm caterpillars on maize plants. They can be ranked in the following order of increasing detail and workload: (1) the nominal “yes-no damage scale” (Gómez et al. 2013; De La Rosa-Cancino et al. 2016; Zibanda et al. 2017; Midega et al. 2018; Aguirre et al. 2019; FAO and CABI 2019; Jaramillo-Barrios et al. 2019; Maruthadurai and Ramesh 2020) (Table 1), the ordinal scales (2) “Simple 1 to 5 whole plant damage scale”(Cruz and Turpin 1983; Figueiredo et al. 2006; dal Pogetto et al. 2012; Grijalba et al. 2018; Fotso Kuate et al. 2019; dos Santos et al. 2020), (3) “Davis’ 0 to 9 whorl & furl damage scale” (Davis et al.1992) (Table 2), and (4) “Williams’ whole plant 0 to 9 leaf damage scale for fall armyworm” (Williams et al. 1989) (Table 3). As the latter two originally only consider assessments after artificial infestation with neonates, and as both scales had been widely adapted and occasionally used incorrectly, we propose simplified scales for both (see below and Tables 2, 3).

The existing scales all provide nominal or ordinal (rank) data types, but are not always compatible and sufficiently informative. They are also difficult to be analysed with parametric statistics. We therefore created a novel 0.0 to 4.0 leaf damage index allowing finer, more linear, and more accurate assessments (see below and Table 4).

The “yes–no damage scale” records whether a maize plant is damaged by the fall armyworm, independent of damage severity (Gómez et al. 2013; De La Rosa-Cancino et al. 2016; Zibanda et al. 2017; Midega et al. 2018; Aguirre et al. 2019; FAO and CABI 2019; Jaramillo-Barrios et al. 2019; Maruthadurai and Ramesh 2020). It is a rough estimate of fall armyworm presence in a certain area and widely used for pest monitoring and decision-making. The advantage of the scale is that (a) it is less labour and time intensive than other scales, (b) it can be applied to all maize growth stages, including tasselling or ripening, and (c) it reflects the fall armyworm population density relatively well because usually only one caterpillar is found per plant (except for neonates that can be numerous) (Fig. 1). A disadvantage is that it does not provide information on the severity of damage. Moreover, its nominal data type is statistically problematic at low sample sizes (Blong 2003). Therefore, depending on the variation in infestation in the experimental area, 20 but often up to 50 plants are usually examined to allow the calculation of reliable percent damage (Dent and Walton 1998). Consequently, frequency analyses are applied. Moreover, as long as there are not too many 0 and 100% values, percent damage can be considered quasi-interval data type, therefore allowing parametric statistical analyses. Otherwise, binomial GLMs and logistic regression will have to be used.

The “simple 1 to 5 whole plant damage scale for fall armyworm” (Table 1) is the most used among the 5-category scales (1 to 5 or 0 to 4 scales; Cruz and Turpin 1983; Figueiredo et al. 2006; dal Pogetto et al. 2012; Grijalba et al. 2018; Fotso Kuate et al. 2019; dos Santos et al. 2020). It allows a rough, quick assessment of frequency and severity of leaf damage. It can be used for pest monitoring and decision-making in armyworm management. The scale usually only considers the plant as a whole and not each leaf separately. However, it can also be used to study short-term treatment effects by only assessing the whorl and furl area of a plant. The scale is of the ordinal data type, resulting in a damage ranking. The advantages of this scale are that (a) it is less labour and time intensive than most other scales, and (b) it can be applied to all maize growth stages, including tasselling or ripening (Table 5). The disadvantages are that (a) fine differences between damage levels cannot be distinguished, and (b) human bias may influence the results due to different judgements on what little, medium or heavy mean with regard to damage. Median and percentiles are calculated rather than arithmetic means. However, its ordinal data type may prevent applying certain parametric statistical inference methods (Blong 2003).

Table 5 Suggested comparison of leaf damage rating scales used for assessing damage by fall armyworm caterpillars

The “Davis’ 0 to 9 whorl & furl damage scale for fall armyworm” (Table 2) (Davis et al. 1992) is of the ordinal data type, and ranks damage in combination with frequency and severity information. It is historically and currently the most used leaf damage scale for the fall armyworm (Wiseman et al. 1996; Davis et al. 1996; Williams et al. 1999; Lynch et al. 1999a, 1999b; Rea et al. 2000, 2002; Buntin et al. 2001, 2004, Buntin 2008; Michelotto et al. 2017; Lourenço et al. 2017; Sisay et al. 2019a; Vassallo et al. 2019; Nboyine et al. 2020; Teixeira Silva et al. 2020). It has been particularly used to assess resistance levels of maize hybrids to fall armyworm feeding. Originally, two such scales were developed (Davis et al. 1992), one for a 7-day and one for a 14-day assessment after artificial infestation with neonates (Table 2). The 7-day assessment has been more frequently used, often for any instar, whereas it was originally developed to assess damage caused by one-week old caterpillars only. Here, we combined the two scales (7- and 14-day assessment) into one that allows assessments at any instar, and we simplified the descriptive part of each damage level to facilitate the distinction between these levels (Table 2). The Davis scale is exclusively used for a quick top-view visual assessment of damage to the leaf whorl and the furl area, because the fall armyworm caterpillar almost exclusively feeds on the leaves inside the whorl. This implies that the scale only assesses recent damage, and not previous damage usually found on older, lower leaves.

Median and percentiles are calculated rather than arithmetic means. The advantage of the scale is that (a) it is a good compromise between detail and labour intensity, (b) relatively fine differences in damage levels can be assessed, (c) it mainly considers recent damage, thus allowing assessment of recent treatment effects, and (d) many studies have used this scale (Table 5). The disadvantages are that (a) human bias may influence the results due to its rather complicated, non-consistently explained rating levels across the scores and therefore differently interpretable descriptive parts, particularly between scores 5 and 7, (b) the scale is made for whorl leaf stages, thus vegetative growth stages only, (c) it only assesses the most recent damage, although this can be an advantage when assessing treatment effects; but is less suitable for longer assessment periods such as for maize tolerance or resistance, (d) it is of the nonlinear, ordinal data type limiting the use of parametric statistical inference (Blong 2003), and finally (e) researchers tend to frequently adapt and change the scale for their own purposes (Chinwada 2018; Prasanna et al. 2018), such as changing it from a 0–9 to a 1–9 scale (Ni et al. 2008; Aguirre et al. 2019; Baudron et al. 2019; Sisay et al. 2019b, a) or using the scale for whole plant assessment (Sisay et al. 2019b). This leads to misinterpretations when comparing studies, something we try to resolve by proposing a simplified instar-independent scale (Table 2).

The “Williams’ whole plant 0 to 9 leaf damage scale for the fall armyworm” (Table 3) (Williams et al. 1989) assesses the frequency and severity of damage across the whole plant, leading to a more comprehensive estimation of plant damage than the Davis scale used for the whorl and furl area only. It is about half as often cited as the Davis scale (e.g. in Williams and Buckley 2008; Phambala et al. 2020). It has been mostly used in maize hybrid trials as a 14-day assessment after artificial infestation with neonates. Here, we combined Williams’ scale and the whole plant Davis’ scale adaptations (Sisay et al. 2019b) into one whole plant assessment scale, and simplified the descriptive part of each damage level, to facilitate the distinction between levels (Table 3). Median and percentiles are calculated rather than arithmetic means. The advantages of this scale are that (a) relatively fine differences in damage levels can be assessed and (b) its descriptive part attempts a quantitative assessment by providing the proportion of damaged leaves of an entire plant, (c) it includes old and recent damage, and (d) it can be applied to young and older plants (Table 5). The disadvantages are that (a) it is time consuming, (b) the scale is made for vegetative growth stages only, and (c) it is of nonlinear, ordinal data type limiting some parametric statistical inference (Blong 2003), and (d) researchers tend to frequently adapt and change the scale to their purposes. This leads to misinterpretations when comparing studies, which we try to resolve here by proposing a simplified instar-independent scale (Table 3).

The novel “0.0 to 4.0 fall armyworm leaf damage index” is a fine-resolution damage assessment that combines severity and frequency information (Table 4). It is designed for research trials, particularly for assessments of treatment effects on the fall armyworm and its damage.

Each leaf is assessed individually for damage, and the scores of each leaf are summed up and divided by the total number of assessed leaves. The minimum index for an entire plant is 0.0 (i.e. no damage), and the maximum index is 4.0 (i.e. total damage). When first, second or third leaves of older and larger plants have dried out (senesced), they are not assessed and not included in the calculation. The obtained index is of ratio data type, which comes close to continuous, linear data (i.e. it is only ordinal at the leaf rating step, but then standardised to the total number of leaves). The advantages of this scale are that (a) fine differences in damage levels can be assessed, which can be useful if precise research results are required, (b) it can be similarly applied to small, young and larger, older plants, and most importantly, and (c) the obtained ratio data allow calculations of means, standard deviations, coefficients of variation, and the application of parametric statistics as for interval data types (Blong 2003) (Table 5). The disadvantages are that (a) this scale is labour intensive in terms of assessment as well as data entry and (b) the scale is suitable for vegetative growth stages only.

Relationships between fall armyworm damage rating and caterpillar populations

None of the leaf damage rating scales are suitable for predicting densities of populations of fall armyworm caterpillars in the field (see low R2 values, and p values in Fig. 1). One reason is that neonates and young caterpillars are often found in larger numbers per plant; but only one larger older caterpillar usually inhabits a single plant. Moreover, several tiny caterpillars cause relatively little damage, whereas a single large caterpillar can heavily damage a plant.

Relationships among fall armyworm damage rating scales

The trendlines in Figs. 2 and 3 reflect how scores relate from one damage scale to another. The “Simple 1 to 5 whole plant damage scale”, “Davis’ 0 to 9 whorl & furl damage scale”, “Williams’ 0 to 9 whole plant leaf damage scale” and the “0.0 to 4.0 fall armyworm leaf damage index” appeared relatively well associated among each other (Fig. 2). Those indicate that comparisons among studies and with previous research might be possible. The scales based on whole plant assessments are slightly more comparable to each other than to the scales based on whorl and furl assessments. Variability appears particularly high at low and high damage levels (for example see variability pattern when relating the Davis` scale to other scales).

As for the “yes–no damage scale”, only averages of larger sets of assessed plants can be associated with the other damage scales. Relationships were, as expected, poor (see larger standard errors of estimates in Fig. 3). To some extent, small and medium damage as per 1 to 5 whole plant damage scale could be related to percentages of damaged plants. However, the more damage, the poorer the relationship. The reason is that even one single caterpillar can cause heavy or even total damage, particularly the older caterpillars on younger plants.

Relationships between efforts of time in damage assessments and detail

Different leaf damage scales provide different levels of detail in the following order from the least detailed “yes–no damage scale” (2 intervals of detail) via the “Simple 1 to 5 whole plant damage scale” (5 intervals), the “Davis’ 0 to 9 whorl & furl damage scale” (9), the “Williams’ 0 to 9 whole plant leaf damage scale” (9) up to the most detailed “0.0 to 4.0 fall armyworm leaf damage index” with, for example, 20 intervals of detail at 4 leaf stage and 60 levels of detail at 12 leaf stage coming close to true interval data.

The scales required different investments of time for damage assessment (GLM, F4;126 = 29; p < 0.0001). The fastest damage assessments were possible when using the “yes–no damage scale” and the “Simple 1 to 5 whole plant damage scale”, requiring 2.5 ± 2.2 and 2.6 ± 0.9 s per plant, respectively (Fig. 4). About double the time was needed for the “Davis’ 0 to 9 whorl & furl damage scale” (4.8 ± 2.4 s), and about three times the time for the “Williams’ 0 to 9 whole plant leaf damage scale” (6.9 ± 4.5 s). In general, more time was usually needed to assess medium damage than light or heavy damage (see curves in Fig. 4). In contrast, the “Simple 1 to 5 whole plant damage scale” was the only scale where the amount of damage did not influence time needed for assessment (see p values in Fig. 4).

Fig. 4
figure 4

Time needed to assess different levels of leaf damage caused by the fall armyworm caterpillars to maize plants using different leaf damages scales. Polynomial regression lines plotted. Univariate GLM applied and p values show for influence of scores on time

The most detailed scale, the “0.0 to 4.0 fall armyworm leaf damage index” was also the most labour-intensive scale requiring about 12.6 ± 6.5 s per plant. Assessment time increased with increasing damage across leaves as well as with the size of the maize plant, reflected by its number of leaves (p = 0.046). In contrast, leaf numbers had no detectable influence on the time needed to assess damage via the “yes–no damage scale” (p = 0.51), the “Simple 1 to 5 whole plant damage scale” (p = 0.58), the “ Williams’ 0 to 9 whole plant leaf damage scale” (p = 0.89), and logically not via the “Davis’ 0 to 9 whorl & furl damage scale” as only the upper 3 to 4 leaves are assessed for the latter independent of plant size.

Discussion

Our review of over 500 scientific publications related to fall armyworm damage revealed that four scales are most widely applied. They are the nominal “yes–no damage scale” (Gómez et al. 2013; De La Rosa-Cancino et al. 2016; Zibanda et al. 2017; Midega et al. 2018; Aguirre et al. 2019; FAO and CABI 2019; Jaramillo-Barrios et al. 2019; Maruthadurai and Ramesh 2020), the ordinal “Simple 1 to 5 whole plant damage scale”(Cruz and Turpin 1983; Figueiredo et al. 2006; dal Pogetto et al. 2012; Grijalba et al. 2018; Fotso Kuate et al. 2019; dos Santos et al. 2020), “Davis’ 0 to 9 whorl & furl damage scale” (Davis et al.1992), and “Williams’ whole plant 0 to 9 leaf damage scale for fall armyworm” (Williams et al. 1989). As those scales all provide difficult-to-analyse nominal or ordinal (rank) data types, we created a novel 0.0 to 4.0 leaf damage index allowing finer, more linear, and therefore more accurate assessments. Then, we successfully tested and validated those scales in comparison with each other under field and laboratory conditions at different maize growth stages and different pest populations.

A number of other damage rating scales exist in the literature. For example, there is a 0 to 5 leaf %-damage scale (0 = no damage, 1 = slight damage (pinholes), 2 = moderate damage (10 to 25% of leaves or whorl damaged), 3 = heavy damage (25 to 50% of leaves or whorl damaged), 4 = severe damage (50 to 75% of leaves or whorl damaged), 5 = entire whorl destroyed) (Ghidiu and Drake 1989), and another 1 to 5 leaf damage scale (1 = No evident damage, or less than 1–3 pinhole type injuries; 2 = More than 3 pinhole type injuries, and/or 1–3 injuries less than 10 mm each; 3 = More than 3 injuries less than 10 mm, and/or 1–3 injuries larger than 10 mm each (shot-hole-type injuries); 4 = 3 to 6 shot-hole-type injuries, and/or at least 50% of the whorl destroyed; 5 = More than 6 shot-hole-type injuries, and/or whorl totally destroyed) (Ayala et al. 2013). There is also a 0 to 10 leaf damage scale used for damage assessments of maize under greenhouse conditions (0 = no visible damage; 1 = small amount of pinhole type injury; 2 = several pinholes; 3 = small amount of shot-hole type injury with 1 or 2 lesions; 4 = several shot-hole type injuries and few lesions; 5 = several lesions; 6 = several lesions, shot-hole injury and portions eaten away; 7 = several lesions and portions eaten away with some areas dying; 8 = several portions eaten away and areas dying; 9 = the whorl almost or completely eaten away and several lesions with more areas dying; 10 = plant dead, dying or almost completely destroyed (Wiseman et al. 1966)). However, all those scales are less frequently used, and some are in their descriptive parts slightly inconsistent.

Moreover, regardless of which damage rating scale is used for the fall armyworm, it needs to be emphasized that the caterpillars still need to be identified (Toepfer 2017; FAO and CABI 2019). This is, because armyworms, corn borers, stalk/stem borers and other caterpillars of lepidopteran pests can cause similar damage symptoms. This is the case for pinholes, shot holes, window panes, and elongated feeding holes in leaves. Some major differences resulting from fall armyworm feeding compared to other caterpillars’ damage are (a) the extensiveness of feeding seen as large ragged feeding holes, large parts of the leaf edges eaten, and an often completely destroyed vegetation growth point, (b) the large amount of frass in the whorl and furl, and (c) that fall armyworm caterpillars rarely enter the maize stems and thus rarely leave bore holes and broken stems (FAO and CABI 2019).

Our review, analyses and validation of the most commonly used damage scales of the fall armyworm and the novel damage index confirmed problems of a general nature relevant to many damage scales. First, scales are compromises between the need for detailed information and being simple enough for practical use. The scales studied here can be ranked in an order of increasing detail and workload (Table 5). Second, most rating scales are hybrid scales combining frequency and severity information. The unsure link between these two types of information is a common problem with many damage assessment data (Blong 2003). In our study, only the “Williams’ 0 to 9 whole plant leaf damage scale” and the “0.0 to 4.0 leaf damage index” account for this problem. Third, damage scales may be of nominal, ordinal (rank), interval or ratio data type (Stevens 1946; Blong 2003). Although these data concepts have been criticized for their simplicity (Velleman and Wilkinson 1993), they remain frequently related to requirements for statistical tests. Unfortunately, most pest damage scales, including the ones studied here, generate ordinal data that are difficult to analyse. In fact, statistical analyses involving means and standard deviations should be avoided here (Stevens 1951). If one would wish to assess true interval-type of data instead of ordinal, nominal or ratio data, it might be argued that percent leaf damage would need to be assessed, particularly for medium and heavy damage. This is, however, subject to human bias when not done through imaging software, and therefore difficult to implement. We therefore proposed the novel 0.0 to 4.0 leaf damage index that creates data close to interval data, therefore allowing the application of some parametric statistical inference methods. Finally, most of the scales do not account for leaf numbers and none for plant maturation status, with older plants being usually more relevant economically to a farmer than younger plants. For all these reasons, some researchers argue it might be scientifically more accurate to assess the pest population itself rather than damage. This can be achieved by counting caterpillars per plant, or the total weight of caterpillars per plant (Wiseman and Davis 1979), as well as through capturing moths in traps (Prasanna et al. 2018; FAO and CABI 2019). However, the population density measures are rarely linearly correlated with damage and even less correlated with yield loss (Dent and Walton 1998). This was also confirmed by the weak relationship found between the scales studied here, and fall armyworm caterpillar populations (Fig. 1). Therefore, damage levels will likely remain the most used types of data when studying the impact of this pest.

In summary, all proposed scales have their advantages and disadvantages summarised in Table 5. Except for the 0.0 to 4.0 damage index, they all remain of nominal or ordinal data type, limiting the application of some parametric statistical inferences. Only, the novel “0.0 to 4.0 fall armyworm leaf damage index” is of the ratio data type being more linear than the other scales. However, the associated workload when using this scale is high.

In conclusion, we suggest to use the “simple 1 to 5 whole plant damage scale” for pest monitoring and decision-making at all maize growth stages (Table 6). The original or simplified “Davis’ 0 to 9 whorl & furl damage scale” should be used for research purposes that need to estimate recent effects of treatments against caterpillars, as they are most reflected in reduction in damage in the newly grown parts of the plant, thus in the whorl and furl. The “Williams’ whole plant 0 to 9 leaf damage scale” as well as the “0.0 to 4.0 fall armyworm leaf damage index” should be used when longer periods need to be assessed on different stages of vegetative maize; and the latter scale particularly when high data resolution is required.

Table 6 Proposed use of leaf damage rating scales for assessing damage by fall armyworm caterpillars

Authors’ contributions

ST designed the study. PF and ST reviewed literature. ST, PF, DB, IM, JK implemented the study and collected data. ST, MS analysed the data. ST, JK, TT supervised the study. ST, PF, TT wrote the manuscript.