Introduction

Strawberries are the most produced and commercialized berries in the world. Since 2016, strawberry production has increased by 8% to reach 396,000 ha worldwide, producing 8,893 Mt (FAO, 2019). In Spain, strawberries are grown mainly in Huelva, a province covering most red fruit production. In 2021, this province’s strawberry-cultivated area was around 6,700 ha, representing 93% of the national total. Fusarium oxysporum f sp. fragrariae (Winks and Y.N. Williams) (Fof) is one of the most dangerous soilborne pathogens affecting strawberries worldwide (Henry et al., 2017). Fusarium oxysporum Schltdl. is an asexual ascomycete that causes severe vascular wilt disease in many crops (Booth et al., 1971). This saprophytic/necrotrophic fungus may survive in soil organic matter as chlamydospores for several years. The formae speciales of F. oxysporum usually have a narrow host range, often restricted to a single plant species (Kistler et al., 1998), and there are non-pathogenic strains morphologically indistinguishable from pathogenic strains (Suga et al., 2012). In strawberry crops, Fof inoculum generally includes contaminated soil and plants with latent infection (Katoh et al., 2021).

Since the prohibition of methyl bromide, there has been a notable increase in soilborne fungal diseases affecting strawberries, with unclear alternative control strategies (Holmes et al., 2020). Host plant resistance against these diseases has become paramount in strawberry breeding programs. Extensive research by Pincot et al. (2018, 2022) has identified five dominant resistance genes (Fw1-5) against F. oxysporum in strawberries. Several studies, including those by Borrero et al. (2017) and Fang et al. (2012), have focused on identifying resistant strawberry cultivars. It is essential to associate these cultivars with the corresponding pathotypes of F. oxysporum, as defined by Henry et al. (2021) and Pincot et al. (2022).

Besides breeding for disease resistance, the early detection and timely control of diseases are also critical factors in preventing yield losses (Garrido et al., 2011). Standard methods of early disease detection involve processes such as culturing, isolation, and molecular techniques following visual inspection of the plants (Debode et al., 2009; Koike & Gordon, 2015). Polimerase Chain Reaction primers allowed to detect Fof propagules in field soil, but they may miss specific Fof pathotypes (Koike & Gordon, 2015). Burkhardt et al. (2019) also introduced primers for Fof detection, while Katoh et al. (2021) devised a rapid LAMP assay requiring minimal equipment. Despite their accuracy, these methods are labor-intensive for large-scale scouting (Tapia-Tussell et al., 2008; Lu et al., 2017), and they cannot detect the Spanish pathotype of Fof.

For these reasons, non-destructive methods for detecting plant diseases have generated considerable interest among researchers and farmers. The integration of advanced sensors for proximal and remote sensing with accessible machine-learning algorithms holds promise for monitoring strawberry growth effectively. Such tools can aid breeding programs and disease control in strawberry fields by identifying infected plants before visible symptom occurs. This capability is especially critical for accurately assessing disease tolerance levels among resistant cultivars and determining whether non-pathogenic F. oxysporum strains impact crop physiology, potentially threatening productivity despite the absence of visible disease symptoms.

Various studies have explored non-destructive techniques for plant disease identification, mainly using RGB, multispectral, and hyperspectral images (Zheng et al., 2021). Hyperspectral systems, covering a wide spectral range with high resolution, showed promising results in the early detection of asymptomatic pathogens (Zhang et al., 2020). Siedliska et al. (2018) achieved over 97% accuracy in detecting fungal infections in strawberries using VNIR/SWIR hyperspectral analysis and a back propagation artificial neural network (ANN) model. Similarly, using hyperspectral data and ML algorithms, Lu et al. (2017) successfully detected Anthracnose crown rot in field-grown strawberries. Yeh et al. (2013, 2016) demonstrated early detection of foliar anthracnose in strawberries, achieving around 80% accuracy using different hyperspectral image analysis techniques. These studies underscore the potential of hyperspectral imaging for disease detection before visible symptoms emerge. Additionally, Pujari et al. (2016) compared ML models for classifying crop diseases based on plant symptom characteristics, with support vector machines (SVM) outperforming ANN with 92% accuracy.

The detection of F. oxysporum in other agricultural crop species has undergone significant advancements owing to the integration of diverse innovative techniques. Dang et al. (2020) pioneered a creative approach to detecting radish wilt using RGB and NIR images from UAVs. Studies such as those conducted by Navarro et al. (2022) and Shih et al. (2023) have elucidated distinct spectral features associated with Fusarium wilt, enabling early detection even before visible symptoms manifest in wild rocket and moth orchids, respectively. Through methods like spectral angle mapper (SAM) and constrained energy minimization (CEM), coupled with ANN, SVM, and Random Forest Classifiers (RFC), researchers have effectively identified disease-specific signals while suppressing background noise in hyperspectral spectra. Additionally, integrating thermography, as highlighted by Rispail and Rubiales (2015), showed promising results in assessing the resistance of pea plants to Fusarium wilt and other soil-borne pathogens by detecting subtle temperature or transpiration rate changes, providing valuable insights into plant physiological responses to disease stressors.

These findings highlight the capability of proximal and remote sensors, combined with ML algorithms, to develop practical, non-destructive disease detection methods with high accuracy and efficiency. Building upon these arguments, the objectives of this work are: (1) to assess the visual symptomatology and physiological impairment, particularly leaf gas exchange, induced by six F. oxysporum isolates obtained from strawberry plants; (2) to evaluate the effectiveness of hyperspectral images obtained from ‘non-symptomatic’ leaf tissue, combined with machine learning models, in distinguishing between healthy and Fusarium wilted strawberry plants grown under controlled conditions, and determining the severity of physiological impairment caused by F. oxysporum f sp. fragrariae.

These objectives are designed to address the uncertainties surrounding the physiological impacts of ‘non-pathogenic’ F. oxysporum isolates on apparently healthy strawberry plants, and to explore the potential of remote sensing and machine learning for early detection and severity assessment of Fusarium wilt.

Materials and methods

Plant material and experimental design

This work was carried out at the Higher Technical School of Agricultural Engineering (ETSIA) facilities of the University of Seville (Seville, 37º 21’ N, 5º 56’ W). Strawberry plants cv. Palmeritas (Masiá Ciscar, Lepe, Spain) were grown in pots of 1 L volume using a completely randomized block experimental design with four blocks and seven treatments. One treatment served as a control treatment (non-inoculated). By contrast, the other six treatments involved different Fusarium oxysporum isolates obtained from strawberry plants (inoculated plants), with eight and four replications for control and different isolates, respectively. The plants were inoculated with the following isolates: F74, F111, F115, F134, F138 and F141. The isolate F74 was characterized as Fof causing strawberry Fusarium wilt in different trials (Borrero et al., 2017; Henry et al., 2021). This Spanish isolate has a distinct genotype and phenotype compared to other isolates from other countries (Henry et al., 2021). The other isolates in this study were isolated in Spain and considered non-pathogenic in previous trials (data not shown). Subsequently, the pots were randomly placed in a growth chamber under controlled environmental conditions with temperatures set at 28ºC during the day and 24ºC at night, maintaining a twelve-hour photoperiod. The plants were manually irrigated twice weekly with two distinct nutrient solutions: one comprised of Peter’s foliar feed (27 + 15 + 12; N + P2O5 + K2O) (Scotts, Heerlen, The Netherlands) at 0.5 g L-1, and the other consisting of CaCl2 · 2H2O, with a dosage of 0.7 g L-1 (Panreac Química SLU, Barcelona, Spain).

Inoculation conditions

The fungal isolates used belong to the Plant Pathology Laboratory of ETSIA. All these F. oxysporum isolates were stored at -80 °C before being used. The inoculum of all the Fusaria was grown for seven days in AMAP culture medium (Borrero et al., 2009) containing 10 g of agar, 10 g of malt extract, 2.0 g of L-asparagine, 0.5 g of Peter’s foliar feed and micronutrients per L of distilled water. To obtain a conidial suspension, sterile water (5 mL) was added to each culture plate before scraping the culture surface with a sterile bent metallic rod. The concentration of conidia was determined with a hemocytometer and adjusted to 2 × 106 conidia mL− 1. The plants were inoculated by root-dipping in the abovementioned suspension for 30 min after root pruning. The inoculation was performed one week before further measurements.

Hyperspectral imaging system and data acquisition

A Headwall Nano-Hyperspec hyperspectral pushbroom camera (Headwall Photonics, Bolton, UK) with a spectral range from 397 to 1003 nm and a spectral resolution of 272 bands was used to perform the measurements. The camera sensor is a line-scan, meaning the number of lines scanned (y-axis) depends on the scanning time. By contrast, in the ground projection, 640 constant lines are measured (x-axis), with a spectral resolution of 272 bands per projection line (z-axis). The sensor exposure time was 15 ms, and the frame rate was 15 ms. The camera was fixed to an aluminium frame at a height of 0.5 m with a scanning angle of 0º (nadir view). The pots were placed on a conveyor belt powered by a 12 V motor (electric motor output of 6.1 V and 1 A) under the aluminium structure, moving at a constant speed of 0.06 m s− 1 (Fig. 1). Additional artificial light was provided by two 400 W tungsten halogen lamps (Maurer Ferritalia, Padova, Italy) fixed to both sides of the structure. The hyperspectral camera was connected via an Ethernet cable to a computer to record the images and subsequent data transfer. Measurements were made weekly (six to eight days) for four weeks, described as M1, M2, M3 and M4, respectively.

Fig. 1
figure 1

Conveyor belt and the aluminium structure with the hyperspectral camera (left), details of the hyperspectral camera module (right-up) and example of RGB plant image obtained by the camera (right-down)

Data processing

Hyperspectral data underwent radiometric calibration to obtain spectral reflectance data, for which dark and white reference measurements were performed. Covering the camera shutter, a dark reference value was obtained for each sampling date. For white reference, a diffuse reflectance target (Labsphere, USA, Spectralon® diffuse Reflectance target, 127 × 127 mm, 99% reflectance, calibrated) was placed aside each plant. The pixels corresponding to the white reference panel were identified using the 728 nm wavelength, as this band exhibited the highest signal intensity among the panel pixels. To determine the spectral signature of the white reference, only the pixels with the top 10% highest radiation values in the 728 nm band were selected and then averaged. Hyperspectral data processing and analysis were performed using the open-source programming language Python (Python Software Foundation, https://www.python.org/). Firstly, the hyperspectral image was corrected as follows:

$$\:Corrected\:Image=\frac{raw-dark}{white-dark}\:\:\:\:\:\:\:\:\:\:\:\:\:$$
(1)

where raw is the uncorrected image file and dark and white denote the dark and white reference values, respectively.

The hyperspectral images were then subjected to Savitzky-Golay smoothing. Then, the first 35 and the last 40 wavebands were removed to reduce possible noise due to high and random variation of spectral reflectance in these bands, resulting in images with a total of 197 spectral bands. The dataset was then normalized between 0 and 1. Finally, a manual pixel selection process was carried out within regions of ​​interest (ROIs) to obtain an average spectral signature per plant of leaf tissues without any visual symptoms of the disease. Subsequently, a ‘csv’ file is composed as a dataset in which the rows represent individual plants, and the columns represent the spectral bands. This dataset consists of the selected ROIs’ average spectral signature, representing visually healthy leaf tissue.

Disease severity and physiological measurements

On the same days that hyperspectral images were taken, measurements of the severity and leaf gas exchange were also made. Disease severity was assessed, considering leaf wilting to be the main symptom of Fusarium wilt. Weekly observations were taken for disease evolution. Plants were scored with the relative number of symptomatic leaves per plant (number of symptomatic leaves / total number of leaves) (Hernández-Muñiz et al., 2023). The area-under-the-disease-progress-curve-standardized (AUDPCs) per pot was calculated from the disease severity values by the trapezoidal integration method between the onset of symptoms and the ending time of the assay and by dividing the total duration (days) of the epidemic the assay (Campbell & Madden, 1990). In addition, leaf gas exchange measurements were taken on one fully expanded leaf per plant without any visual symptoms of the disease. Stomatal conductance (gs) and leaf photosynthesis rate (A) were measured using a differential portable system (CIRAS-3 Model, PP-Systems, Amesbury, U.S.A.), in which photosynthetically active radiation (PAR) and air CO2 concentration were set to 1200 µmol m− 2 s− 1 and 400 µmol mol− 1, respectively.

Machine learning models description

Six different classifier models, artificial neural network (ANN), decision tree (DT), K-nearest neighbour (K-NN), support vector machine (SVM), multinomial logistic regression (MLR), and Naïve Bayes (NB), were developed using the complete spectral signatures of both inoculated and non-inoculated. The dataset was divided into training and validation sets, with 80% and 20% of individuals, respectively. Due to the imbalanced nature of the dataset, stratified cross-validation (Szeghalmy & Fazekas, 2023) was conducted to train and validate the models to ensure that the proportion of examples from each class is maintained in the validation set. Early stopping and regularization were used to avoid model overfitting. The algorithms used were tested with different settings. The model configurations finally used in the study are described in what follows. A Multi-Layer Perceptron (MLP) framework was adopted for the ANN model architecture, comprising input, hidden, and output layers. The optimal number of neurons was determined as the number of input data, followed by 150, 100, and 50 hidden layers, and the output corresponding to the number of classes described. Learning rate (0.001), training epochs (500), rectified linear unit (ReLU) activation function after hidden layers, and Adam as the solver function were set. The ANN model was used with two configurations: as a classifier (the output layer is the class to which the plant belongs according to the level of disease severity) and as a regression (the output layer is the value of the physiological indicator used in each case, gs or A). For this purpose, a ‘softmax’ output layer was defined for the ANN classifier version and a ‘linear’ output layer for the regression one.

For the DT algorithm, a decision tree was constructed using the Gini Index criterion, and the ‘best’ attribute was selected to optimize the splitting of the provided dataset. The KNN algorithm was set to minimize the influence of outliers or incorrect classes, with K = 3 (considering the three nearest neighbours) and ‘distance’ as the weight parameter. This choice was made because neighbours closer to a query point will have a greater influence than neighbours further away. The SVM algorithm was set with a linear-based kernel to find an optimal hyperplane for effectively separating data classes. The MLR algorithm, an extension of binary logistic regression for situations with more than two categories, was configured with the ‘newton-cg’ solver and the ‘multinomial’ parameter. The NB algorithm was set to use a Gaussian distribution model due to the dataset size. An approach based on confusion matrices and classification reports was employed to evaluate the model’s performance. Matrices provided a visual representation of each class classified by the corresponding model. The Y-axis of each matrix represented the actual values ​​(true labels), and the X-axis represented the values that the model predicted (estimated labels). Each matrix showed the classification results in terms of true positives (TP), false negatives (FN), false positives (FP) and true negatives (TN). This allowed for the evaluation of the model’s performance in identifying each class.

Statistical analysis

The severity, stomatal conductance, and leaf photosynthesis rate of each F. oxysporum isolate were analysed using one-way ANOVA to determine the effects of the isolates during the experiment. Heterogeneous groups were separated using Tukey’s test (HSD) (P < 0.05) to discriminate differences between isolates. Severity data were transformed for normalization following the Box-Cox methodology, which employs power transformations to correct unequal variances and non-linearity in the relationship. The machine learning models used Linear regressions to compare the estimated and measured values ​​of stomatal conductance and leaf photosynthesis rate. The root means square error (RMSE) and the mean absolute error (MAE) values were calculated using the same algorithms of the chosen models to analyse predictive performance.

The data presented for models were unbalanced since the number of data included in each class was not equal, so the metric used is represented as ‘F1-score’ since it follows a weighting between the ‘Precision’ and ‘Recall’ metrics of the classification report. Precision measures the quality of the classification and is calculated as VP / (VP + FP), while the Recall informs about the identification capacity and is calculated as TP / (TP + FN). The F1-score combines these two metrics in a single measure as the harmonic mean between Precision and Recall values. Thus, F1-score balances the classification quality and identification ability in unbalanced cases.

Results

Fusarium wilt severity

Figure 2 shows the visual disease severity caused by the different isolates of F. oxysporum on the last sampling date (M4), the only one in which significant differences between control and inoculated plants were observed. Only two isolates (F74, F141) showed a noticeable increase in visual disease symptoms on leaf tissues during the disease. However, due to the high variability of visible symptoms observed in F141, only F74 showed significant differences with the control treatment.

Fig. 2
figure 2

Visual disease severity on leaf tissues caused by each isolate of Fusarium oxysporum on the last sampling date. Boxplots were represented for each treatment as control (non-inoculated) plants (n = 8) and six Fusarium oxysporum isolates (n = 4). The star indicates significant differences between each F. oxysporum isolate and control plants (P < 0.05)

Leaf gas exchange responses to F. Oxysporum isolates

Figure 3 shows the values of gs measured during the experiment. The control treatment (non-inoculated plants) showed relatively constant gs values in the 50–60 mmol m− 2 s1 range. All F. oxysporum isolates showed significantly lower gs values throughout the four sampling dates than the control treatment. However, F111 and F115 did not show significant differences until the second and third sampling dates. The isolate F74 caused the most severe reduction in gs, with a 35% reduction in gs relative to the control treatment in M1 that dropped to 90% in M3 and M4. The rest of the isolates showed a gs decrease with respect to the control treatment, which was less severe than that observed in F74, being significant in all cases from M3 onwards and with percentage reductions in the 15–30% range.

Fig. 3
figure 3

Stomatal conductance of control and inoculated plants for each sampling day was described as M1 (a), M2 (b), M3 (c) and M4 (d). Boxplots were represented for each treatment as control (non-inoculated) plants (n = 8) and six Fusarium oxysporum isolates (n = 4). The star indicates significant differences between each F. oxysporum isolate and control plants (P < 0.05)

Figure 4 shows A measured in control and inoculated plants during the experiment. The control plants showed relatively constant A values throughout the experiment, with average values around 6 µmol m− 2 s− 1. As observed with gs, all F. oxysporum isolates caused a significant reduction in A relative to the control treatment. However, F111 and F115 did not do so until the second and third sampling dates, respectively. F74 was again the isolate that caused the greatest severity in A reduction, with up to 90% reductions on the last two sampling dates. The remaining isolates showed a progressive decrease of A, reaching values close to 30% on the previous sampling date.

Fig. 4
figure 4

Leaf photosynthesis rate (A) of control and inoculated plants for each sampling day was described as M1 (a), M2 (b), M3 (c) and M4 (d). Boxplots were represented for each treatment as control (non-inoculated) plants (n = 8) and six Fusarium oxysporum isolates (n = 4). The star indicates significant differences between each F. oxysporum isolate and control plants (P < 0.05)

Classification and estimation of Fusarium wilt severity by machine learning algorithms

Plants inoculated with the six F. oxysporum isolates were used for model training since they significantly reduced physiological parameters compared to control plants. To train the classification response, the plants were categorized into four categories based on the severity of physiological damage. The established categories were a healthy plant, mild, moderate, and severe impairment. To assess the suitability of using either stomatal conductance or leaf photosynthesis rate as indicators of physiological damage, the plants were categorized twice, once using gs and the other using A as an indicator of biotic stress. In both cases, the variables gs and A were normalized by dividing their values by the maximum value observed for both variables throughout the experiment. Once the variables gs and A were normalized, the normalized ranges of gs and A used in the categorization of plants were as follows: healthy plant (1-0.85), mild impairment (0.85 − 0.75), moderate impairment (0.75 − 0.65), and severe impairment (< 0.65).

Fig. 5
figure 5

shows the average spectral signatures obtained throughout the entire study period for each class for both categorization criteria, i.e. based on gs and A. It can be observed that the spectral region with the most significant variability in reflectance between classes was from the green region (550 nm) to the red region (670 nm), while differences are less noticeable in the NIR region (700–900 nm)

Figure 5 Average spectral signatures and standard deviations for each class were distinguished as follows: class 1 (healthy plant), class 2 (mild impairment), class 3 (moderate impairment), and class 4 (severe impairment). The plants were categorized using stomatal conductance-based (a) and leaf photosynthesis rate-based (b) physiological impairment. For the gs-based and A-based categorizations, the number of individuals belonging to each class was 35 (class 1), 37 (class 2), 30 (class 3), 26 (class 4) and 23 (class 1), 29 (class 2), 34 (class 3), 42 (class 4), respectively.

Figure 6 displays the confusion matrices for the gs-based physiological impairment classes categorized by all studied models. Additionally, Table 1 presents the results of these models in the classification report. The ANN model (Fig. 6a) best classified healthy plants and those with varying degrees of physiological damage (Table 1). Analysing the F1-score metric, this model demonstrated an overall accuracy of 85%, followed by the DT model (77%). The other evaluated models (KNN, SVM, MLR, and NB) exhibited a low classification performance, with overall F1-score values below 60%.

Table 1 Classification report for artificial neural network (ANN), decision tree (DT), K-nearest neighbour (KNN), support vector machine (SVM), multinomial logistic regression (MLR) and Naïve Bayes (NB) classifiers, showing each model’s performance for individual classes (healthy, mild, moderate and severe) and the overall model performance. Classes were established using gs-based physiological impairment. Precision results obtained from stratified cross-validation are represented as F1-scores for a random selection of 20% of samples randomly selected (support) from each class

Analysing the performance of the top-performing models (ANN and DT), it can be observed in Fig. 6 that both models correctly classified 100% of healthy individuals. The classification of individuals with some kind of physiological impairment based on gs was slightly better in ANN, which correctly classified 15 out of 19 individuals. In comparison, 13 individuals were correctly classified by DT. Another critical aspect of the ANN model compared to DT is that it misclassified only one individual with biotic stress as a healthy plant. By contrast, the DT model misclassified three individuals as healthy plants. The remaining evaluated models encountered significant difficulties in correctly classifying healthy and diseased individuals, with SVM’s performance being notably poor.

Fig. 6
figure 6

Confusion matrices were obtained for artificial neural network (a), decision tree (b), K-nearest neighbour (c), support vector machine (d), multinomial logistic regression (e) and Naïve Bayes (f) classifiers using gs-based physiological impairment classes. Four classes were established: healthy plants, mild, moderate, and severe impairment. ‘True label’ corresponds to the actual class, while ‘Predicted label’ represents the class predicted by the model. The colour scale indicates the number of randomly selected plants or samples (support) used for prediction

Figure 7 shows the confusion matrices obtained with the six classifiers for classes established using the A as a physiological indicator of biotic stress severity. As it has been observed when plants were classified using gs as a physiological indicator, the ANN algorithm (Fig. 7a) also demonstrated the highest classification performance, correctly classifying all healthy plants and 19 out of 21 diseased plants. However, the performance of the ANN model to correctly classify the severity level of diseased plants was somewhat poorer than that observed using gs-based classes, with 15 out of 21 plants classified correctly for 15 out of 19 correctly classified in the case of gs-based classes. Analysing the F1-score metrics, the ANN model showed an overall accuracy of 77%, followed by the DT model with 65% (Table 2). In both cases, weaker classification performance was observed compared to that shown by the same models when classes were defined based on gs as a physiological indicator. The rest of the evaluated models (KNN, SVM, MLR, and NB) showed low classification potential, with overall F1-score values in the 30–50% range.

Fig. 7
figure 7

Confusion matrices obtained for artificial neural network (a), decision tree (b), K-nearest neighbour (c), support vector machine (d), multinomial logistic regression (e) and Naïve Bayes (f) classifiers using A-based physiological impairment classes. Four classes were established: healthy plants, mild, moderate, and severe impairment. ‘True label’ corresponds to the actual class, while ‘Predicted label’ represents the class predicted by the model. The colour scale indicates the number of randomly selected plants or samples (support) used for prediction

Table 2 Classification report for artificial neural network (ANN), decision tree (DT), K-nearest neighbour (KNN), support vector machine (SVM), multinomial logistic regression (MLR) and Naïve Bayes (NB) classifiers, showing each model’s performance for individual classes (healthy, mild, moderate and severe) and the overall model performance. Classes were established using A-based physiological impairment. Precision results obtained from stratified cross-validation are represented as F1-scores for a random selection of 20% of samples randomly selected (support) from each class

The best classification model (ANN) was also trained as a regression model to assess its reliability in predicting actual gs and A values in non-inoculated and inoculated strawberry plants. Figure 8 shows the relationships between measured and predicted gs (Fig. 8a) and A (Fig. 8b) values. Stomatal conductance presented a coefficient of determination (R2) of 0.84, an RMSE value of 5.03 mmol m− 2 s− 1 and a MAE value of 3.94 mmol m− 2 s− 1. The R2, RMSE and MAE values for A were 0.81, 0.64 µmol m− 2 s− 1 and 0.41 µmol m− 2 s− 1, respectively. The slopes of the regression lines were 0.87 (gs) and 0.99 (A), whereas the intercepts were 3.23 (gs) and − 0.13 (A).

Fig. 8
figure 8

Relationship between measured and estimated stomatal conductance (a) and photosynthesis rate (b) values by the regression of artificial neural network model. Each symbol represents an individual plant. The solid and dashed lines represent the regression and the 1:1 line, respectively

Discussion

The disease caused by pathogenic F. oxysporum isolates is characterized by the plant’s difficulty in absorbing water and nutrients from the soil (Yadeta & Thomma, 2013), as the fungus affects the plant’s vascular system. In horticultural species such as tomato or cucumber, the disease caused by F. oxysporum has resulted in severe reductions in stomatal conductance (gs) and net photosynthesis rate (A) (Carmona et al., 2021; Lorenzini et al., 1997; Wang et al., 2015), with the latter being down-regulated not only by diffusional limitations (gs) but also by biochemical impairment in the leaf photosynthetic apparatus (Nogués et al., 2002; Susič et al., 2018).

Our findings show that strawberry plants inoculated with the pathogenic Fof isolate also exhibited the characteristic pattern of reduced gs and A as described for other species, with reductions of up to 90% in gs and A compared to control plants (non-inoculated) around 15 days after inoculation (Fig. 3). However, it has also been shown that F. oxysporum isolates considered as non-pathogenic may also cause moderate gs and A impairment, with reductions of up to 20–30% of these variables without any visible symptoms of the disease. This can have consequences at the production level, and its detection requires the development of simple and early detection methods due to the absence of any visual symptoms.

The results obtained in this study reveal that among the six machine learning models evaluated, the ANN model exhibited the highest classification performance, both when disease severity was based on gs and when it was based on A. The overall accuracy values evaluated through the F1-score metrics were 0.85 (gs-based) and 0.77 (A-based). To the best of our knowledge, this is the first study in which machine learning models have been developed for the early detection of F. oxysporum f sp. fragrariae in strawberries based on hyperspectral images. Therefore, it cannot directly compare the performance of the ANN model with other F. oxysporum detection studies in strawberries. However, for detecting powdery mildew in strawberry plants using RGB images, the ANN model also showed the highest classification performance (93.8%) (Chang et al., 2019). In strawberry fruits affected by Botrytis and Anthracnose, the ANN model was also highly accurate (> 95%) for disease detection using hyperspectral images (Siedliska et al., 2018). Ye et al. (2020) also evaluated three models (i.e., SVM, RF, and ANN) for detecting areas affected by Fusarium wilt in banana plantations using multispectral images taken from drones. In this case, all models showed similar accuracies (around 90%) in detecting visually infected areas. Although they did not evaluate the ANN model, Yeh et al. (2016) assessed the reliability of three models for classifying healthy and inoculated strawberry plants with visible symptoms of Anthracnose disease. In all cases, they obtained an accuracy close to 80%, similar to those obtained in this study for classifying F. oxysporum-infected strawberry plants without visual symptoms.

The development of machine learning models trained on hyperspectral images of strawberry plant´s apparently healthy (non-symptomatic) plant tissue represents, therefore, a valuable tool for early detection of physiological impairments caused by F. oxysporum isolates. Moreover, establishing disease severity classes based on stress indicators such as gs and A, commonly used for characterizing crop responses to environmental stresses, marks a departure from traditional methodologies that often rely on visual symptom severity for class determination. However, since most biotic and abiotic stresses decrease these physiological parameters (Abdulridha et al., 2019; Pantazi et al., 2017; Susič et al., 2018), ML models calibrated on the interactions between spectral signatures and variations in gs and A would require further testing under field conditions to determine their accuracy in detecting F. oxysporum infections in real-world scenarios with multiple stressors.

However, there is evidence in the literature that the combination of hyperspectral images and ML algorithms allows for precise discrimination between biotic and abiotic stresses in plants, as there is a certain degree of specificity in the spectral bands altered by each factor. In this sense, Susič et al. (2018) successfully differentiated nematode infestation from water deficiency in tomato plants using shortwave infrared spectral regions associated with O-H and C–H stretches. Pantazi et al. (2017) classified reflectance data from winter wheat crops with over 95% accuracy, identifying health status, nitrogen stress, and disease presence using SOM-based models. As demonstrated by Abdulridha et al. (2019), optimal band selection significantly improves disease detection accuracy, achieving up to 98% accuracy in identifying laurel wilt disease in avocados against nutrient deficiency. These advances underscore the efficacy of combining hyperspectral imaging and machine learning for precisely detecting and classifying plant stresses. Ortiz et al. (2019) found that spectral bands associated with F. oxysporum infection were identified in the visible range, while bands related to water stress were found in the near-infrared range, suggesting high specificity and sensitivity in detecting and dis.

These findings suggest that the models trained in this study can offer a high degree of specificity in detecting F. oxysporum f sp. fragariae (Fof) and the severity of its physiological damage. As future research directions, it is recommended to evaluate the performance of the models in detecting F. oxysporum f sp. fragariae under uncontrolled conditions and multiple stress factors, as well as across different cultivars and with a larger number of pathogenic F. oxysporum isolates.

Conclusions

Pathogenic Fof isolates caused rapid wilting of strawberry plants with severe drops in leaf gas exchange variables. The non-pathogenic F. oxysporum isolates also caused a reduction in leaf gas exchange variables in infected strawberry plants, although much less severe. The combined use of machine learning models and hyperspectral images obtained from healthy-appearing leaf tissue (without visual disease symptomology) allows the classification of healthy and diseased strawberry plants with different levels of physiological impairment. The ANN-based model provided the best classification performance, with an overall accuracy of more than 80% when plant classes were defined based on their stomatal conductance values. Further improvement and refinement of the model with new datasets are still needed to enhance the discrimination between Fof severity classes and to ensure high detection accuracies under multi-stressor environments (e.g., those potentially found under field conditions) and multi-cultivar environments (e.g., those found in breeding programs).