Estimating peanut and soybean photosynthetic traits using leaf spectral reflectance and advance regression models

Buchaillot, Ma. Luisa; Soba, David; Shu, Tianchu; Liu, Juan; Aranjuelo, Iker; Araus, José Luis; Runion, G. Brett; Prior, Stephen A.; Kefauver, Shawn C.; Sanz-Saez, Alvaro

doi:10.1007/s00425-022-03867-6

Estimating peanut and soybean photosynthetic traits using leaf spectral reflectance and advance regression models

Original Article
Open access
Published: 24 March 2022

Volume 255, article number 93, (2022)
Cite this article

Download PDF

You have full access to this open access article

Planta Aims and scope Submit manuscript

Estimating peanut and soybean photosynthetic traits using leaf spectral reflectance and advance regression models

Download PDF

Ma. Luisa Buchaillot^1,2,
David Soba³,
Tianchu Shu⁴,
Juan Liu⁵,
Iker Aranjuelo³,
José Luis Araus^1,2,
G. Brett Runion⁶,
Stephen A. Prior⁶,
Shawn C. Kefauver^1,2 &
…
Alvaro Sanz-Saez ORCID: orcid.org/0000-0002-7754-4618⁴

3214 Accesses
14 Citations
6 Altmetric
Explore all metrics

Abstract

Main conclusion

By combining hyperspectral signatures of peanut and soybean, we predicted V_cmax and J_max with 70 and 50% accuracy. The PLS was the model that better predicted these photosynthetic parameters.

Abstract

One proposed key strategy for increasing potential crop stability and yield centers on exploitation of genotypic variability in photosynthetic capacity through precise high-throughput phenotyping techniques. Photosynthetic parameters, such as the maximum rate of Rubisco catalyzed carboxylation (V_c,max) and maximum electron transport rate supporting RuBP regeneration (J_max), have been identified as key targets for improvement. The primary techniques for measuring these physiological parameters are very time-consuming. However, these parameters could be estimated using rapid and non-destructive leaf spectroscopy techniques. This study compared four different advanced regression models (PLS, BR, ARDR, and LASSO) to estimate V_c,max and J_max based on leaf reflectance spectra measured with an ASD FieldSpec4. Two leguminous species were tested under different controlled environmental conditions: (1) peanut under different water regimes at normal atmospheric conditions and (2) soybean under high [CO₂] and high night temperature. Model sensitivities were assessed for each crop and treatment separately and in combination to identify strengths and weaknesses of each modeling approach. Regardless of regression model, robust predictions were achieved for V_c,max (R² = 0.70) and J_max (R² = 0.50). Field spectroscopy shows promising results for estimating spatial and temporal variations in photosynthetic capacity based on leaf and canopy spectral properties.

Estimating growth and photosynthetic properties of wheat grown in simulated saline field conditions using hyperspectral reflectance sensing and multivariate analysis

Article Open access 11 November 2019

Application of Vegetative Indices for Leaf Nitrogen Estimation in Sugarcane Using Hyperspectral Data

Article 17 October 2023

Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data

Article Open access 03 January 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

One of the great challenges for the future is the production of sufficient food for a growing population. From 1961 to 2012, the human population more than doubled from approximately 3 billion to 7 billion people and a further increase to 9.3 billion is projected for the year 2050 (FAOSTAT 2016). This means that crop production must double by 2050 to meet the predicted production demands of the global population. However, achieving this goal will be a significant challenge for agriculture since crop yields would have to increase at a rate of 2.4% per year, yet the average rate of increase is only 1.3%, with yields stagnating in up to 40% of land under cereal production (Araus and Cairns 2014). Further, climate change will exacerbate this challenge by intensifying field crop exposure to abiotic stress conditions, including rising temperature, drought, and increased CO₂ concentration [CO₂] (Christensen et al. 2007). This is a major issue because climatic factors since the end of the 1980s have counterbalanced the wheat genetic progress of recent decades in Europe (Oury et al. 2012). Indeed, as observed by Oury et al. (2012) and Gray and Brady (2016), the beneficial effects expected from the increase in atmospheric [CO₂] in the World’s crop production during recent decades have been constrained by the effects of temperature increases and extended drought.

Grain legumes are the main source of proteins, minerals, and fibers for animals and humans (Meena et al. 2018). To achieve significant improvements in crop yield, breeding strategies aiming to increase biomass gains and crop productivity need to focus on radiation uptake, photosynthetic efficiency, and harvest index (HI) (Reynolds et al. 2012; Koester et al. 2014). However, to date, breeding for higher photosynthetic efficiency or for tolerance to different environmental stresses has only played a minor role in increasing crop productivity over past decades (Zhu et al. 2010). In a rational sense, plant physiology research should focus on improving photosynthesis due to its central part in plant productivity (Long et al. 2004). Recently, different studies have advanced how to optimize photosynthetic processes in different crops (Ort et al. 2015; Simkin et al. 2019).

One way to improve crop photosynthesis is to increase our knowledge of genomic control of photosynthesis under different environmental conditions. To achieve this, diverse crop populations representing hundreds of cultivars need to be screened (phenotyped) under different environments to associate traits of interest (i.e., photosynthetic parameters) with specific genomic regions. With the rise of genomic and bioinformatics technologies, phenotyping entire populations for traits of interest is the bottleneck that delays scientific advancement in genomics (Adachi et al. 2011; Yan et al. 2015; de Oliveira Silva et al. 2018; Oakley et al. 2018). Therefore, genomic approaches and breeding solutions need to implement new high-throughput phenotyping techniques that allow rapid measurement of photosynthetic traits for screening cultivars in the shortest amount of time (Araus and Cairns 2014; Araus et al. 2018). By improving techniques for measuring photosynthetic traits, more efficient cultivar selection will likely improve both yield potential and resilience to abiotic stresses.

Photosynthetic performance is frequently measured with an infrared gas analyzer that assesses plant CO₂ assimilation rate. Photosynthetic parameters, such as leaf mid-day photosynthesis and leaf diurnal photosynthesis, can be used to assess in situ plant performance under different abiotic stresses (Sanz-Sáez et al. 2012, 2017). More detailed photosynthetic parameters, such as maximum rate of rubisco-catalyzed carboxylation (V_c,max) and maximum electron transport rate supporting RuBP regeneration (J_max), have been identified as selection parameters for tolerance to abiotic stress, such as drought (Aranjuelo et al. 2009, 2013), elevated tropospheric ozone (Yendrek et al. 2017), or for improved performance under elevated atmospheric CO₂ (Ainsworth et al. 2004; Soba et al. 2020). Depending on the parameter to be measured, sampling can take a few minutes each (e.g., mid-day photosynthesis) or 20–60 min per sample for photosynthetic parameters, such as V_c,max and J_max, which are calculated using photosynthesis to intercellular CO₂ curves or A–Ci curves (Farquhar et al. 1980; Long and Bernacchi 2003). In addition, V_c,max and J_max are essential input parameters for the FvCB model (Farquhar et al. 1980) that relates photosynthetic biochemistry responses to known environmental conditions (Von Caemmerer 2013). This model has also been used in earth systems models for predicting ecosystem responses to environmental changes (Rogers 2014).

Reflectance spectra at leaf and canopy levels can facilitate assessment of plant’s structure, nutritional status, and certain stress parameters. This includes estimating contents of chlorophyll, xanthophylls, nitrogen, phosphorus, fiber, sucrose (Gamon et al. 1997; Peñuelas and Filella 1998; Petisco et al. 2006; Asner and Martin 2008; Colombo et al. 2008; Ainsworth et al. 2014; Serbin et al. 2014; Dechant et al. 2017; Yendrek et al. 2017), and plant secondary metabolites (Couture et al. 2016; Vergara-Diaz et al. 2020). In addition, leaf level spectral reflectance has been used to predict photosynthetic parameters, such as V_c,max and J_max in soybean (Ainsworth et al. 2014), wheat (Silva-Perez et al. 2018), maize (Heckmann et al. 2017; Yendrek et al. 2017), and trees (Serbin et al. 2012) as well as dark respiration in wheat (Coast et al. 2019).

Although translating data acquired with a field spectrometer using a leaf clip to scalable imaging approaches using multispectral or hyperspectral cameras in drones or other aerial platforms (frequently limited to the 350–1000 nm spectral range) may be further complicated by the heterogeneous nature of canopies, such techniques could greatly expand the scope of applicability of these measurements. In the above-mentioned research, relationships between photosynthetic parameters and complex data arrays captured by leaf level spectrometers need to be analyzed using complex multivariate statistical models. Partial least squares regression (PLSR) is the most commonly used model (Serbin et al. 2012; Ainsworth et al. 2014; Heckmann et al. 2017; Silva-Pérez et al. 2017; Yendrek et al. 2017). However, Fu et al. (2020) recently reported that other machine learning algorithms such as Least Absolute Shrinkage and Selection Operator (LASSO) can estimate photosynthetic parameters as accurately or better than PLSR, since LASSO is more robust when comparing different environments or plant species (Tibshirani 1996). Therefore, to bypass PLSR performance problems, we propose to explore other powerful machine learning algorithms with appropriate feature extraction capacities, which include LASSO (Vergara-Diaz et al. 2020), Bayesian Ridge (BR; Neal 1996), and Automatic Relevance Determination Regression (ARDR; Tipping 2001).

For these multivariate models, utilized data must represent enough phenotypic variability to support proper model functioning. To achieve sufficient phenotypic variability, several researchers have applied a range of growth conditions, including different levels of abiotic stresses, such as drought (Silva-Perez et al. 2017), elevated tropospheric ozone (Ainsworth et al. 2014; Yendrek et al. 2017), or high temperature (Serbin et al. 2012). Another means for increasing phenotypic variation is by including several related species in the same model. For example, Doughty et al. (2011) used 149 tropical tree species to create a PLSR model to estimate mid-day photosynthesis using canopy hyperspectral imaging; and Serbin et al. (2012) combined hyperspectral data of two tree species to estimate V_c,max. However, to the best of our knowledge, no published study has combined multiple leguminous row crops species. In our research, we focused on soybean (Glycine max) and peanut (Arachys hypogea), which are leguminous crops often grown under high abiotic stress levels (drought and elevated temperature) in the southeastern United States. These legume crops are also important in rotation with corn and cotton.

The aims of this study were (i) to estimate photosynthetic capacity parameters, such as mid-day photosynthesis, leaf chlorophyll content (LCC), V_c,max, and J_max of two legume crops (soybean and peanut) using full-range leaf level reflectance spectra (VIS–NIR–SWIR, 400–2500 nm) with PLSR, BR, ARDR and LASSO models and (ii) to simulate photosynthetic parameter model performance using four common types of sensors with more limited wavelength ranges: VIS–NIR (350–1000 nm), NIR–SWIR (1000–2500 nm), SWIR (1400–2500 nm), and an advanced multispectral sensor imitating the ESA Copernicus Sentinel 2 satellite with 12 spectral bands.

Materials and methods

Trial setup and design

Experiments were conducted in field trials and controlled conditions located at Auburn University (Alabama, USA). The study was carried out with two leguminous crops (soybean and peanut) that were exposed to different growth conditions. The first experiment involved two soybean (Glycine max. L) cultivars grown under ambient and elevated [CO₂] at an Open Top Chamber Facility. The second experiment involved four soybean cultivars grown under high night temperature in growth chambers. The third experiment was performed with 6 peanut (Arachis hypogea L.) cultivars grown under well-watered and water-stress conditions in a greenhouse.

Experiment 1: soybean cultivar response to elevated [CO₂]

Two soybean cultivars representing high (PI398223) and low (PI567201A) water use efficiencies (WUE) were chosen for the study based on previous screening by Dhanapal et al. (2015). The two cultivars were planted on 16 May 2019 in 20 L pots filled with commercial growth media (Pro-Mix, Premier Tech, Quebec, Canada) at the Open Top Chamber Facility located at the USDA-ARS National Soil Dynamics Laboratory, Auburn, AL, USA. Open top chambers (OTC) (Rogers et al. 1983), encompassing 7.3 m² of ground surface area, were used to deliver target [CO₂] of ~ 410 ppm (ambient) or ambient plus 200 ppm (elevated) [CO₂] during light hours using a delivery and monitoring system described elsewhere (Mitchell et al. 1995). There were four replicate chambers of each CO₂ level for a total of eight experimental plots. Each OTC held two pots of each cultivar to have two sub-replicates for each plot. The experiment was conducted as a split plot design with CO₂ level being the main plot factor and cultivar being the split plot factor. Mid-day photosynthesis and A–Ci curves were performed when plants were at the beginning of pod development (R3, Fehr et al. 1971, 15 July) and at the beginning of seed filling (R5, 26 July) according to growth stages defined by Fehr et al. (1971). Relative chlorophyll content and leaf hyperspectral reflectance measurements were performed concurrently with photosynthetic parameter measurements. More detailed information on experimental design was previously reported by Soba et al. (2020).

Experiment 2: soybean cultivar response to high night temperatures

Four soybean cultivars (PI360846, DS25-1, PI458098, and Agx9) were planted in 3.8 L pots containing a peat-moss: perlite potting mixture (2:1) on 1 May 2019. Plants were grown at the Auburn University Plant Science Research Center greenhouse complex. Temperatures were maintained at 28/20 °C (day/night) until plants reached the first flowering stage (R1). To impose night temperature treatments, plants were then moved to two Conviron CMP 6010 growth chambers (Conviron, Manitoba, Winnipeg, Canada) maintained on a 12 h photoperiod (1200 µmol m⁻² s⁻¹ PAR) with 50/70% RH (day/night). Control plants were grown at 30/20 °C (day/night) and high night temperature plants were grown at 30/30 °C (day/night). Three replicates per cultivar and chamber were used and the whole experiment was repeated twice. Fourteen days after temperature treatments were imposed, mid-day photosynthesis, A–Ci curves, LCC, and leaf hyperspectral reflectance were performed as explained below.

Experiment 3: peanut cultivar response to drought

Six peanut cultivars (AUG16-28, AU17, 18H19-3738, G06-G, AU8-19, and AU18-21) were planted at the Auburn University Plant Science Research Center greenhouse complex on 21 April 2019. Plants were grown in 20 L pots containing a mixture of sand and sandy-loam field soil (1:1, w/w) collected from EV-Smith Research Center, Shorter, AL, USA. Plants were maintained under well-watered conditions (80% relative soil water content, RSWC) until 60 days old; at this time, the drought experiment was initiated. Weighing pots every 2–3 days initially and every day towards the end of the experiment allowed RWSC to be gravimetrically maintained. Well-watered plants were maintained at 80% RSWC while drought plants were maintained at a 30% RSWC. Four replicates per cultivar and stress treatment were used in this experiment. At 20 and 40 days after drought initiation (i.e., 80- and 100-day-old plants), mid-day photosynthesis, A–Ci curves, LCC, and leaf hyperspectral reflectance measurements were performed as explained below.

Physiological parameter assessments

In this study, mid-day photosynthesis, A–Ci curves, and SPAD measurements were taken from 3 different experiments and coupled with full-range (350–2500 nm), high-resolution (3–8 nm) spectral reflectance measurements taken with a Field Spec Hi-Res four field spectrometer (Analytical Spectral Devices, Boulder, CO, USA) to predict physiological parameters that characterize photosynthetic traits.

Mid-day photosynthesis measurements

Depending on experiment size, mid-day photosynthesis measurements were taken one day before A–Ci curves using two or three LI-6400 (Li-Cor Biosciences, Lincoln, NE, USA) systems. Measurements were performed on fully expanded young leaves corresponding with the third/forth leaf from the top in soybean, and second/third leaf from the top of the main stem in peanut. Prior to measurements, systems were set to match environmental growth conditions (light intensity and temperature) and maintained at a relative humidity of 60–70%. While photosynthesis measurements were in progress, relative chlorophyll content and spectral reflectance measurements were also performed on the same leaves using a SPAD meter (Minolta SPAD-502, Spectrum Technologies Inc., Plainfield, IL, USA) and the Field Spec Hi-Res 4 field spectrometer, respectively.

A–Ci curves

To calculate maximum rate of rubisco-catalyzed carboxylation (V_c,max) and maximum electron transport rate supporting RuBP regeneration (J_max), A–Ci curves were performed at different developmental stages in each experiment. In general, the A–Ci curves were the same for peanut and soybean except for different light saturation points: 1750 μmol m⁻² s⁻¹ PAR for soybean (Ainsworth et al. 2004) and 2000 μmol m⁻² s⁻¹ PAR for peanuts (Ferreyra et al. 2000). Photosynthesis was initially induced at the growth [CO₂] (410 ppm for ambient and 610 ppm for elevated CO₂ treatments), and then [CO₂] was reduced stepwise to the lowest concentration of 50 ppm. Afterwards, [CO₂] was increased stepwise to the highest CO₂ concentration of 1500 ppm. A total of 11 measurements per curve were recorded (Sanz-Sáez et al. 2017). During measurements, block temperature was set at 28 °C (i.e., mean mid-day temperature at Auburn, AL). The equations and spreadsheet developed by Sharkey et al. (2007) were used to calculate V_c,max and J_max normalized at 25 °C as it has been demonstrated by (Khan et al. 2021) that different temperatures and the effect on reflectance does not affect prediction of these normalized parameters. While A–Ci curves were taken, concurrent spectral reflectance measurements were performed on the same leaves.

Relative chlorophyll content

Relative chlorophyll content was taken on the same mid-day photosynthesis leaves using a SPAD-502 chlorophyll meter (Konica Minolta, Tokyo, Japan). Five subsample measurements per leaf were collected and averaged.

Leaf spectral reflectance measurements

Leaf spectral reflectance was measured with a FieldSpec Hi-Res 4 concurrently on the same leaves used for photosynthetic measurements. This device has three sensors with a full spectro-radiometer range of 350–2500 nm, with a resolution of 3 nm in visible (VIS; 350–700) and near-infrared (NIR; 700–1000 nm) and 8 nm in shortwave-infrared (SWIR; 1000–2500 nm). Measurements were taken via a leaf clip coupled to a fiber-optic cable. The FieldSpec has a radiometrically calibrated internal light source, which was standardized for relative reflectance using white reference measurements every 15 min. For each leaf, 6 reflectance measurements were recorded on different regions of a single leaf per pot. We used the FieldSpectra package in R to average the six samples and align the VIS, NIR, SWIR sensors with a spectral splice correction (Serbin et al. 2014; Yendrek et al. 2017).

To accomplish the second research aim, we simulated if a more limited spectral range (corresponding to other remote sensing devices) would be able to estimate photosynthetic parameters with the same accuracy as the full-range spectra achieved with the Field Spec HiRes4. Simple spectral resampling of four different sensors was performed to simulate commercial spectrophotometer sensors, such as the UniSpec-DC VIS/NIR (310–1100 nm; PP Systems, Amesbury, MA, USA), the USB 2000 VIS/NIR (340–1014 nm; Ocean Optics, Dunedin, FL, USA), and the Liga SWIR spectrophotometer (850–1888 nm; STEAG Micro Parts, Dortmund, Germany). We also included a resampling simulation for the bands and bandwidths of the ESA Copernicus Sentinel-2 satellite, with 12 spectral bands (443, 494, 560, 665, 704, 740, 781, 834, 944, 1375, 1612, and 2194 nm) representing VIS, NIR, and SWIR (see more in Drusch et al. 2012; Segarra et al. 2020).

Statistical analysis of measured and estimate values

Statistical analyses were conducted using R Studio (RStudio Team 2020) and Python 3.7 (Python Software Foundation, https://www.python.org) via a Jupiter notebook (Wofford et al. 2019). Effects of abiotic stress treatments and differences between cultivars on studied variables were assessed using analysis of variance (ANOVA) in R Studio. We also analyzed correlations between photosynthetic parameters against each spectrum band by Pearson’s correlation using R Studio.

With respect to the different advance regression models, we used the SciPy module (Jones et al. 2001; Varoquaux et al. 2015) in Python 3.7 and the Scikit-Learn library for the estimation of different parameters to estimate determination (R²) and the root means squared error (RMSE). For cross-validation, we used the “train test split method” where, we split our data into training (60% of the data used to build the model) and testing (40% of the data used to test the model). This method quantifies the prediction error, the RMSE, which measures the average prediction error made by the model in predicting the outcome for an observation. That is, the average difference between the observed known outcome values and the values predicted by the model. Associations between photosynthetic parameters (response variables) and the leaf reflectance spectrum (explanatory) variables were analyzed using four advances models: (i) Partial Least Squares Regression (PLSR) is based on the dimension reduction method (Wold et al. 2001). For this model, we used between 5 and 11 components, choosing the number of components that gave the highest R² and the lower RSME; (ii) Least Absolute Shrinkage and Selection Operator (LASSO) is a shrinkage method (Tibshirani 1996); (iii) Bayesian ridge (BR) and (iv) Automatic relevance determination regression (ARDR) are both high-dimensional methods (Neal 1996; Tipping 2001). Figures were prepared using the matplotlib (Hunt 2019) and Seaborn Python (Waskom et al. 2017) modules in Python 3.7.

Results

Effect of abiotic stress and cultivar on photosynthetic parameters

Analyzing the effect of abiotic stress and cultivars can yield valuable insights into phenotypic range of variation within each experiment. In Experiment 1, the two soybean cultivars showed significant effects of [CO₂] on mid-day photosynthesis and LCC, but not on V_c,max and J_max (Table 1 and Fig. S2). We observed treatment effects for mid-day photosynthesis and LCC (Table 1a). In summary, phenotypic variation was noticeable with a range of 17.01–36.22 µmol m⁻² s⁻¹ for mid-day photosynthesis, 34.55–51.35 for LCC, 182.9–348.4 µmol m⁻² s⁻¹ for V_c,max, 174.7–263.7 µmol m⁻² s⁻¹ for J_max, and 29.4–30.37 °C for leaf temperature. In Experiment 2, four soybean cultivars were grown under high night temperature (30/30 °C day/night) for comparison to controls (30/20 °C day/night). Cultivar effects with treatment showed a significant effect on mid-day photosynthesis, LCC, and J_max (Table 1b). Overall, phenotypic variation was noticeable with a range of 11.52–32.68 µmol m⁻² s⁻¹ for mid-day photosynthesis, 34.01–53.95 for LCC, 48.01–135.2 µmol m⁻² s⁻¹ for V_c,max, 61.01–165.1 µmol m⁻² s⁻¹for J_max, and 29.9–30.33 °C for leaf temperature. In Experiment 3, the effect of drought was significant for all measured peanut parameters except for V_c,max and J_max (Table 2). Cultivars only showed significant effects for LCC and J_max. The interaction effects of drought and cultivars was only slightly significant for V_c,max (P = 0.094). Phenotypic variation was perceptible since mid-day photosynthesis ranged from 5.051 to 26.41 µmol m⁻² s⁻¹, LCC varied from 42.30 to 52.45, V_c,max varied from 64.38 to 171.3 µmol m⁻² s⁻¹, J_max ranged from 79.3 to 206.1 µmol m⁻² s⁻¹, and 28.6 to 30.5 °C for leaf temperature. When phenotypic variation of all three experiments was considered together, the range for mid-day photosynthesis was 5.051–36.22 µmol m⁻² s⁻¹, 34.55–53.95 for LCC, 48.01–348.4 µmol m⁻² s⁻¹ for V_c,max, 61.01–263.7 µmol m⁻² s⁻¹ for J_max, and 26.33–31.55 °C for leaf temperature (Fig. S2, shows the Box plot for each experiment).

Table 1 Mean values of mid-day photosynthesis (µmol m⁻² s⁻¹), leaf chlorophyll content (LCC, arbitrary units), maximum rate of rubisco-catalyzed carboxylation (V_c,max, µmol m⁻² s⁻¹), maximum electron transport rate supporting RuBP regeneration (J_max, µmol m⁻² s⁻¹), and leaf temperature (°C) per each treatment. (a) Experiment 1: two varieties of soybean grown at 410 ppm and 610 ppm of [CO₂]; n = 32. (b) Experiment 2: four soybean varieties grown at low (20 °C) and high (30 °C) night temperature; n = 48

Full size table

Table 2 Mean values of midday photosynthesis (µmol m⁻² s⁻¹), leaf chlorophyll content (LCC, arbitrary units), maximum rate of rubisco-catalyzed carboxylation (V_c,max, µmol m⁻² s⁻¹), maximum electron transport rate supporting RuBP regeneration (J_max, µmol m⁻² s⁻¹), and leaf temperature (°C) in six varieties of peanut grown under well-watered (WW, 80% SWC) and water-stress (WS, 30% SWC) conditions

Full size table

Relationships between spectral signatures and photosynthetic parameters

Figure 1 presents the sensitivity of leaf reflectance spectrum for different species and abiotic stresses. Under high night temperature, soybean reflectance spectrum shows higher variability than the control with a larger peak at ~ 550 nm and wider reflectance band between ~ 750–1400, 1550–1800 and 2000–2300 nm (Fig. 1a, b). Elevated CO₂ in soybean tended to reduce variability of the reflectance spectrum between ~ 500–600 and 750–1400 while maintaining the variability in the reflectance spectrum between 1550–1800 and 2000–2300 nm (Fig. 1c, d). In peanut, drought increased variability at all wavelengths with the exception of the 500–600 nm range (Fig. 1e, f). When comparing reflectance of the two legume species, we noted that peanut added a lot of spectral variation in the range from 750 to 2300 nm, probably due to the drought treatment; meanwhile soybean added more variability in the 500–600 nm range (Fig. S1).

Pearson’s correlations were performed to highlight which zones of spectral signatures presented negative or positive correlations with each measured parameter. Pearson’s correlations between the parameter and each wavelength were presented separately for soybean (Fig. 2a), peanut (Fig. 2b), and both species combined (Fig. 2c). Regarding soybean V_c,max and J_max values, correlation against each band showed significant (P < 0.05) negative values (Pearson coefficient around − 0.6) in the VIS (400 nm) and in almost all SWIR (1400–2500 nm) bands (Fig. 2a). On the other hand, mid-day photosynthesis and LCC presented lower and no significant correlation coefficients against each band from the reflectance spectrum. In the case of peanut (Fig. 2b), photosynthesis values against each wavelength band showed significant correlation (r = − 0.6, P < 0.05) in VIS–NIR (400–1000 nm) bands. LCC and each wavelength showed strong correlation (r = − 0.7, P < 0.05) in the NIR (700 nm). For V_c,max and J_max, the correlation against each wavelength was very low or non-significant (Fig. 2b). With increased variability from combining all experiments, we could observe that mid-day photosynthesis against each wavelength showed a significant correlation (r = − 0.5, P < 0.05) in the VIS (400 nm). Regarding the coefficient of correlation between V_c,max and J_max, significance (r = 0.6, P < 0.05) in the VIS (400 nm) and most of the SWIR (1400–2500 nm) bands indicated an improvement relative to species analyzed separately. For this reason, we ran all advance models using combined phenotypic and spectral data from each species and environmental condition.

Estimating photosynthetic parameters using field spectroscopy and advance regression models

To test how accurately a given model estimated different photosynthetic parameters, we presented the coefficient of determination (R²) and RMSE for each model and mean parameter, i.e., interpreted as the proportion of information in data that is explained by each model (Fig. 3). Since estimation of the V_c,max and J_max parameters did not work well in the peanut experiment but worked well for the soybean (Table S1), and since the LCC estimation does not work with soybean, we decided to combine these three experiments and focus on the combination of the two crop species in this manuscript (Fig. 3). Mid-day photosynthesis showed a higher R² (0.62) and low RMSE (4.79) using the PLSR model using 10 components, followed by BR (R² = 0.41 and RMSE = 5.92) with the worst model being the ARDR (R² = 0.28 and RMSE = 6.55) (Fig. 3a). LCC was better assessed by PLSR (R² = 0.56 and RMSE 3.83) using 10 components, followed by ARDR (R² = 0.34 and RMSE = 4.71) with the BR model showing the worst performance (R² = 0.08 and RMSE = 5.55; Fig. 2b). The best V_c,max model was obtained by PLSR (R² = 0.70 and RMSE = 42.80) using nine components followed by the other three models with similar values (R² = 0.56–0.59; RMSE = 50.11–52.03). Regarding J_max, the best model was PLSR (R² = 0.50 and RMSE = 35.83) using nine components closely followed by Lasso (R² = 0.46 and RMSE = 37.1) and BR (R² = 0.45 and RMSE = 37.41), with ARDR (R² = 0.40 and RMSE = 39.29) being the worst model.

For each of the four models, we calculated the coefficient of weight for each band and model (Fig. 4). These coefficients showed waveband contributions along the VIS–NIR–SWIR spectrum for photosynthetic parameter estimations using leaf reflectance spectrum of pooled species, cultivars, and growing conditions. The coefficient of weight for estimating mid-day photosynthesis using PLSR showed maximum values around 400, 750, and 1750 nm, while ARDR and LASSO showed high coefficient weights at 400 nm. On the other hand, BR did not show any remarkable coefficient weights for mid-day photosynthesis (Fig. 4a). With respect to LCC, PLSR showed maximum coefficients at 400, 750, and 1750 nm, while ARDR showed a peak around 400 nm (Fig. 4b). LASSO and BR showed very low coefficients at all wavelengths (Fig. 4b). In Fig. 4c, we can observe the different coefficients of each band for V_c,max, where the maximum peaks were at 400, 700 and around 2000 nm for PLSR, BR and LASSO, while for ARDR it was only at 400 and 750 nm. For estimates of J_max, the highest coefficient weights for PLSR were located in SWIR (2200–2300), followed by NIR (900–1100). For the LASSO model, the strongest areas were at 400, 750, and 1750 nm (Fig. 4d), while the highest coefficients were found in the SWIR (1400–2500 nm) for BR and ARDR.

Scaling up estimations of photosynthetic parameters for potential hyperspectral aerial or satellite applications

To assess their ability to estimate photosynthetic parameters compared to full spectra captured by the Field Spec Hi-Res4 (VIS–NIR–SWIR, 350–2500 nm), we simulated other sensors with limited wavelength ranges, specifically VIS–NIR (350–1000 nm), NIR–SWIR (1000–2500 nm), SWIR (1400–2500 nm), and the 12 wavelength bands of Sentinel-2 satellites (Table S2). To test this, we used reflectance data acquired by the Field Spec Hi-Res4 and separated the reflectance data according to the wavelength range of each before mentioned sensor. We then performed photosynthetic estimations using the same 4 models (PLSR, BR, ARDR, and LASSO).

Table 3, Figs. 5, and 6 show estimations of photosynthetic parameters using pooled data from both species. For mid-day photosynthesis and LCC, simulations with different sensors with just the VIS–NIR (350–1000 nm), NIR–SWIR (1000–2500 nm), and SWIR (1400–2500 nm) spectrum regions were best performed using PLSR compared to BR, ARDR, and LASSO models (Table 3; Figs. 5, 6). However, LCC was estimated best by BR, ARDR, and LASSO using the simulated ESA Copernicus Sentinel-2 satellite multispectral bands (Table 3; Figs. 5, 6). Concerning estimation of V_c,max within the VIS–NIR range (350–1000 nm) and the ESA Copernicus Sentinel-2 satellite sensors, the best performing model was PLSR using 10 components (R² = 0.63 and 0.53, respectively). For simulations of the NIR–SWIR (1000–2500 nm), SWIR (1400–2500 nm), BR was the best model for assessing V_c,max (R² = 0.62 and 0.60, respectively). For estimating J_max with the VIS–NIR (350–1000 nm) sensor, the best model was LASSO (R² = 0.42). For the range NIR–SWIR (1000–2500 nm), SWIR (1400–2500 nm) ARDR estimated J_max similarly (R² = 0.51). PLSR, BR, and LASSO presented the same coefficient of determination (R² = 0.41) when using ESA Copernicus Sentinel-2 satellite simulated wavebands to assess J_max.

Table 3 Coefficient of determination (R²) and root mean squared error (RMSE) of mid-day photosynthesis (µmol m⁻² s⁻¹), leaf chlorophyll content (arbitrary units) of all species pooled together based on leaf reflectance spectra at different ranges [VIS–NIR (350–1000 nm), NIR-SWIR (1000-–2500 nm), SWIR (1400–2500 nm), and Sentinel-2 bands] through advance regression models: Partial Least Squares Regression (PLSR), Bayesian Ridge (BR), the Automatic Relevance Determination Regression (ARDR), and Least Absolute Shrinkage and Selection Operator (LASSO)

Full size table

Regarding comparison of different sensors (VIS–NIR, NIR–SWIR, and SWIR) against original FieldSpec data (VIS–NIR–SWIR), we observed that estimation of mid-day photosynthesis by the different models was similar to that of simulated sensors (Figs. 3, 5, 6 and Table 3). With ESA Copernicus Sentinel-2 satellite, estimation of mid-day photosynthesis did not work. Estimation of LCC using ESA Copernicus Sentinel-2 satellite was lower than when the whole spectrum was used. Regarding the estimation of the V_c,max simulating ESA Copernicus Sentinel-2 satellite, the PLSR and LASSO presented an R² (0.50) that was a little lower than the FieldSpec (R² = 0.70). With respect to J_max estimation, we observed that coefficients for the simulated NIR–SWIR and SWIR sensor ranges were very similar (but slightly lower) to the full-range FieldSpec (Figs. 3, 5, 6 and Table 3). The VIS–NIR and ESA Copernicus Sentinel-2 satellite simulations presented values that were lower than using the whole spectrum (Figs. 3, 5, 6 and Table 3).

Discussion

Estimating photosynthetic parameters using field spectroscopy and advance regression models

The main objective of this research was to assess which advanced statistical model (PLSR, BR, ARDR, and LASSO) was the most successful in estimating different photosynthetic parameters using leaf reflectance spectra (VIS–NIR–SWIR, 350–2500 nm) from two legume species. The use of advance regression models to predict different physiological parameters needs ample phenotypic variation to be accurate (Kuhn and Johnson 2013). Since statistical effects of different treatments over some variables were not significant (Table 1), we combined findings from three experiments (two different species) to increase phenotypic range for better parameter estimation with all models rather than examining each species separately (Table S1). Similar approaches have been used recently to increase phenotypic variation and obtain a better prediction model by including different species and/or cultivars (Doughty et al. 2011; Serbin et al. 2012; Choquette et al. 2019), different abiotic stresses such as drought (Silva-Perez et al. 2018), or elevated atmospheric ozone concentrations (Ainsworth et al. 2014; Yendrek et al. 2017).

In our study, when data from both legumes were combined, almost all of the advanced models were able to estimate V_c,max and J_max at greater than R² > 0.50 (Fig. 3). Of the four models used to predict these two parameters, PLSR was the overall best model for V_c,max (R² = 0.70 and RMSE 42.80) and J_max (R² = 0.50 and 35.83), followed by LASSO and BR for V_c,max (R² = 0.59 with RMSE 50.11; 0.59 with a RMSE 50.15, respectively), and BR and LASSO for J_max (R² = 0.45 with a RMSE 37.11; 0.46 with a RMSE 37.41, respectively) (Fig. 3). This may be because the PLSR model does not estimate shrinkage when performing variable selection (spectral wavebands) as do BR, ARDR, and LASSO (Neal 1996; Tipping 2001; Wold et al. 2001). Others have also found that PLSR and LASSO had similar estimation capacities, showing that LASSO band block contribution was similar to the PLSR model (Fu et al. 2020). Specific reasons why PLSR was more efficient at estimating photosynthetic parameters assessed in this study are discussed in detail below.

Successful predictions of V_c,max (R² = 0.89 with a RMSE 15.4) and J_max (R² = 0.93 with a RMSE 18.67) using PLSR have been previously obtained by combining two tree species (Serbin et al. 2012); this study showed statistically significant phenotypic variation due to temperature treatments as well as species. In our study, the lower R² associated with V_c,max and J_max estimates could be attributed to the lack of effect of some environmental treatments (temperature, elevated CO₂, and drought) and cultivars over these parameters (Table 1). However, Ainsworth et al. (2014) showed a significant correlation between measured and estimated V_c,max (R² = 0.88 with a RMSE 13.4) with the effect of treatments (elevated ozone) and cultivars not being significant. This demonstrated that good parameter estimation and significant treatment or cultivar effects are not mutually exclusive and that it is only necessary to have sufficient range in variation of phenotypic data. For example, Ainsworth et al. (2014) and Serbin et al. (2012) noted V_c,max variation (60–280 μmol m⁻² s⁻¹ and 40–170 μmol m⁻² s⁻¹, respectively) similar to the values obtained in this study when all three experiments were combined (48–348 μmol m⁻² s⁻¹ for the current experiment). Since the ranges in variation of V_c,max and J_max data are similar but higher to those obtained in the above-mentioned research, why are R² values in the current study for V_c,max (R² = 0.70) and J_max (R² = 0.50) lower and RSME (42.80 and 35.83, respectively) higher than in those studies? Tibshirani (1996) has noted that PLSR models lose accuracy when estimating parameters across different environments. Research by Serbin et al. (2012) and Ainsworth et al. (2014) were each performed in one environment (greenhouse and field, respectively) for one growing season, while our study combined information from three experiments representing distinct environments (greenhouse, growth chambers, and open top chambers) with plants grown at very different environmental conditions. In an experiment with several corn breeding lines grown under ambient and elevated ozone repeated over three growing seasons, Yendrek et al. (2017) obtained V_c,max estimations (R² = 0.55 with RMSE 6.61, and 0.65 with a RMSE 6.60) similar to those reported in our study but with a RMSE lower than ours. This was probably due to the effects of changing environments on PLSR performance (Serbin et al. 2012; Ainsworth et al. 2014). Regarding the lower RMSE obtained in the above-mentioned publications (Serbin et al. 2012; Ainsworth et al. 2014; Yendrek et al. 2017) in comparison with those obtained in our research, this could be due to the different cross-validation used in our approach. In our cross-validation, the test error rate can be highly variable, depending on which observations are included in the training set and which observations are included in the validation set. This may be the reason for the higher RMSE values observed in V_c,max and J_max. Also the high RMSE values can be due to a higher phenotypic range as a result of including two crop species grown in three very different environments. This highlights the importance of performing calibration experiments under multiple environments. Other issue that can arise is the use of these models with completely new set of cultivars and experimental conditions as was tested in Yendrek et al. (2017). In such a case, it would be recommendable to test model precision by measuring spectral reflectance under new conditions and corroborating model estimates of extreme values for V_c,max with ground truth measurements of the photosynthetic parameter. Although this extra step will take more time, this procedure could serve to test model accuracy and help improve the model with new training data.

To solve this multiple environment/location problem, new approaches need to be developed and implemented. For example, Fu et al. (2020) increased prediction model accuracy by stacking different machine learning algorithms (i.e., R² increases of 0.1–0.2 over single prediction models). Another alternative would be creation of a consortium of scientists interested in using hyperspectral reflectance technology to predict physiological traits. Their combined expertise would create strong standardized calibrations that could be used across multiple environments as has been done for assessing forage quality traits using NIRS technology (i.e., NIRS Consortium; https://www.nirsconsortium.org/).

Estimation of mid-day photosynthesis using PLSR, BR, ARDR, and LASSO presented lower R² values (≈ 0.29–0.62) than for V_c,max and J_max (Fig. 3) since in situ photosynthetic measurements are likely more influenced by environment (Sanz-Sáez et al. 2017; Soba et al. 2020) than by leaf structure and biochemistry (Serbin et al. 2012; Ainsworth et al. 2014). Thus, a looser estimation was expected. Due to environmental variability, few reports have estimated mid-day photosynthesis. However, our PLSR estimation was better than the observations of Vitrack-Tamam et al. (2020) for cotton stomatal conductance (R² = 0.23); this was likely due to the lower range spectral reflectance device used in their experiment (633–1659 nm). Similar estimations of net photosynthesis were accomplished using the scaled photochemical reflectance index and a FieldSpec Hi-Res Device (Kumari et al. 2012).

Regarding spectral wavelength specific coefficients for each estimation model for V_c,max and J_max, the most frequent selection for the four models was the VIS waveband (Fig. 4) where chlorophyll and other pigments have strong absorption features (Peñuelas and Filella 1998). However, these models also used wavebands in the NIR and SWIR, similar to other studies (Hansen and Schjoerring 2003; Doughty et al. 2011; Serbin et al. 2012; Ainsworth et al. 2014; Yendrek et al. 2017). In addition, Rubisco has several relatively broad spectral absorption features in the NIR and SWIR (Elvidge 1990). These selections of spectral region combinations indicate that V_c,max and J_max spectral signatures are not simply a function of chlorophyll content, which suggests that more information is needed beyond the VIS–NIR wavebands to estimate such complex processes. The inclusion of a broader range of wavebands, due in part to less penalizations, is likely why the PLSR model outperformed BR, ARDR, and LASSO by more effectively capturing the broader spectral absorption features of Rubisco. For example, the V_c,max LASSO model only selected specific coefficients at 540, 680, 720, 2000, and 2250 nm (Fig. 4c), while the PLSR model had significant coefficient ranges between 400–450, 700–800, and 1750–1900 (Fig. 4c). Photosynthesis and LCC also presented the highest selection of spectral peaks in the VIS, followed by NIR; this has been extensively documented through both vegetation indices that estimate chlorophyll pigment content and also by the Photochemical Reflectance Index (PRI) that predicts photosynthetic efficiency through a zeaxanthin absorption feature (Gamon et al. 1997; Gitelson et al. 2005; Schlemmera et al. 2013).

We also present a more in-depth comparison of the four models. As shown in Fig. 3, the R² of models do not present significant differences between each other, although we can see that the models used different numbers of coefficients to estimate each parameter (Fig. 4). This was reflected in the algorithm differences in each model approach to parsimony, the simple explanation of an occurrence involving the fewest entities, assumptions, or changes. This means that a fewer number of weight coefficients were used to estimate the different parameters (Vandekerckhove and Matzke 2015). In our study, all PLSR models (blue line in Fig. 4) used VIS, NIR, and SWIR wavelengths, but potentially over-fitted by an over-inclusion of predictor variables (Geladi et al. 1986; Wold et al. 2001). This contrasts to the BR (in green), ARDR (in red), and LASSO (in yellow) models (Fig. 4), which used more specific and limited spectra than restricted models that penalize the lesser coefficients (Neal 1996; Tibshirani 1996; Tipping 2001).

Scaling up estimations of photosynthetic parameters for potential hyperspectral aerial or satellite applications

The second aim of this study was to simulate different sensors with more limited spectral coverage (VIS–NIR, NIR–SWIR, and SWIR), including the ESA Copernicus Sentinel-2 satellite13 bands. We found that estimation of V_c,max using three different sensor ranges (VIS–NIR–SWIR) with the four models performed (R² = 0.50) surprisingly similar to the whole spectrum (Figs. 3 and 5). For J_max, the highest estimation (R² = 0.51) used NIR–SWIR and SWIR data in ARDR. This was quite similar to Meacham-Hensold et al. (2020) who used PLSR models and canopy-level spectra with three different spectral ranges (500–900, 500–1700, and 500–2400 nm) to achieve V_c,max estimations near R² = 0.60 and J_max estimations around R² = 0.40.

We also resampled FieldSpec data to cover the 12 spectral bands of the ESA Copernicus Sentinel-2 satellite; these were quite similar to spectral ranges selected by the coefficients used by the different models to estimate photosynthetic parameters. Concerning the different photosynthetic parameters, only V_c,max was estimated at more than R² = 0.50. This could be related to the carboxylation process (V_c,max) having several relatively broad spectral absorption features in NIR and SWIR centered at 1.5, 1.68, 1.74, 1.94, 2.05, 2.29 µm, etc. (Elvidge 1990), which are in close proximity to several Sentinel-2 wavelength bands (Table S2). Supplementary data (Table S3) and Serbin et al. (2012) showed that wavelengths (490, 610, 690, 710, 1680, 1940, 2200, 2400 nm) used to estimate V_c,max have some bands similar to Sentinel-2. Figure 5d also shows that the spectral regions used in PLSR models were similar to Sentinel bands (Yendrek et al. 2017). The limited success of single-leaf-level estimations of photosynthetic capacities using point-based spectral analysis (Serbin et al. 2015) found considerable promise in airborne and potential promise in space-borne imaging spectroscopy such as the NASA HyspIRI mission (Mariotto et al. 2013). In this regard, hyperspectral imagery through inversion of the Soil-Canopy Observation of Photosynthesis and Energy (SCOPE) model to estimate V_c,max also uses sensor resolutions available in airborne or even precision agriculture technologies (Camino et al. 2019). Recently, one plot-level study using sunlit vegetative reflectance pixels from a single visible near infra-red (VNIR; 400–900 nm) hyperspectral camera reported determination coefficients of R² = 0.79 for V_c,max and R² = 0.59 for J_max (Meacham-Hensold et al. 2020). Thus, our simulation analyses and other recent literature suggest that the wide range of variability in VIS, NIR, and SWIR sensors and the Sentinel-2 multispectral sensor (to a more limited extent) could be employed to estimate photosynthetic parameters (including V_c,max and J_max) with advanced regression models. However, more research needs to be done in this area as one of the limitations of this work was that we measured leaf reflectance with a leaf clip, while UAV and satellites measure canopy reflectance that can be different from single leaf reflectance. For the future, we suggest to test if canopy reflectance measurements at different precision levels can predict leaf level photosynthetic measurements or even canopy-level photosynthesis as has been done with models such as PROSAIL (Berger et al. 2018).

Conclusion and future directions

In this study, we estimated V_c,max and J_max using leaf spectral reflectance data and different advanced regression models with determination coefficients higher than R² = 0.50–0.70. The combination of different species and environmental conditions (elevated [CO₂], high temperature, and drought) increased phenotypic variation and improved model estimations where treatment effects were not significant. To achieve higher coefficients of determination and model performance, this research demonstrated that it is more important to have a wider range of phenotypic variation than a significant effect of a treatment or cultivar. We suggest that estimating photosynthetic capacity from reflectance spectra may be considered sufficiently robust to be useful for several different plant physiological applications, such as abiotic stress detection, improved characterization of photosynthesis process-based crop models, and a prescreening tool in breeding programs. We demonstrated that PLSR was the best model for predicting photosynthetic parameters in comparison to other advanced regression models (BR, ARDR and LASSO). However, new advance regression approaches that combine different regression models may be employed to increase phenotype estimation using this technology. Based on simulation of four limited spectral range sensors (VIS–NIR, NIR–SWIR and SWIR) using a leaf level spectrophotometer, we demonstrated that it is possible to estimate V_c,max with similar precision compared to using the whole VIS–NIR–SWIR spectrum. This research should encourage future studies using different imaging sensors (hyperspectral and multispectral) at different scales for estimating V_c,max and J_max.

Author contribution statement

MLB Experimentation, curation of the data, formal analysis, writing original draft. DS Experimentation, review and editing. TS Experimentation, review and editing. JL Experimentation, review and editing. IA Resource managing, review and editing. JLA Resource managing, review and editing. GBR Experimentation, review and editing. SAP Experimentation, review and editing, resource managing. SCK Conceptualization, data curation, resource managing, formal analysis, supervision, writing original draft. ASS Conceptualization, experimentation, data curation, resource managing, formal analysis, supervision, project administration, writing original draft.

Data availability statement

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ARDR:: Automatic relevance determination regression model
BR:: Bayesian ridge model
J _max :: Maximum electron transport rate supporting RuBP regeneration
LASSO:: Least absolute shrinkage and selection operator model
NIR:: Near-infrared spectral reflectance
PLSR:: Partial least squares regression model
RuBP:: Ribulose 1,5-bisphosphate
SWIR:: Shortwave infrared spectral reflectance
V _c,max :: Maximum rate of rubisco-catalyzed carboxylation
VIS:: Visible spectral reflectance

References

Adachi S, Tsuru Y, Nito N et al (2011) Identification and characterization of genomic regions on chromosomes 4 and 8 that control the rate of photosynthesis in rice leaves. J Exp Bot 62:1927–1938. https://doi.org/10.1093/jxb/erq387
Article CAS PubMed PubMed Central Google Scholar
Ainsworth EA, Rogers A, Nelson R, Long SP (2004) Testing the “source-sink” hypothesis of down-regulation of photosynthesis in elevated [CO₂] in the field with single gene substitutions in Glycine max. Agric for Meteorol 122:85–94. https://doi.org/10.1016/j.agrformet.2003.09.002
Article Google Scholar
Ainsworth EA, Serbin SP, Skoneczka JA, Townsend PA (2014) Using leaf optical properties to detect ozone effects on foliar biochemistry. Photosynth Res 119:65–76. https://doi.org/10.1007/s11120-013-9837-y
Article CAS PubMed Google Scholar
Aranjuelo I, Pardo A, Biel C et al (2009) Leaf carbon management in slow-growing plants exposed to elevated CO₂. Glob Chang Biol 15:97–109. https://doi.org/10.1111/j.1365-2486.2008.01829.x
Article Google Scholar
Aranjuelo I, Cabrerizo PM, Arrese-Igor C, Aparicio-Tejo PM (2013) Pea plant responsiveness under elevated [CO₂] is conditioned by the N source (N₂ fixation versus NO₃- fertilization). Environ Exp Bot 95:34–40. https://doi.org/10.1016/j.envexpbot.2013.06.002
Article CAS Google Scholar
Araus JL, Cairns JE (2014) Field high-throughput phenotyping: the new crop breeding frontier. Trends Plant Sci 19:52–61. https://doi.org/10.1016/j.tplants.2013.09.008
Article CAS PubMed Google Scholar
Araus JL, Kefauver SC, Zaman-Allah M et al (2018) Translating high-throughput phenotyping into genetic gain. Trends Plant Sci 23:451–466. https://doi.org/10.1016/j.tplants.2018.02.001
Article CAS PubMed PubMed Central Google Scholar
Asner GP, Martin RE (2008) Spectral and chemical analysis of tropical forests: scaling from leaf to canopy levels. Remote Sens Environ 112:3958–3970. https://doi.org/10.1016/j.rse.2008.07.003
Article Google Scholar
Berger K, Atzberger C, Danner M et al (2018) Evaluation of the PROSAIL model capabilities for future hyperspectral model environments: a review study. Remote Sens 10:85. https://doi.org/10.3390/rs10010085
Article Google Scholar
Camino C, Gonzalez-Dugo V, Hernandez P, Zarco-Tejada PJ (2019) Radiative transfer V_cmax estimation from hyperspectral imagery and SIF retrievals to assess photosynthetic performance in rainfed and irrigated plant phenotyping trials. Remote Sens Environ 231:111186. https://doi.org/10.1016/j.rse.2019.05.005
Article Google Scholar
Choquette NE, Ogut F, Wertin TM et al (2019) Uncovering hidden genetic variation in photosynthesis of field-grown maize under ozone pollution. Global Chang Biol 25:4327–4338. https://doi.org/10.1111/gcb.14794
Article Google Scholar
Christensen JH, Hewitson B, Busuioc A, Chen A, Gao X, Held R, Jones R, et al (2007) Regional climate projections. In: Climate Change 2007: The physical science basis. Contribution of Working group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, University Press, Cambridge, Chapter 11, ISBN: 978-0-521-88009-1
Coast O, Shah S, Ivakov A et al (2019) Predicting dark respiration rates of wheat leaves from hyperspectral reflectance. Plant Cell Environ 42:2133–2150. https://doi.org/10.1111/pce.13544
Article CAS PubMed Google Scholar
Colombo R, Meroni M, Marchesi A et al (2008) Estimation of leaf and canopy water content in poplar plantations by means of hyperspectral indices and inverse modeling. Remote Sens Environ 112:1820–1834. https://doi.org/10.1016/j.rse.2007.09.005
Article Google Scholar
Couture JJ, Singh A, Rubert-Nason KF et al (2016) Spectroscopic determination of ecologically relevant plant secondary metabolites. Methods Ecol Evol 7:1402–1412. https://doi.org/10.1111/2041-210X.12596
Article Google Scholar
de Oliveira Silva FM, Lichtenstein G, Alseekh S et al (2018) The genetic architecture of photosynthesis and plant growth-related traits in tomato. Plant Cell Environ 41:327–341. https://doi.org/10.1111/pce.13084
Article CAS PubMed Google Scholar
Dechant B, Cuntz M, Vohland M et al (2017) Estimation of photosynthesis traits from leaf reflectance spectra: correlation to nitrogen content as the dominant mechanism. Remote Sens Environ 196:279–292. https://doi.org/10.1016/j.rse.2017.05.019
Article Google Scholar
Dhanapal AP, Ray JD, Singh SK et al (2015) Genome-wide association study (GWAS) of carbon isotope ratio (δ₁₃C) in diverse soybean [Glycine max (L.) Merr.] genotypes. Theor Appl Genet 128:73–91. https://doi.org/10.1007/s00122-014-2413-9
Article CAS PubMed Google Scholar
Doughty CE, Asner GP, Martin RE (2011) Predicting tropical plant physiology from leaf and canopy spectroscopy. Oecologia 165:289–299. https://doi.org/10.1007/s00442-010-1800-4
Article PubMed Google Scholar
Drusch M, Del Bello U, Carlier S et al (2012) Remote sensing of environment Sentinel-2: ESA’s optical high-resolution mission for GMES operational services. Remote Sens Environ 120:25–36. https://doi.org/10.1016/j.rse.2011.11.026
Article Google Scholar
Elvidge CD (1990) Visible and near infrared reflectance characteristics of dry plant materials. Int J Remote Sens 11:1775–1795. https://doi.org/10.1080/01431169008955129
Article Google Scholar
FAOSTAT (2016) The State of Food and Agriculture 2016 (SOFA): Climate change, agriculture and food security. Rome. https://www.fao.org/3/i6030e/I6030E.pdf
Farquhar GD, von Caemmerer S, Berry JA (1980) A biochemical model of photosynthetic CO₂ assimilation in leaves of C₃ species. Planta 149:78–90. https://doi.org/10.1007/BF00386231
Article CAS PubMed Google Scholar
Fehr WR, Caviness CE, Burmood DT, Pennington JS (1971) Stage of development descriptions for soybeans, Glycine max (L.) Merrill. Crop Sci 11:929–931. https://doi.org/10.2135/cropsci1971.0011183X001100060051x
Article Google Scholar
Ferreyra RA, Pachepsky LB, Collino D, Acock B (2000) Modeling peanut leaf gas exchange for the calibration of crop models for different cultivars. Ecol Model 131:285–298. https://doi.org/10.1016/S0304-3800(00)00252-0
Article Google Scholar
Fu P, Meacham-Hensold K, Guan K et al (2020) Estimating photosynthetic traits from reflectance spectra: a synthesis of spectral indices, numerical inversion, and partial least square regression. Plant Cell Environ. https://doi.org/10.1111/pce.13718
Article PubMed PubMed Central Google Scholar
Gamon JA, Serrano L, Surfus JS (1997) The photochemical reflectance index: an optical indicator of photosynthetic radiation use efficiency across species, functional types, and nutrient levels. Oecologia 112:492–501. https://doi.org/10.1007/s004420050337
Article CAS PubMed Google Scholar
Geladi P, Kowalski B (1986) Partial least-squares regression: a tutorial. Anal Chim Acta 185:1–17
Article CAS Google Scholar
Gitelson AA, Viña A, Ciganda V et al (2005) Remote estimation of canopy chlorophyll content in crops. Geophys Res Lett 32:1–4. https://doi.org/10.1029/2005GL022688
Article CAS Google Scholar
Gray SB, Brady SM (2016) Plant developmental responses to climate change. Dev Biol 419:64–77. https://doi.org/10.1016/j.ydbio.2016.07.023
Article CAS PubMed Google Scholar
Hansen PM, Schjoerring JK (2003) Reflectance measurement of canopy biomass and nitrogen status in wheat crops using normalized difference vegetation indices and partial least squares regression. Remote Sens Environ 86:542–553. https://doi.org/10.1016/S0034-4257(03)00131-7
Article Google Scholar
Heckmann D, Schlüter U, Weber APM (2017) Machine learning techniques for predicting crop photosynthetic capacity from leaf reflectance spectra. Mol Plant 10:878–890. https://doi.org/10.1016/j.molp.2017.04.009
Article CAS PubMed Google Scholar
Hunt J (2019) Introduction to Matplotlib. In: Advanced guide to Python 3 programming. Springer, Cham, pp 35–42. https://doi.org/10.1007/978-3-030-25943-3
Book Google Scholar
Jones E, Oliphant T, Peterson P (2001) SciPy: Open source scientific tools for Python, online
Khan HA, Nakamura Y, Furbank RT, Evans JR (2021) Effect of leaf temperature on the estimation of photosynthetic and other traits of wheat leaves from hyperspectral reflectance. J Exp Bot 72:1271–1281. https://doi.org/10.1093/jxb/eraa514
Article CAS PubMed Google Scholar
Koester RP, Skoneczka JA, Cary TR et al (2014) Historical gains in soybean (Glycine max Merr.) seed yield are driven by linear increases in light interception, energy conversion, and partitioning efficiencies. J Exp Bot 65:3311–3321. https://doi.org/10.1093/jxb/eru187
Article CAS PubMed PubMed Central Google Scholar
Kuhn M, Johnson K (2013) Applied predictive modeling. Springer, New York. https://doi.org/10.1007/978-1-4614-6849-3
Book Google Scholar
Kumari M, Patel NR, Raj R et al (2012) Parametric estimation of net photosynthesis in rice from in-situ spectral reflectance measurements. Curr Sci 103:55–61
CAS Google Scholar
Long SP, Bernacchi CJ (2003) Gas exchange measurements, what can they tell us about the underlying limitations to photosynthesis? Procedures and sources of error. J Exp Bot 54:2393–2401. https://doi.org/10.1093/jxb/erg262
Article CAS PubMed Google Scholar
Long SP, Ainsworth EA, Rogers A, Ort DR (2004) Rising atmospheric carbon dioxide: plants FACE the future. Annu Rev Plant Biol 55:591–628. https://doi.org/10.1146/annurev.arplant.55.031903.141610
Article CAS PubMed Google Scholar
Mariotto I, Thenkabail PS, Huete A et al (2013) Hyperspectral versus multispectral crop-productivity modeling and type discrimination for the HyspIRI mission. Remote Sens Environ 139:291–305. https://doi.org/10.1016/j.rse.2013.08.002
Article Google Scholar
Meacham-Hensold K, Fu P, Wu J et al (2020) Plot-level rapid screening for photosynthetic parameters using proximal hyperspectral imaging. J Exp Bot 71:2312–2328. https://doi.org/10.1093/jxb/eraa068
Article CAS PubMed PubMed Central Google Scholar
Meena RS, Das A, Yadav GS, Lal R (2018) Legumes for soil health and sustainable management. Springer, Singapore
Book Google Scholar
Mitchell RJ, Runion GB, Prior SA et al (1995) Effects of nitrogen on Pinus palustris foliar respiratory responses to elevated atmospheric CO₂ concentration. J Exp Bot 46:1561–1567. https://doi.org/10.1093/jxb/46.10.1561
Article CAS Google Scholar
Neal RM (1996) Bayesian learning for neural networks. Springer Verlag, New York
Book Google Scholar
Oakley CG, Savage L, Lotz S et al (2018) Genetic basis of photosynthetic responses to cold in two locally adapted populations of Arabidopsis thaliana. J Exp Bot 69:699–709. https://doi.org/10.1093/jxb/erx437
Article CAS PubMed Google Scholar
Ort DR, Merchant SS, Alric J et al (2015) Redesigning photosynthesis to sustainably meet global food and bioenergy demand. Proc Natl Acad Sci USA 112:8529–8536. https://doi.org/10.1073/pnas.1424031112
Article CAS PubMed PubMed Central Google Scholar
Oury FX, Godin C, Mailliard A et al (2012) A study of genetic progress due to selection reveals a negative effect of climate change on bread wheat yield in France. Eur J Agron 40:28–38. https://doi.org/10.1016/j.eja.2012.02.007
Article Google Scholar
Peñuelas J, Filella L (1998) Technical focus: visible and near-infrared reflectance techniques for diagnosing plant physiological status. Trends Plant Sci 3:151–156. https://doi.org/10.1016/S1360-1385(98)01213-8
Article Google Scholar
Petisco C, García-Criado B, Mediavilla S et al (2006) Near-infrared reflectance spectroscopy as a fast and non-destructive tool to predict foliar organic constituents of several woody species. Anal Bioanal Chem 386:1823–1833. https://doi.org/10.1007/s00216-006-0816-4
Article CAS PubMed Google Scholar
Reynolds M, Foulkes J, Furbank R et al (2012) Achieving yield gains in wheat. Plant Cell Environ 35:1799–1823. https://doi.org/10.1111/j.1365-3040.2012.02588.x
Article PubMed Google Scholar
Rogers A (2014) The use and misuse of V_c_,max in earth system models. Photosynth Res 119:15–29. https://doi.org/10.1007/s11120-013-9818-1
Article CAS PubMed Google Scholar
Rogers HH, Bingham GE, Cure JD et al (1983) Responses of selected plant species to elevated carbon dioxide in the field. J Environ Qual 12:569–574. https://doi.org/10.2134/jeq1983.00472425001200040028x
Article CAS Google Scholar
RStudio Team (2020) RStudio: integrated development environment for R. RStudio. PBC, Boston
Google Scholar
Sanz-Sáez Á, Erice G, Aguirreolea J et al (2012) Alfalfa yield under elevated CO₂ and temperature depends on the Sinorhizobium strain and growth season. Environ Exp Bot 77:267–273. https://doi.org/10.1016/j.envexpbot.2011.11.017
Article Google Scholar
Sanz-Sáez Á, Koester RP, Rosenthal DM et al (2017) Leaf and canopy scale drivers of genotypic variation in soybean response to elevated carbon dioxide concentration. Glob Chang Biol 23:3908–3920. https://doi.org/10.1111/gcb.13678
Article PubMed Google Scholar
Schlemmera M, Gitelson A, Schepersa J et al (2013) Remote estimation of nitrogen and chlorophyll contents in maize at leaf and canopy levels. Int J Appl Earth Obs Geoinf 25:47–54. https://doi.org/10.1016/j.jag.2013.04.003
Article Google Scholar
Segarra J, Buchaillot ML, Araus JL, Kefauver SC (2020) Remote sensing for precision agriculture: Sentinel-2 improved features and applications. Agronomy 10:1–18. https://doi.org/10.3390/agronomy10050641
Article CAS Google Scholar
Serbin SP (2012) Spectroscopic determination of leaf nutritional, morphological, and metabolic traits. https://doi.org/10.6084/M9.FIGSHARE.745311.V1
Serbin SP, Dillaway DN, Kruger EL, Townsend PA (2012) Leaf optical properties reflect variation in photosynthetic metabolism and its sensitivity to temperature. J Exp Bot 63:489–502. https://doi.org/10.1093/jxb/err294
Article CAS PubMed Google Scholar
Serbin SP, Singh A, McNeil BE et al (2014) Spectroscopic determination of leaf morphological and biochemical traits for northern temperate and boreal tree species. Ecol Appl 24:1651–1669. https://doi.org/10.1890/13-2110.1
Article Google Scholar
Serbin SP, Singh A, Desai AR et al (2015) Remotely estimating photosynthetic capacity, and its response to temperature, in vegetation canopies using imaging spectroscopy. Remote Sens Environ 167:78–87. https://doi.org/10.1016/j.rse.2015.05.024
Article Google Scholar
Sharkey TD, Bernacchi CJ, Farquhar GD, Singsaas EL (2007) Fitting photosynthetic carbon dioxide response curves for C₃ leaves. Plant Cell Environ 30:1035–1040. https://doi.org/10.1111/j.1365-3040.2007.01710.x
Article CAS PubMed Google Scholar
Silva-Pérez V, Furbank RT, Condon AG, Evans JR (2017) Biochemical model of C₃ photosynthesis applied to wheat at different temperatures. Plant Cell Environ 40:1552–1564. https://doi.org/10.1111/pce.12953
Article CAS PubMed Google Scholar
Silva-Perez V, Molero G, Serbin SP et al (2018) Hyperspectral reflectance as a tool to measure biochemical and physiological traits in wheat. J Exp Bot 69:483–496. https://doi.org/10.1093/jxb/erx421
Article CAS PubMed Google Scholar
Simkin AJ, López-Calcagno PE, Raines CA (2019) Feeding the world: Improving photosynthetic efficiency for sustainable crop production. J Exp Bot 70:1119–1140. https://doi.org/10.1093/jxb/ery445
Article CAS PubMed PubMed Central Google Scholar
Soba D, Shu T, Runion GB et al (2020) Effects of elevated [CO₂] on photosynthesis and seed yield parameters in two soybean genotypes with contrasting water use efficiency. Environ Exp Bot 178:104154. https://doi.org/10.1016/j.envexpbot.2020.104154
Article CAS Google Scholar
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B 58:267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Article Google Scholar
Tipping ME (2001) Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res 1:211–244
Google Scholar
Vandekerckhove J, Matzke D, Wagenmakers E-J (2015) Model comparison and the principle of parsimony. In: Busemeyer JR et al (eds) Oxford handbook of computational and mathematical psychology. Oxford University Press, Oxford
Google Scholar
Varoquaux G, Buitinck L, Louppe G et al (2015) Scikit-learn. GetMobile Mob Comput Commun 19:29–33. https://doi.org/10.1145/2786984.2786995
Article Google Scholar
Vergara-Diaz O, Vatter T, Vicente R et al (2020) Metabolome profiling supports the key role of the spike in wheat yield performance. Cells 9:1025. https://doi.org/10.3390/cells9041025
Article CAS PubMed Central Google Scholar
Vergara-diaz O, Araus L (2020) Assessing durum wheat ear and leaf metabolomes in the field through hyperspectral data. Plant J 102:615–630. https://doi.org/10.1111/tpj.14636
Article CAS PubMed Google Scholar
Vitrack-Tamam S, Holtzman L, Dagan R et al (2020) Random forest algorithm improves detection of physiological activity embedded within reflectance spectra using stomatal conductance as a test case. Remote Sens 12:2213. https://doi.org/10.3390/rs12142213
Article Google Scholar
Von Caemmerer S (2013) Steady-state models of photosynthesis. Plant Cell Environ 36:1617–1630. https://doi.org/10.1111/pce.12098
Article CAS Google Scholar
Waskom M, Botvinnik O, O’Kane D (2017) mwaskom/seaborn: seaborn v0. 8.1, Zenodo
Wofford MF, Boscoe BM, Borgman CL et al (2019) Jupyter notebooks as discovery mechanisms for open science: citation practices in the astronomy community. Comput Sci Eng 22:5–15
Article Google Scholar
Wold S, Sjöström M, Eriksson L (2001) PLS-regression: a basic tool of chemometrics. Chemom Intell Lab Syst 58:109–130. https://doi.org/10.1016/S0169-7439(01)00155-1
Article CAS Google Scholar
Yan D, Wang Y, Murakami T et al (2015) Auxenochlorella protothecoides and Prototheca wickerhamii plastid genome sequences give insight into the origins of non-photosynthetic algae. Sci Rep 5:1–9. https://doi.org/10.1038/srep14465
Article CAS Google Scholar
Yendrek CR, Tomaz T, Montes CM et al (2017) High-throughput phenotyping of maize leaf physiological and biochemical traits using hyperspectral reflectance. Plant Physiol 173:614–626. https://doi.org/10.1104/pp.16.01447
Article CAS PubMed Google Scholar
Zhu X-G, Long SP, Ort DR (2010) Improving photosynthetic efficiency for greater yield. Annu Rev Plant Biol 61:235–261. https://doi.org/10.1146/annurev-arplant-042809-112206
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank the technical help during the experiment of Mr. Robert Icenogle, Barry Dorman (USDA-ARS), Seth Johnston, and Mary Durstock (Crop Physiology Laboratory, Auburn University). The authors also would like to thank to Dr. Jose A. Jimenez Berni for statistical support to analyze the data. This research was supported by the Action CA17134 SENSECO (Optical Synergies for Spatiotemporal Sensing of Scalable Ecophysiological Traits) funded by COST (European Cooperation in Science and Technology, www.cost.eu). This research was also supported by Auburn University and Alabama Agricultural Experimental Station Seed Grant.

Funding

Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature.

Author information

Authors and Affiliations

Integrative Crop Ecophysiology Group, Plant Physiology Section, Faculty of Biology, University of Barcelona, 08028, Barcelona, Spain
Ma. Luisa Buchaillot, José Luis Araus & Shawn C. Kefauver
AGROTECNIO (Center for Research in Agrotechnology), Av. Rovira Roure 191, 25198, Lleida, Spain
Ma. Luisa Buchaillot, José Luis Araus & Shawn C. Kefauver
Instituto de Agrobiotecnología (IdAB), Consejo Superior de Investigaciones Científicas (CSIC)-Gobierno de Navarra, Av. Pamplona 123, 31192, Mutilva, Spain
David Soba & Iker Aranjuelo
Department of Crop, Soil, and Environmental Sciences, Auburn University, Alabama, USA
Tianchu Shu & Alvaro Sanz-Saez
Industrial Crops Research Institute, Henan Academy of Agricultural Sciences, Henan, China
Juan Liu
U.S. Department of Agriculture–Agricultural Research Service, National Soil Dynamics Laboratory, Auburn, AL, 36832, USA
G. Brett Runion & Stephen A. Prior

Authors

Ma. Luisa Buchaillot
View author publications
You can also search for this author in PubMed Google Scholar
David Soba
View author publications
You can also search for this author in PubMed Google Scholar
Tianchu Shu
View author publications
You can also search for this author in PubMed Google Scholar
Juan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Iker Aranjuelo
View author publications
You can also search for this author in PubMed Google Scholar
José Luis Araus
View author publications
You can also search for this author in PubMed Google Scholar
G. Brett Runion
View author publications
You can also search for this author in PubMed Google Scholar
Stephen A. Prior
View author publications
You can also search for this author in PubMed Google Scholar
Shawn C. Kefauver
View author publications
You can also search for this author in PubMed Google Scholar
Alvaro Sanz-Saez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Shawn C. Kefauver or Alvaro Sanz-Saez.

Additional information

Communicated by Dorothea Bartels.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 1179 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Buchaillot, M.L., Soba, D., Shu, T. et al. Estimating peanut and soybean photosynthetic traits using leaf spectral reflectance and advance regression models. Planta 255, 93 (2022). https://doi.org/10.1007/s00425-022-03867-6

Download citation

Received: 16 November 2021
Accepted: 03 March 2022
Published: 24 March 2022
DOI: https://doi.org/10.1007/s00425-022-03867-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Estimating peanut and soybean photosynthetic traits using leaf spectral reflectance and advance regression models

Abstract

Main conclusion

Abstract

Similar content being viewed by others

Estimating growth and photosynthetic properties of wheat grown in simulated saline field conditions using hyperspectral reflectance sensing and multivariate analysis

Application of Vegetative Indices for Leaf Nitrogen Estimation in Sugarcane Using Hyperspectral Data

Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data

Introduction

Materials and methods

Trial setup and design

Experiment 1: soybean cultivar response to elevated [CO2]

Experiment 2: soybean cultivar response to high night temperatures

Experiment 3: peanut cultivar response to drought

Physiological parameter assessments

Mid-day photosynthesis measurements

A–Ci curves

Relative chlorophyll content

Leaf spectral reflectance measurements

Statistical analysis of measured and estimate values

Results

Effect of abiotic stress and cultivar on photosynthetic parameters

Relationships between spectral signatures and photosynthetic parameters

Estimating photosynthetic parameters using field spectroscopy and advance regression models

Scaling up estimations of photosynthetic parameters for potential hyperspectral aerial or satellite applications

Discussion

Estimating photosynthetic parameters using field spectroscopy and advance regression models

Scaling up estimations of photosynthetic parameters for potential hyperspectral aerial or satellite applications

Conclusion and future directions

Author contribution statement

Data availability statement

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding authors

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 1179 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Experiment 1: soybean cultivar response to elevated [CO₂]