Full Parameterisation Matters for the Best Performance of Crop Models: Inter-comparison of a Simple and a Detailed Maize Model

Process-based crop growth models have become indispensable tools for investigating the effects of genetic, management, and environmental factors on crop productivity. One source of uncertainty in crop model predictions is model parameterization, i.e. estimating the values of model input parameters, which crop modellers carry out in very different ways. One simple (SSM-iCrop) and one detailed (APSIM) maize (Zea mays L.) model were partially or fully parameterized using observed data from a 2-year field experiment conducted in 2016 and 2017 at the UFT (Universitäts- und Forschungszentrum Tulln, BOKU) in Austria. Model initialisation was identical for both models and based on field measurements. Partial parameterization (ParLevel_1) was first performed by estimating only those parameters related to crop phenology. Full parameterization (ParLevel_2) was then conducted by estimating parameters related to phenology plus those affecting dry mass production and partitioning, nitrogen uptake, and grain yield formation. With ParLevel_1, both models failed to provide accurate estimates of LAI, dry mass accumulation, nitrogen uptake and grain yield, but the performance of APSIM was generally better than that of SSM-iCrop. Full parameterization greatly improved the performance of both crop models, but it was more effective for the simple model, so that SSM-iCrop performed equally well or even better than APSIM. It was concluded that full parameterization is indispensable for improving the accuracy of crop model predictions regardless of whether the models are simple or detailed. Simple models seem to be more vulnerable to incomplete parameterization, but they respond better to full parameterization. This needs confirmation by further research.


Introduction
Dynamic process-based crop models simulate crop development and growth processes in response to climatic variables, soil conditions, management factors, and cultivar-specific genetic characteristics. The major processes simulated in crop models include crop phenological development and dry mass growth, leaf area development, light interception and utilisation, crop demand for water and nutrients, yield formation, dynamics of soil water, carbon, and nitrogen (N), evapotranspiration, and the effect of atmospheric carbon dioxide concentrations (CO₂) on photosynthesis and water-use efficiency. Due to these capabilities, crop models have widely been used in applications as diverse as optimising the application of fertilizers for improved crop productivity and environmental benefits (Moeller et al. 2007, 2013; Manschadi et al. 2014; Tang et al. 2018), estimating crop water requirements and optimizing irrigation scheduling (e.g. Dalla Marta et al. 2018; Gobin et al. 2017), modelling the value of adaptive plant traits to support crop improvement (Manschadi et al. 2006; Chenu et al. 2018; Hammer et al. 2019), operational crop yield forecasting (Van der Velde and Nisini 2018), and assessing impacts of climate change and variability on crop growth and yield formation at regional, national, and global scales (White et al. 2011; Rötter et al. 2012; Thaler et al. 2012; Eitzinger et al. 2013a; Devkota et al. 2013; Asseng et al. 2013; Rosenzweig et al. 2014; Ebrahimi et al. 2016).
Crop models commonly require a large number of input data including daily weather parameters (such as temperature, precipitation and solar radiation), detailed soil properties and initial soil conditions, cultivar-specific genetic parameters, and information on crop management practices. The sources of uncertainty in crop model projections include (1) model input data (e.g. soil physical properties and initial water, carbon, and N conditions), (2) model parameterization (i.e. inadequate estimation of cultivar-specific phenological development and parameterization of crop growth-related processes), (3) model structure (inadequate scientific representation of crop response to temperature, water and nutrient stress, [CO₂] levels and any other crop growth limiting factors or combination of those), and (4) human errors in set-up and configuration of simulation runs, crop and soil parameterization, and interpretation and communication of simulation results (Palosuo et al. 2011; Rötter et al. 2011; Eitzinger et al. 2013b; Ewert et al. 2015).
A common approach in assessing uncertainty in crop models is the use of model inter-comparisons. That involves repeating the same simulation with a selection of crop models, which differ in their representation of processes determining crop responses to growing conditions, and evaluating their outputs for a range of scenarios (Palosuo et al. 2011; Rötter et al. 2011; Asseng et al. 2013; Bassu et al. 2014). Such crop model inter-comparisons are similar to ensembles of climate models, which have been used to address the uncertainty associated with projecting climate scenarios due to, for instance, uncertainties in predicting the trajectories of future greenhouse gas emissions (Knutti and Sedlácek 2013).
It is also widely recognised that an adequate level of parameterization, i.e. obtaining parameters governing crop phenology and growth processes, is indispensable to reduce the uncertainty and improve the accuracy of crop model simulations (Rötter et al. 2011; Wallach et al. 2011; Bassu et al. 2014; Salo et al. 2016). It is common to use calibration as a synonym for parameterization. Crop model parameters can be obtained from the literature, in-situ field measurements, and/or a calibration procedure. Thus, calibration is one possible way of model parameterization, in which the model is tested indirectly by trying different values for one or several parameters and comparing the simulated higher-level variables, such as crop yield or leaf area, with measurements or observations, which are more often available for such variables. The parameter value whose simulated result most closely matches the observed variables is selected as the "calibrated" parameter estimate. Sinclair and Seligman (1996) criticized the incorrect use of 'calibration' as being synonymous with parameterization. The term calibration is better limited to those cases in which one or several parameters are estimated or adjusted by "calibrating" the whole model to achieve outputs that match relevant observations. In many cases, calibration is performed because the parameters concerned are not easily measurable or are unavailable (Grassini et al. 2015).
The performance of crop models in simulating crop responses to climatic, soil, genetic, and management factors depends largely on the level of detail in the input information available for model parameterization. For instance, Bassu et al. (2014) demonstrated that a low-level model parameterization based on a set of minimal information (e.g. soil characteristics, management inputs, and crop phenology) can lead to high uncertainty in simulated responses of crop yield to climate change, whereas parameterising crop models to simulate more accurately the time series of above-ground dry mass, leaf area index, soil water and N contents, and plant N uptake can significantly reduce variation among crop models and improve the accuracy of simulation results. In a recent inter-comparison of multiple global gridded crop models, Rosenzweig et al. (2014) showed that considering the effects of N stress on crop growth processes results in much more severe projected impacts from climate change, with significant implications for designing adaptation options. This is because crop N acquisition and biological N fixation will be negatively affected by climate change-induced drought stress combined with higher temperatures (StClair and Lynch 2010; Devkota et al. 2013).
The objective of this study was to compare the effect of two parameterization levels on the performance of the APSIM (Agricultural Production Systems sIMulator, Holzworth et al. 2014) and SSM-iCrop (Soltani and Sinclair 2012, 2015) models for simulating leaf area development, plant N uptake and partitioning, dry mass growth, and yield formation using detailed experimental data from two maize experiments conducted in a temperate central European climate.

Field Experiments
Plant and soil data were collected from two field experiments conducted in 2016 and 2017 at the UFT (Universitäts- und Forschungszentrum Tulln, BOKU), Lower Austria (48°19′ N, 16°04′ E, 178 m a.s.l.). Maize (Zea mays L., cv. P8400) was sown on 9 and 11 May in 2016 and 2017, respectively. The soil at the experimental site is classified as a chernozem with silty loam in the topsoil. The detailed water holding characteristics and organic matter content of the soil profile are presented in Tables 2 and 3. The mean annual precipitation of the site is 625 mm and mean annual temperature is 9.7 °C (reference period 1981-2010) with a distinct seasonal change of both temperature and precipitation, which reach their maximum in the summer (continental climate type).
Experimental design was a randomized split-plot with four replications. The setup included five different N fertilization levels: 0 (N0), 40 (N1), 80 (N2), 120 (N3), 160 (N4) kg/ha, applied in two split applications at the two-leaf and two-node phenological stages of maize. In each plot (10 × 6 m), 8 rows of maize were planted with a row spacing of 0.75 m and a plant density of 7.5 plants m⁻². Due to very high initial soil mineral nitrogen (Nmin) in 2016, the experiment in 2017 was set up on the exact same area and plot arrangement as in 2016 to make use of lower initial soil Nmin contents in the N0 plots. In 2017, the experiment was irrigated three times (21 June, 5 July, and 20 July) with a total of 65 mm water to avoid severe drought stress. Pests, diseases, and weeds were controlled with common on-station measures.
In both seasons, three randomly chosen plants per plot were marked for weekly scoring of phenology using the growth scale BBCH (Biologische Bundesanstalt, Bundessortenamt and CHemical industry; Meier 2001) and main-stem leaf number. Sequential destructive plant samples (8 plants per plot) were taken for measuring crop dry mass, leaf area development, N uptake, and grain yield. Plant samples were separated into senesced and photosynthetically active (green) leaf blades, stems (including leaf sheaths), ears, and grains. A subsample of green leaves was taken to measure leaf area (LI-3100C Area Meter, LI-COR, USA). All samples were dried at 60 °C for 72 h to determine tissue dry mass.
At each plant sampling date, gravimetric soil water content was also determined in 0-10, 10-30, 30-60, 60-90, and 90-120 cm soil layers in all plots of two replications by drying samples at 105 °C for 72 h. The soil bulk density in the corresponding soil layers was determined by digging a pit and sampling horizontally into the pit wall using an open-ended, thin-walled sampling ring of 70 mm diameter and 50 mm length. The gravimetric soil water measurements were converted to volumetric water content using the bulk density values. The drained upper limit of the soil was measured using the ponded method (Dalgliesh and Foale 1998). The crop lower limit of soil water extraction was estimated by preventing the infiltration of rainfall into the soil during the grain-filling phase of maize. For this, an area of 6 m² in the border rows of the maize experiment in 2017 was covered with an opaque vinyl cover and gravimetric soil water was determined at crop harvest. In addition, soil samples taken at maize sowing, silking, and physiological maturity were sent to a commercial soil laboratory for measuring Nmin (NO₃⁻ and NH₄⁺) and organic carbon.
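The conversion of gravimetric to volumetric soil water content described above can be sketched as follows. This is a minimal illustration rather than the authors' procedure: the function names and example values are hypothetical, and the density of water is assumed to be 1 g/cm³.

```python
# Sketch: convert gravimetric water content (g water per g dry soil) to
# volumetric water content (cm3 water per cm3 soil) using bulk density.
# All names and example numbers are illustrative assumptions.

WATER_DENSITY = 1.0  # g/cm3, assumed


def volumetric_water_content(gravimetric, bulk_density, water_density=WATER_DENSITY):
    """theta_v = theta_g * bulk density / water density."""
    return gravimetric * bulk_density / water_density


def layer_water_mm(theta_v, thickness_cm):
    """Water stored in a soil layer (mm) for a given layer thickness (cm)."""
    return theta_v * thickness_cm * 10.0


# Hypothetical 10-30 cm layer: 0.20 g/g water, bulk density 1.4 g/cm3
theta_v = volumetric_water_content(0.20, 1.4)
stored = layer_water_mm(theta_v, 20.0)
```

Expressing each layer in mm of water makes the measurements directly comparable with the layered soil water balance of the two models.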

Statistical Analysis
Analysis of variance (ANOVA) was carried out using the GLM (General Linear Model) procedure of SAS (Statistical Analysis System) package (SAS-Institute 2008). Significant differences in the mean values were determined by Tukey's HSD (honest significant difference) test at a significance level of 0.05. Graphs were produced using SigmaPlot (Version 14.0, Systat Software Inc., San Jose, USA).

Description of Crop Models
APSIM is a detailed crop model compared to the simple model SSM-iCrop. The Agricultural Production Systems sIMulator (APSIM) is internationally recognised as one of the most advanced and comprehensive crop/cropping system models for simulating the effects of genetic factors, environmental variables, and management decisions on production (crops, pasture, trees, livestock), profits, and environmental processes (e.g. soil erosion, nitrate leaching) (https://www.apsim.info, Holzworth et al. 2014). APSIM incorporates a generic crop model, which utilizes a library of routines for simulating growth and development processes for more than 20 crop species including maize (Soufizadeh et al. 2018). APSIM has been used in a broad range of applications including supporting on-farm decision making, designing farming systems for production or resource-management objectives, guiding crop breeding strategies, assessing risk for government policy-making, and evaluating management options for adaptation to climate change and variability (Manschadi et al. 2006; Moeller et al. 2007, 2013; Hochman et al. 2009; Huth et al. 2010; Lobell et al. 2015; Hammer et al. 2010, 2019; Wu et al. 2019).
Simple simulation models (SSM) are a group of crop models based on Sinclair's approach to crop modelling (Sinclair 1986; Soltani and Sinclair 2012; Sinclair et al. 2020). The development and application of the models dates back to 1986, when a soybean model was developed (Sinclair 1986). The modelling framework was then improved and applied over the past 35 years to nearly all major grain crops including maize (Sinclair and Muchow 1995). A complete description of SSM principles and procedures can be found in Soltani and Sinclair (2012). The model includes the key physiological processes to simulate crop responses to radiation-, water-, and nitrogen-limited conditions. SSM-iCrop (Soltani et al. 2013; hereafter referred to as iCrop) is a research version of SSM that was first developed for wheat. iCrop uses identical sub-models to simulate crop processes in all grain crops except for phenology, which can have its own specific sub-model depending on the crop species under consideration. It uses a layered soil, simulates N fixation in grain legumes, and includes several physiological concepts to evaluate genetic improvement, such as limited transpiration rate (Messina et al. 2015) and drought survival (Ghanem et al. 2015). Sinclair et al. (2020) recently reviewed the application of SSM models in different crops.
A brief description of major crop processes in maize simulated by both models is provided here. Crop phenological development in both models is predicted as a function of temperature, photoperiod and water deficit. iCrop phenology sub-model is based on Kiniry (1991). APSIM simulates delay in phenology due to N stress but iCrop assumes no effect on phenology of N stress.
APSIM uses a detailed method to simulate leaf area development from prediction of leaf number and the size of individual leaves and adjusting it for water and N stresses. iCrop, however, simulates leaf area development using a simple approach based on the relationship between plant leaf area and main stem leaf (node) number and then adjusts it for water and N stresses. Phyllochron is fixed in iCrop but depends on the development stage in APSIM. New leaf expansion in APSIM is governed by bounding values of specific leaf area, so that leaves cannot be grown to be extremely 'thin' or 'thick'. This is not simulated in iCrop.
Dry mass production in iCrop is based on the radiation use efficiency (RUE) concept, in which RUE is adjusted for the current day's temperature and water deficit. The RUE concept is also used in APSIM, but only to compute dry mass production when water is not limiting. Under water-limited conditions, dry mass production is obtained from the water available for crop transpiration and the transpiration efficiency; hence water uptake by the roots from the soil needs to be simulated. RUE in APSIM is also N dependent, but it is not in iCrop.
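The contrast between the two approaches to daily dry mass production can be sketched as follows. This is a simplified illustration under stated assumptions, not the code of either model: the Beer's-law light interception with extinction coefficient `k`, the function names, and all numerical values are hypothetical, and the N dependence of RUE in APSIM is omitted.

```python
import math

# Sketch of two ways to compute daily dry mass production (g/m2/day).
# Names, the Beer's-law interception, and the default k are assumptions.


def intercepted_radiation(radiation, lai, k=0.5):
    """Radiation intercepted by the canopy (MJ/m2/day), assuming Beer's law."""
    return radiation * (1.0 - math.exp(-k * lai))


def growth_rue_scaled(radiation, lai, rue, temp_factor, water_factor):
    """iCrop-like idea: RUE scaled by temperature and water deficit factors (0-1)."""
    return rue * temp_factor * water_factor * intercepted_radiation(radiation, lai)


def growth_min_light_water(radiation, lai, rue, water_supply_mm, transp_eff):
    """APSIM-like idea: the smaller of light-limited and water-limited growth.

    transp_eff is dry mass produced per mm of transpired water (g/m2/mm).
    """
    light_limited = rue * intercepted_radiation(radiation, lai)
    water_limited = water_supply_mm * transp_eff
    return min(light_limited, water_limited)
```

In the second function the water supply term is what ties growth to simulated root water uptake, which is why APSIM needs an explicit uptake routine.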
Simulation of dry mass partitioning is much simpler in iCrop and, before silking, is a function of the allometric relation between leaf and stem dry mass. Between silking and the beginning of seed growth, a fixed, negligible fraction of produced dry mass goes to leaves and the remainder is allocated to the stem. Following the start of grain filling, all produced dry mass is allocated to the developing seeds and any possible leftover is partitioned to the stem. Ears without grains are not simulated by iCrop. APSIM uses dry mass partitioning coefficients to leaves, stem, and cobs that depend on the development stage of the crop. Between emergence and flag leaf appearance, the fraction of dry mass that is provisionally allocated to the leaves decreases as the number of fully expanded leaves increases. Between tassel initiation and flag leaf appearance, the dry mass remaining after allocation to leaves is partitioned between stem and developing ears in a fixed ratio. After flag leaf appearance, dry mass is partitioned between stem and ears only, until partitioning to the grain starts at the onset of grain filling. While iCrop does not simulate root dry mass, APSIM accounts for root dry mass as a fraction of top dry mass and the fraction is specified for each growth stage.
Yield formation in iCrop is simulated based on a linear increase in harvest index, but the actual daily grain growth rate is limited to current crop dry mass production plus the daily rate of dry mass translocated from vegetative organs. The translocation occurs only if daily dry mass production is not enough to support the seed growth rate calculated from the slope of the linear increase in harvest index. Yield formation in APSIM is more detailed and relies on the simulation of grain number and grain size. Grain number is related to crop growth between tassel initiation and the beginning of seed growth using a genetic coefficient. Grain size depends on grain growth rate, the effective grain-filling period, and dry mass re-translocation. If grain mass demand for a day exceeds the daily increase in dry mass, the shortfall will first be met through translocation from the stem and, if that is insufficient to meet the demand of the grain, through translocation from leaves.
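The harvest-index-based logic described for iCrop can be sketched as follows. This is an illustrative simplification, not the model's source code: the demand term is assumed here to be the daily change in the product of harvest index and total dry mass, and the function name, argument names, and example numbers are hypothetical.

```python
# Sketch of daily grain growth driven by a linearly increasing harvest index (HI).
# Assumption: grain demand is the daily change in (HI * total dry mass).


def daily_grain_growth(hi, pdhi, total_dm, daily_dm, translocatable_dm):
    """Return (grain growth, translocation, dry mass left for the stem), all in g/m2/day.

    hi                current harvest index (-)
    pdhi              slope of the linear HI increase (per day)
    total_dm          current above-ground dry mass (g/m2)
    daily_dm          dry mass produced today (g/m2/day)
    translocatable_dm dry mass available for translocation from vegetative organs (g/m2)
    """
    demand = pdhi * total_dm + hi * daily_dm
    # Translocation is only triggered when today's production cannot meet demand.
    shortfall = max(demand - daily_dm, 0.0)
    translocation = min(shortfall, translocatable_dm)
    grain_growth = min(demand, daily_dm + translocation)
    # Any production beyond the grain demand is partitioned to the stem.
    to_stem = max(daily_dm - demand, 0.0)
    return grain_growth, translocation, to_stem
```

For example, with a hypothetical `hi=0.2`, `pdhi=0.015`, `total_dm=1500` and `daily_dm=20`, demand exceeds production, so part of the grain growth is met by translocation as long as translocatable dry mass remains.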
The basics of plant N balance simulation are more or less the same in both models: before seed growth, daily demand for N is computed from new growth in LAI and stem and their critical N concentration. The demand is then adjusted for the maximum rate of N uptake and the amount of soil N available for crop uptake. When the N uptake rate does not fully meet the demand, the following responses occur sequentially. First, the concentration of N (N%) in the stem is decreased until stem N% reaches its minimum. Second, when stem N% reaches its minimum, leaf area development is inhibited and stem growth at minimum N% is continued, as structural stem mass is required to support leaf growth. Third, leaves are senesced to provide N for stem growth with minimum N%. Thus, leaf senescence may occur during vegetative growth. After the start of seed growth, daily N demand is obtained from seed growth rate multiplied by seed critical N%, but it is limited to daily extractable N from vegetative tissue plus soil N uptake. If N uptake from the soil is not sufficient to support seed growth, N is translocated from the leaves and stems to the seeds. N translocation from leaves results in leaf senescence. iCrop treats this concept more simply and needs fewer parameters than APSIM. iCrop relies on specific leaf N (SLN) in green and senesced leaves and N% in green and senesced stems, but APSIM works with minimum, critical and maximum SLN and stem N%. Minimum, critical and maximum stem N% in APSIM vary with development stage. While APSIM permits N dilution in leaves, this is not simulated in iCrop and any shortage in leaf N results in leaf senescence. In APSIM, N translocation to the grains is initially taken from the stem (plus cobs), and if this becomes insufficient then N translocation from the leaf occurs. In iCrop, N translocation from leaf and stem is proportional to their relative translocatable N.
Both models simulate soil water dynamics in multiple layers using a one-dimensional approach (cascade method). Water addition from rainfall or irrigation and water removal due to run-off, evaporation, transpiration (uptake) and drainage are accounted for. The procedures are more detailed in APSIM, however, as it predicts water uptake by the roots. Both models simulate the daily increase in effective rooting depth from a potential daily rate of increase that is adjusted for plant and soil conditions. iCrop computes water uptake as a function of calculated daily transpiration, but APSIM calculates water uptake based on soil water content via an exponential function, parameterized via an extraction decay constant (kl) that incorporates the effects of both soil hydraulic conductivity and root length density on water uptake. The value of kl must be determined for each soil layer. APSIM simulates upward movement of water, which is not simulated by iCrop. In iCrop, three water deficit factors are calculated from the fraction of available (transpirable) soil water in the crop root zone and are used to adjust dry mass production, leaf area expansion, and phenological development for water shortage. APSIM uses the ratio of actual growth under limited water to potential growth without water limitation to correct crop responses to water deficit.
Similarly, soil N balance in each layer is calculated by both models, in which N addition due to mineralization and N fertilizer application and N removal due to volatilization, leaching and crop uptake are taken into account. Again, the methods used by APSIM are more detailed. For example, iCrop does not simulate soil temperature and air temperature is used instead. Soil N mineralization of organic N to ammonium (NH₄⁺) and the subsequent transformation to nitrate (NO₃⁻) is modelled as one transformation in iCrop, while these are treated as separate processes in APSIM. Soil organic N is considered as a single pool in iCrop but APSIM deals with several pools including fresh and permanent pools.
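The role of the extraction decay constant kl can be illustrated with a short sketch. It assumes that the potential daily supply from a fully rooted layer is the fraction kl of the water held above the crop lower limit, which yields an exponential decline of available water over time when no water is added. The function names and numbers are hypothetical and the sketch is not APSIM source code.

```python
# Sketch of a kl-based potential water supply per soil layer (mm/day).
# Assumes all listed layers are fully explored by roots.


def potential_water_supply(sw, ll, kl, thickness_mm):
    """Per-layer potential supply from volumetric water content (sw), crop lower
    limit (ll), extraction constant (kl, per day) and layer thickness (mm)."""
    return [k * max(s - l, 0.0) * t for s, l, k, t in zip(sw, ll, kl, thickness_mm)]


def available_water_after(days, initial_available_mm, kl):
    """Available water left in a layer after extracting the fraction kl each day,
    with no rainfall, drainage, or evaporation (an exponential-type decline)."""
    return initial_available_mm * (1.0 - kl) ** days


# Hypothetical two-layer profile; the second layer is already at its lower limit.
supply = potential_water_supply([0.30, 0.25], [0.15, 0.25], [0.08, 0.06], [300.0, 300.0])
```

Because kl lumps root length density and soil hydraulic conductivity into one number per layer, it has to be specified layer by layer, as noted above.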

Initialization and Parameterization of Crop Models
To eliminate, as far as possible, the differences between the two models in simulated soil water and N supply, which interact strongly with crop canopy development and N uptake, the management and soil input information in both models was set to exactly what was done or measured in the field experiments (see details in the "Results" section).
In terms of parameterization, first only the parameters related to crop phenology and leaf appearance on the mainstem were parameterized to ensure very similar simulations of crop phenological development and main-stem leaf number production from both models. This is referred to as partial parameterization (ParLevel_1). In the next step, the observed experimental data were used to derive the genetic parameters for dry mass production and partitioning, N uptake, and grain yield formation. This is referred to as full parameterization (ParLevel_2). It should be noted that in APSIM, crop parameters are divided into two parts; the larger fraction consists of parameters that are normally constant for all cultivars and the smaller fraction (i.e. cultivar-specific) includes those that can be modified for new cultivars. The cultivar-specific parameters of APSIM are mainly phenological parameters and parameters related to yield formation (potential grain size). In iCrop, however, there is not such a distinction and all genetic model parameters can be modified.

Field Experiments
The cumulative rainfall during the maize growing season (May-September) in 2016 (379.3 mm) and 2017 (360.9 mm) was slightly lower than the long-term average in the same period (393 mm). Compared to 2016, the temporal pattern of precipitation in the drier season 2017 was less favourable for crop growth. The total precipitation received in May and June of 2017 was 38% lower than that in 2016 (193.5 mm) (Fig. 1). Therefore, 65 mm irrigation water was applied in 2017 to avoid severe drought stress. The average monthly temperature was similar in both growing seasons, except for June and August 2017, which were warmer than those in 2016 (Fig. 1).
Due to more favourable growing conditions, the average maize total above-ground dry mass across all N treatments in 2016 was significantly higher than that in 2017 (Table 1). The difference in average grain yield in 2016 (1154 g/m²) and 2017 (1008 g/m²) was also statistically significant. Similarly, the total N uptake, grain N content, thousand seed weight, LAI at silking, and maximum main-stem leaf number were all higher in 2016 (Table 1). In both seasons, maize yield in unfertilised (N0) plots was significantly reduced but increasing the N fertiliser rate from 40 to 160 kg/ha did not result in significant yield improvement (Table 1). Similarly, the yield components (grain number and thousand seed weight) and total above-ground dry mass were significantly reduced in unfertilised (N0) plots.
Fig. 1 Cumulative monthly rainfall (bars) and average monthly temperature (points) in maize growing seasons 2016 and 2017 at Tulln, Austria, compared to long-term historical data
The total above-ground N uptake in 2016 followed the pattern observed for grain yield and dry mass. In 2017, however, maize plants responded to increasing N supply by accumulating more N, reaching a maximum of 26.6 g/m² in N4 plots (Table 1). On average, 73% of the total accumulated N was partitioned to grains, with no effect of N fertilization. Nitrogen stress resulted in a significant reduction in LAI of crops in unfertilised plots in both seasons, but crop phenology and main-stem leaf number were not affected by N treatment. Other physiological traits, such as specific leaf nitrogen (SLN, g/m²) and tissue N concentration, showed significant responses to N supply (data will be presented in the sections comparing observed with simulated results).

Initialization of Crop Models
Careful initialisation was necessary to ensure that simulation results vary only because of parameterization. Thus, the soil input data in both models were first set to the same values, which were derived from the ponding experiment and measurements conducted during the two growing seasons (Table 2). Second, the potential evapotranspiration and soil evaporation simulated by both models were compared to ensure similar levels of water losses from the soil (Fig. 2). Third, the initial parameters for simulating the soil N dynamics were adjusted in both models. For the initial soil mineral N content, the measured Nmin data per layer were used. The data on measured soil organic C were used to estimate soil organic N in each layer assuming a C:N ratio of 10:1 (BMLFUW 2017). The parameters FBiom (fraction of the more labile, soil microbial biomass and microbial products) and FInert (the rest of the soil organic matter) for APSIM and FMIN (fraction of soil organic N available for mineralisation) for iCrop were determined based on a simple N balance calculation for the N0 treatment in 2017. These data allowed the assessment of potential soil N supply to an unfertilised maize crop. The soil N input parameters used in APSIM and iCrop models are summarised in Table 3.
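The estimate of soil organic N from measured organic C with a C:N ratio of 10:1 can be sketched as below. The conversion to kg/ha from a concentration, bulk density, and layer thickness is a standard mass calculation; the function name and the example layer values are hypothetical and are not taken from Table 3.

```python
# Sketch: soil organic N per layer (kg/ha) from organic C (% by mass),
# bulk density (g/cm3), and layer thickness (cm), assuming C:N = 10:1.


def soil_organic_n_kg_ha(organic_c_percent, bulk_density_g_cm3, thickness_cm, cn_ratio=10.0):
    # Soil mass per hectare: 1 g/cm3 over 1 cm depth equals 100,000 kg/ha.
    soil_mass_kg_ha = bulk_density_g_cm3 * thickness_cm * 100_000.0
    organic_c_kg_ha = soil_mass_kg_ha * organic_c_percent / 100.0
    return organic_c_kg_ha / cn_ratio


# Hypothetical topsoil layer: 1.5% organic C, bulk density 1.4 g/cm3, 10 cm thick
organic_n = soil_organic_n_kg_ha(1.5, 1.4, 10.0)
```

Only a fraction of this organic N is treated as available for mineralisation, which is what the FMIN parameter in iCrop and the FBiom and FInert parameters in APSIM control.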

Partial Parameterization (ParLevel_1)
For the partial parameterization, first the model input parameters related to simulation of crop phenology were adjusted based on the observed data for cv. P8500 (Table 4). Both models simulated the dates of emergence, silking, and physiological maturity of maize plants within 2-3 days of observed dates in 2016 and 2017 seasons (data not shown). Second, the default values for leaf appearance rate on the main-stem (phyllochron) were modified in both models based on field data in order to minimise its effect on the simulation of leaf canopy development. Adjusting the phyllochron resulted in good prediction of leaf appearance on the main-stem in both growing seasons by both models (Fig. 4).
With ParLevel_1, the simulation results for total above-ground dry mass, LAI, and plant N uptake from both models were not satisfactory. For unfertilised crops, iCrop simulated the temporal pattern of dry mass accumulation and N uptake very well but substantially underestimated the growth and N accumulation of fertilised plants (Fig. 5). APSIM, on the other hand, provided good simulation of dry mass and LAI under fertilised conditions but underestimated N uptake in both fertilised and unfertilised plants. Although the observed data indicated a significant effect of N supply on LAI at silking (Table 1), neither model was capable of accounting for this accurately (Fig. 5). iCrop, in particular, was not able to simulate the interaction between N supply and leaf area development. With ParLevel_1, iCrop underestimated the total plant N uptake and grain yield of fertilised crops considerably (Fig. 6). APSIM also underestimated N uptake across all N treatments, but the simulated yields matched the observed data well (Fig. 6).

Table 4 Cultivar-specific input parameters for maize cv. P8500 used for the partial parameterisation (ParLevel_1) of iCrop and APSIM
iCrop
Phyllochron (°Cd leaf⁻¹): default 50; modified 38.9
Biological days from sowing to emergence (bdSOWEMR, bd): 3.0
Biological days from emergence to end of juvenile (bdEMREJU, bd): 7.0
Biological days from silking/end of leaf growth on main-stem to physiological maturity (bdSILPM, bd): 32.5
APSIM
Phyllochron as a function of leaf number (leaf_app_rate, °Cd): default 65 at leaf 1 and 36 at leaf 8; modified 36 at leaf 1 and 36 at leaf 8
Thermal time from emergence to end of juvenile (tt_emerg_to_endjuv, °Cd): 310
Thermal time from silking/flowering to start of grain filling (tt_flower_to_start_grain, °Cd): 170
Thermal time from silking/flowering to physiological maturity (tt_flower_to_maturity, °Cd): 920
For iCrop, the default values were those reported in Soltani and Sinclair (2012)

Full Parameterization (ParLevel_2)
In the next step of parameterization, the input parameters related to leaf canopy development, dry mass partitioning, plant N uptake, and grain filling were modified based on experimental data (for the values of parameters see Table 5). Simulation of LAI in iCrop depends on the coefficient PLAPOW, which describes the power relationship between plant leaf area and main-stem leaf (node) number. This parameter was derived from experimental data (Fig. 7). Furthermore, the parameters describing the biphasic pattern of dry matter partitioning to the leaves during vegetative growth in iCrop were calculated by plotting leaf dry matter against total dry matter (Fig. 8). The leaf partitioning coefficients at lower (FLF1A) and higher (FLF1B) levels of total crop mass and the inflection point between the stages (WTOPL) were 0.6, 0.16, and 239.4 g/m², respectively. A detailed parameterization of leaf canopy development in APSIM was not performed because these parameters (1) are considered species-specific, and (2) require measurement data on vertical profiles of leaf size and number in the crop canopy, which were not available because plants were sampled on a unit area basis.
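The biphasic leaf partitioning described above can be written as a piecewise-linear relation between leaf and total dry mass. The sketch below uses the three values reported in the text (FLF1A, FLF1B, WTOPL); the function names are hypothetical, and the relation is assumed to apply only during vegetative growth.

```python
# Sketch of biphasic dry mass partitioning to leaves during vegetative growth,
# using the parameter values reported in the text.

FLF1A = 0.6    # leaf fraction of new dry mass below the inflection point
FLF1B = 0.16   # leaf fraction of new dry mass above the inflection point
WTOPL = 239.4  # total dry mass at the inflection point (g/m2)


def leaf_fraction(total_dm):
    """Fraction of newly produced dry mass allocated to leaves."""
    return FLF1A if total_dm < WTOPL else FLF1B


def leaf_dm(total_dm):
    """Cumulative leaf dry mass (g/m2) implied by the two partitioning phases."""
    if total_dm <= WTOPL:
        return FLF1A * total_dm
    return FLF1A * WTOPL + FLF1B * (total_dm - WTOPL)
```

Plotting `leaf_dm` against total dry mass gives two straight segments that meet at WTOPL, which is the shape fitted to the observations in Fig. 8.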
Both iCrop and APSIM use the specific leaf nitrogen (SLN) approach for simulating the response of leaf canopy development to N availability. Experimental data on SLN showed a declining pattern from 34 days after sowing (DAS) in all N treatments. Nitrogen fertilization had a significant effect on SLN. The average SLN from 34 to 63 (silking) DAS was 1.75, 1.90, and 2.15 g/m 2 in N0, N1, and N4 treatment, respectively (Fig. 9). The default value for target or critical SLN in both models was set to 1.9 g N/m 2 from the N1 treatment because higher values of SLN and total plant N uptake in N3 and N4 treatments did not result in significant improvement in crop growth and grain yield. The SLN of senesced leaves in both models was set to the observed value of 0.3 g/m 2 measured in N0 plots. Similar to SLN, stem N% declined during the crop growth period with N0 plants showing consistently lower stem N% until silking (63 DAS) (Fig. 9). The average maximum stem N% was 0.0425 g/g at 21 DAS and decreased to 0.0042 g/g at crop harvest. The target or green stem N% in both models was set to 0.0333 g/g measured at 34 DAS in N1 plants. Stem N% at harvest (124 DAS) was on average 0.0042 g/g and was not affected by N fertilization. The minimum stem N% in N0 plants (0.0032 g/g) was similar to default values for structural stem N% in APSIM. For N% in senesced stem, the iCrop code was modified to allow setting two values: a higher value from emergence to begin grain growth (SNCS1) and a lower value for the grain-filling period (SNCS2) ( Table 5). This resulted in reduction in leaf area due to N stress during the vegetative growth as measured in the experiments.
Grain N% in maize ranges from 0.0076 to 0.0166 g/g (Tenorio et al. 2019). In the current study, grain N% was 0.0114 and 0.0176 g/g in the N0 and N4 treatments of 2017, respectively. As iCrop uses only one value each for minimum (GNCmin) and maximum (GNCmax) grain N% during the grain-filling period, GNCmin and GNCmax had to be set to 0.0090 and 0.0300 g/g, respectively, to match the simulated grain N% with observed data (see Table 5). Although the chosen values differed from the means observed in the N0 and N4 treatments, they were not in disagreement with observations in individual experimental plots.
Both crop models have a parameter that limits the daily rate of plant N uptake. In iCrop, the maximum uptake rate of nitrogen (MXNUP) is expressed in g/m²/day, whereas in APSIM the corresponding parameter (maxUptakeRate) is a function of degree-days. The value calculated for this parameter from the plant N uptake data between 34 and 63 DAS in the N4 treatment was lower than the default value in iCrop and higher than the default value in APSIM (see Table 5).
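One plausible way to derive a maximum daily uptake rate of this kind is the mean rate of N accumulation between two sampling dates. The sketch below assumes this simple difference quotient; the function name is hypothetical, and the text does not state the exact calculation used. For a parameter expressed per unit thermal time, as in APSIM, the denominator would be accumulated degree-days rather than days.

```python
def mean_n_uptake_rate(n_start, n_end, das_start, das_end):
    """Mean N uptake rate (g/m2/day) between two sampling dates.

    n_start, n_end: plant N uptake (g/m2) at the two sampling dates
    das_start, das_end: sampling dates in days after sowing (DAS)
    """
    if das_end <= das_start:
        raise ValueError("das_end must be later than das_start")
    return (n_end - n_start) / (das_end - das_start)
```

For example, with purely hypothetical uptake values of 2.0 and 10.7 g/m² at 34 and 63 DAS (not measurements from the experiment), the function returns about 0.3 g/m²/day.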
The simulated N uptake dynamics for ParLevel_1 showed that APSIM ceases N uptake quite early in the reproductive phase, resulting in underestimation of final N uptake in all N treatments (see Fig. 5). Therefore, the timing of N uptake cessation in APSIM (nUptakeCease) was changed from the default value of 523 to 700 °Cd, which resulted in N uptake continuing until 5 weeks after silking in the maize experiments of the current research. This is supported by Soufizadeh et al. (2018), who reported that maize plants continue N uptake until 4-6 weeks after silking.
Grain yield in APSIM is simulated as the product of grain number and grain size (Soufizadeh et al. 2018). In ParLevel_1 simulations, APSIM predicted on average 3356.3 grains per m² across all N treatments, whereas the average observed grain number was 3899.9. To account for this underestimation, the default value of the parameter GNk was increased to 1.7. With this modification, the simulated grain number of fertilised plants (3992.2) matched the observed data (3899.9) well. Similarly, the potential slope of harvest index (PDHI) in iCrop was increased to improve the simulated grain yield of fertilised crops (see Table 5).

Fig. 5 Maize total above-ground biomass, leaf area index (LAI), and nitrogen (N) uptake measured (symbols) and simulated (curves) by iCrop and APSIM with partial parameterization under 0 (N0) and 160 (N4) kg/ha of nitrogen fertilization in 2017; bars indicate standard error of measured values
Following the modifications made in both models (see Table 5), the ParLevel_2 parameterization improved the iCrop and APSIM simulations of dry mass production, LAI, and N uptake in all N treatments (Fig. 10). Similarly, compared to ParLevel_1, simulations of total N uptake and grain yield by both models were improved substantially (Fig. 11). Improvements in the simulation of plant N uptake resulted in accurate predictions of dynamics of soil mineral N (Nmin) content in both unfertilised and fertilised plots (Fig. 12).

Discussion
The field experiments were relatively successful in providing a range of data on total dry mass, grain yield, yield components, N uptake and LAI (Table 1). For these variables, maximum observed means were 40-120% higher than minimum observed means. The weak response of maize yield and dry mass to N treatment, especially in the first experiment (2016), can be attributed to very high initial soil Nmin values (see Table 3). In addition, soils at the experimental site are very rich in organic carbon and N and, therefore, can supply substantial amounts of N through mineralisation of organic matter. For instance, maize plants in unfertilised plots in 2017 accumulated 120.7 kg N/ha with an initial soil Nmin of 66.0 kg/ha. Given that 27.9 kg/ha of N was still available in the soil at crop harvest, the N balance (input minus output) was −82.6 kg/ha. Assuming soil as the only source of N, this negative balance must have been compensated by mineralisation of soil organic matter. Therefore, for field research on crop responses to N supply on these soils, the previous crops should not be fertilised with N for at least 1-2 seasons.
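The N balance quoted above can be retraced with a few lines of Python. The three input values are those reported in the text for the unfertilised plots in 2017; the variable names are illustrative, and the interpretation of the deficit as net mineralisation rests on the stated assumption that soil is the only N source.

```python
# Values for the unfertilised (N0) plots in 2017, as reported in the text (kg N/ha).
initial_soil_nmin = 66.0    # soil mineral N at the start of the season
plant_n_uptake = 120.7      # N accumulated by the crop
residual_soil_nmin = 27.9   # soil mineral N remaining at harvest

# Input minus output, with soil mineral N as the only input considered.
n_balance = initial_soil_nmin - (plant_n_uptake + residual_soil_nmin)

# Under that assumption, the deficit equals the apparent net mineralisation.
apparent_mineralisation = -n_balance

print(f"N balance: {n_balance:.1f} kg/ha")
print(f"Apparent net mineralisation: {apparent_mineralisation:.1f} kg/ha")
```

The result, a deficit of 82.6 kg N/ha, exceeds the initial soil Nmin itself, which illustrates how strongly mineralisation can mask the crop response to N fertilization at this site.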
With partial parameterization (ParLevel_1) based on observed phenological stages and main-stem leaf number, both models failed to provide accurate estimates of LAI, dry mass accumulation, N uptake and grain yield, but the performance of APSIM was generally better than that of iCrop (Figs. 5, 6). Thus, one possible important conclusion is that the simple model iCrop was more sensitive to parameterization. Detailed models like APSIM use many more equations and parameters (species- and cultivar-specific parameters) to describe crop processes and their responses to the environment; this probably prevents the model from making out-of-range predictions under new conditions. Thus, simple models need closer attention to parameterization under new conditions, but this needs to be confirmed by future studies.

Fig. 6 Relationship between measured and simulated nitrogen (N) uptake and grain yield of maize grown in 2016 and 2017 seasons under various levels of N fertilization for iCrop (a) and APSIM (b) models with partial parameterization

The results of the current study support the conclusion that partial parameterization may not be enough and does not guarantee improved performance of either simple or complex models under new situations. Palosuo et al. (2011) applied a "blind test" in a comparison of eight wheat models and concluded that none of the models perfectly reproduced recorded observations, and none were unequivocally accurate. In this "blind test", only phenological data were provided to the modellers to test the capability of their models to reproduce yields and yield variability under different climatic conditions in Europe. He et al. (2017), evaluating a range of experimental data for the parameterization of APSIM-Canola, showed that model parameterization needs to be carefully performed before the model can be used to simulate crop growth across diverse environments and management scenarios.
Full parameterization (ParLevel_2) greatly improved the performance of both crop models in simulating the dynamics of dry mass accumulation, LAI and N uptake over the growing seasons (Fig. 10) and in predicting final N uptake and grain yield (Fig. 11). The performance of the simple model (iCrop) was as good as or better than that of the complex model (APSIM). This is in agreement with the findings of Soltani and Sinclair (2015) in a comparison of four wheat models. They indicated that the coefficient of variation in the prediction of grain yield was about 45% lower for iCrop than for APSIM. They concluded that sacrificing transparency by adding more functions and parameters was not rewarded by increased model robustness. Other studies have also indicated that increased model complexity does not increase model robustness (e.g. Bell and Fischer 1994; Goudriaan 1996; Adam et al. 2011). For example, Adam et al. (2011) compared a simple and a detailed approach for simulating dry matter production and LAI dynamics and did not find any advantage of one approach over the other. Zhao et al. (2019) indicated that the performance of a new simple crop model (SIMPLE) was comparable to that of more detailed models (DSSAT and APSIM). Sinclair and Seligman (2000) stated that models should be kept as simple as possible.
The results of the current study confirm that there is a need to generate and compile high-quality data sets for parameterization and testing of crop models (Rosenzweig et al. 2014; Grassini et al. 2015). Such data sets should include not only production-related parameters such as crop yield but also cover temporal patterns of crop growth variables as well as dynamics of soil water, carbon, and N. The importance of good quality comprehensive data sets for better parameterization and testing of crop models has also been frequently highlighted by European and international climate change impact assessment research efforts such as AgMIP (https://www.agmip.org).
The results of the current study also support the conclusion that the role of model parameterization cannot be ignored (He et al. 2017) and that more investment in it is needed (Confalonieri et al. 2016). Seidel et al. (2018) conducted a large-scale survey of calibration practices among crop modelling teams (211 responses) and showed that approaches to crop model calibration vary very widely. They concluded that guidelines for crop model calibration would be very useful.
The iCrop model needed modification during ParLevel_2. Two values had to be defined for senesced stem N%, instead of the single value in the original model (SNCS), in order to improve the simulation of the LAI response to N stress. When only one value is used for senesced stem N%, N stress during vegetative growth in iCrop causes translocation of stem N to meet the leaf N demand. Therefore, the relatively low default value of SNCS prevented the reduction in LAI under N stress (see Fig. 5). Introducing two SNCS values (see Table 5) resulted in a reduction of LAI under N stress without affecting the translocation of N from the stem during the grain-filling phase. Modifications in iCrop are simple and straightforward, as the model is written in Visual Basic for Applications (VBA) in Excel and the code is open-access and easy to modify.
One of the principles of iCrop is to evaluate the model assumptions in view of the objectives of the current modelling situation, and to add new hypotheses and modify the model if required (Sinclair and Seligman 1996; Soltani and Sinclair 2012). For such modifications, the crop model needs to be transparent. For transparency, model parameters and code should be accessible and readily understood by model users (van Ittersum et al. 2003; Soltani and Sinclair 2012). Transparency is facilitated by a minimum number of parameters that can be independently observed and measured. Often, transparency is diminished as the complexity of a model is increased (Soltani and Sinclair 2015).
One important finding from the current study is that simple crop models may respond better to parameterization than complex models. ParLevel_2 was more effective for the simple model iCrop. While the performance of iCrop was as good as that of APSIM in predicting the dynamics of dry mass accumulation, LAI and N uptake during the growing season (Fig. 10), it outperformed APSIM in predicting final N uptake and grain yield; RMSEs (root mean square errors) were ≥ 40% lower for iCrop than for APSIM (Fig. 11). A potential reason for this behaviour is that some APSIM parameters are not classified as cultivar-specific and were therefore not parameterised. Even if they were classified as cultivar-specific, this might not help, because the experimental data needed to calculate them (e.g. for leaf canopy development) are often not available.
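The RMSE comparison above can be reproduced for any set of paired observations and simulations with a short Python sketch. It uses the standard definition of RMSE; the function names are illustrative, and no data from the experiment are included.

```python
import math


def rmse(observed, simulated):
    """Root mean square error between paired observed and simulated values."""
    if len(observed) != len(simulated) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(s - o) ** 2 for o, s in zip(observed, simulated)]
    return math.sqrt(sum(squared) / len(squared))


def relative_reduction(rmse_a, rmse_b):
    """Percentage by which rmse_a is lower than rmse_b."""
    return 100.0 * (rmse_b - rmse_a) / rmse_b
```

Calling `relative_reduction(rmse_icrop, rmse_apsim)` with the RMSEs of the two models for the same observations would return a value of 40 or more for a result such as the one reported in Fig. 11; `rmse_icrop` and `rmse_apsim` are placeholder names.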
Studies that have evaluated the impact of parameterization level on model predictions are limited. While modelling studies indicate that higher levels of parameterization generally improve model performance, they have not attempted to separate the results in terms of the complexity of the crop models applied, i.e. simple versus complex models. Bassu et al. (2014) inter-compared 23 maize models under two levels of input information and showed that the variability of model predictions was strongly reduced when the level of input information in model parameterization increased. Similarly, Battisti et al. (2017) compared five soybean models under three phases of parameterization: no parameterization using default parameters, partial parameterization using phenology data, and complete parameterization using phenology and growth data. They found that model performance improved from the no-parameterization phase to complete parameterization. Asseng et al. (2013) evaluated the uncertainty of 27 wheat models, partially or fully parameterized, in the simulation of wheat. Full parameterization reduced the relative RMSE in the prediction of all simulated variables associated with phenology, dry mass production, evapotranspiration, crop N uptake and yield.

Conclusion
Partial parameterization (ParLevel_1) was not adequate for best performance of the simple (iCrop) and detailed (APSIM) crop models used in the current study, although APSIM performed better at this parameterization level. iCrop was, therefore, more vulnerable to incomplete parameterization than APSIM. However, iCrop responded better to complete parameterisation, and its performance was even better than that of the detailed model. This is because the parameters governing leaf canopy development and biomass partitioning in iCrop can easily be derived from experimental data. More research with other simple and detailed models is required to confirm the results of the current research. Complete parameterisation should not be skipped lightly if model predictions matter.