Skip to main content
Log in

The utility of less-common statistical methods for analyzing agricultural systems: focus on kernel density estimation, copula modeling and extreme value theory

  • Review Paper
  • Published:
Behaviormetrika Aims and scope Submit manuscript

Abstract

A variety of statistical methods have been developed for multivariate analysis of agricultural systems. Some statistical methods are rarely used to study these systems, although they can contribute to issues such as identifying atypical farms, modeling relations among variables and describing farms with common characteristics. To address these issues, we reviewed studies that applied kernel density estimation (KDE), copula modeling and extreme value theory (EVT) to French dairy farm data. KDE helped identify joint value ranges of forage production and milk production or greenhouse gas emissions that most farms in specific French region were likely to have. Copula modeling formalized the shapes of relations among farm characteristics, while EVT distinguished production strategies and management practices of farms that produced extreme amounts of forage. The present study reviews studies that applied these three methods, recommends when to use the latter and discusses their contribution to improving the understanding of dairy farms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Source: Integrated Farm System Model, USDA-ARS, https://www.quantitative-plant.org/model/IntegratedFarmSystemModel

Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability statement

Data sharing is not applicable to this article since no new data were created in this study.

Abbreviations

EVT:

Extreme value theory

GHG:

Greenhouse gas emissions

KDE:

Kernel density estimation

cdf:

Cumulative distribution function

pdf:

Probability density function

LU:

Livestock unit

References

  • Aas K, Czado C, Frigessi A, Bakken H (2009) Pair-copula constructions of multiple dependence. Insur Math Econ 44:182–198

    MATH  Google Scholar 

  • Beirlant J, Teugels JL, Vynckier P (1998) Some thoughts on extreme values. In: Accardi L, Heyde CC (eds) Probability towards 2000. Springer New York, New York, pp 58–73

    Google Scholar 

  • Beirlant J, Goegebeur Y, Teugels JL, Segers J (2004) Statistics of extremes: theory and applications (Vol 558). John Wiley & Sons

  • Brisson N, Ruget F, Gate P, Lorgeou J, Nicoullaud B, Tayot X, Plenet D, Jeuffroy M-H, Bouthier A, Ripoche D (2002) STICS: a generic model for simulating crops and their water and nitrogen balances. II. Model validation for wheat and maize. Agronomie 22:69–92

    Google Scholar 

  • Chardon X, Rigolot C, Baratte C, Martin-Clouaire R, Rellier J-P, Raison C, Gall AL, Dourmad J-Y, Poupa J-C, Delaby L (2011) A whole farm-model to simulate the environmental impacts of animal farming systems: MELODIE. Modelling nutrient digestion and utilisation in farm animals. Springer, pp 403–411

    Google Scholar 

  • Charpentier A, Fermanian J-D, Scaillet O (2007) The estimation of copulas: theory and practice. In: Rank J (ed) Copulas: from theory to application in finance. Risk Books, London, pp 35–64

    Google Scholar 

  • Chen SX, Huang T-M (2007) Nonparametric estimation of copula functions for dependence modelling. Can J Ournal of Stat 35:265–282

    MATH  Google Scholar 

  • Coles S (2001) An introduction to statistical modeling of extreme values. Springer, London

    MATH  Google Scholar 

  • Coles S, Heffernan J, Tawn J (1999) Dependence measures for extreme values analyses. Extremes 24:339–365

    MATH  Google Scholar 

  • Corson MS, Rotz AC, Skinner HR, Sanderson MA (2007) Adaptation and evaluation of the integrated farm system model to simulate temperate multiple-species pastures. Agric Syst 94:502–508

    Google Scholar 

  • Drees H, Sabourin A (2021) Principal component analysis for multivariate extremes. Electron J Stat 15:908–943

    MATH  Google Scholar 

  • Drews J, Czycholl I, Junge W, Krieter J (2018) An evaluation of efficiency in dairy production using structural equation modelling. J Agric Sci 156:996–1004

    Google Scholar 

  • Duong T, Hazelton ML (2005) Cross-validation bandwidth matrices for multivariate kernel density estimation. Scand J Stat 32:485–506

    MATH  Google Scholar 

  • Embrechts P, Klüppelberg C, Mikosch T (1996) Modelling extremal events for insurance and finance. In: Rozovskii B, Glynn PW (eds) Stochastic modelling and applied probability. Springer

    MATH  Google Scholar 

  • Embrechts P, Mcneil A, Straumann D (2002). Correlation and dependence in risk management: properties and pitfalls. In: Dempster, M (ed) Risk management: value at risk and beyond. Cambridge: Cambridge University Press, pp 176–223

  • FAO (2017) Livestock solutions for climate change. FAO Rome

    Google Scholar 

  • Genest C, Favre A-C (2007) Everything you always wanted to know about copula modeling but were afraid to ask. J Hydrol Eng 12:347–368

    Google Scholar 

  • Gerber PJ, Steinfeld H, Henderson B, Mootet A, Opio C, Dijkman J, Falcucci A, Tempio A (2013) Tackling climate change though livestock—a global assessment of Emissions and mitigation opportunities. Food and Agriculture Organization of the United Nations (FAO), Rome

    Google Scholar 

  • Gil R, Bojacá CR, Schrevens E (2021) Accounting for correlational structures in stochastic comparative life cycle assessments through copula modeling. Int J Life Cycle Assess 26:604–615

    Google Scholar 

  • Gobin A, Van de Vyver H (2021) Spatio-temporal variability of dry and wet spells and their influence on crop yields. Agric for Meteorol 308–309:108565

    Google Scholar 

  • Gudendorf G, Segers J (2010) Extreme-value copulas. In: Jaworski, P, Durante, F, Härdle, W K, Rychlik, T (eds) Copula theory and its applications. Springer, Berlin, Heideberg, pp. 127-145

  • Hubert M, Vandervieren E (2008) An adjusted boxplot for skewed distributions. Comput Stat Data Anal 52:5186–5201

    MATH  Google Scholar 

  • Hyndman RJ (1996) Computing and graphing highest density regions. Am Stat 50:120–126

    Google Scholar 

  • Jayasundara S, Worden D, Weersink A, Wright T, VanderZaag A, Gordon R, Wagner-Riddle C (2019) Improving farm profitability also reduces the carbon footprint of milk production in intensive dairy production systems. J Clean Prod 229:1018–1028

    Google Scholar 

  • Jouany J-P, Thivend P (2008) La production de méthane d’origine digestive chez les ruminants et son impact sur le réchauffement climatique. Manag Avenir 20:259

    Google Scholar 

  • Kokonendji CC, Senga Kiessé T (2011) Discrete associated kernels method and extensions. Stat Methodol 8:497–516

    MATH  Google Scholar 

  • Micha E, Heanue K, Hyland JJ, Hennessy T, Dillon EJ, Buckley C (2017) Sustainability levels in Irish dairy farming: a farm typology according to sustainable performance indicators. Stud Agric Econ 119:62–69

    Google Scholar 

  • Muñoz-Tamayo R, Giger-Reverdin S, Sauvant D (2016) Mechanistic modelling of in vitro fermentation and methane production by rumen microbiota. Anim Feed Sci Technol 220:1–21

    Google Scholar 

  • Nadarajah S, Afuecheta E, Chan S (2018) A compendium of copulas. Statistica (bologna) 77:279–328

    MATH  Google Scholar 

  • Nagler T (2014) Kernel methods for vine copula estimation. Department of Mathematics. Technische Universität München

    Google Scholar 

  • Nguyen TTH, Corson MS, Doreau M, Eugène M, van der Werf HMG (2013) Consequential LCA of switching from maize silage-based to grass-based dairy systems. Int J Life Cycle Assess 18:1470–1484

    Google Scholar 

  • Niu M, Kebreab E, Hristov AN, Oh J, Arndt C, Bannink A, Bayat AR, Brito AF, Boland T, Casper D, Crompton LA, Dijkstra J, Eugene MA, Garnsworthy PC, Haque MN, Hellwing ALF, Huhtanen P, Kreuzer M, Kuhla B, Lund P, Madsen J, Martin C, McClelland SC, McGee M, Moate PJ, Muetzel S, Munoz C, O’Kiely P, Peiren N, Reynolds CK, Schwarm A, Shingfield KJ, Storlien TM, Weisbjerg MR, Yanez-Ruiz DR, Yu Z (2018) Prediction of enteric methane production, yield, and intensity in dairy cattle using an intercontinental database. Glob Change Biol 24:3368–3389

    Google Scholar 

  • Opio C, Gerber P, Mottet A, Falcucci A, Tempio G, MacLeod M, Vellinga T, Henderson B, Steinfeld H (2013) Greenhouse gas emissions from ruminant supply chains—a global life cycle assessment. Food and Agriculture Organization of the United Nations (FAO), Rome

    Google Scholar 

  • Ouatahar L, Bannink A, Lanigan G, Amon B (2021) Modelling the effect of feeding management on greenhouse gas and nitrogen emissions in cattle farming systems. Sci Total Environ 776:145932

    Google Scholar 

  • Racine JS, Li Q, Yan KX (2020) Kernel smoothed probability mass functions for ordered datatypes. J Nonparametr Stat 32:563–586

    MATH  Google Scholar 

  • Rosenblatt M (1956) Remarks on some nonparametric estimates of a density function. Ann Math Stat 27:832–837

    MATH  Google Scholar 

  • Rotz CA (2004) The integrated farm system model: a tool for developing more economically and environmentally sustainable farming systems for the Northeast. In NABEC Papers (p. 1). American Society of Agricultural and Biological Engineers

  • Rotz CA (2018) Modeling greenhouse gas emissions from dairy farms. J Dairy Sci 101:6675–6690

    Google Scholar 

  • Rousseeuw PJ, Ruts I, Tukey JW (1999) The bagplot: a bivariate boxplot. Am Stat 53:382–387

    Google Scholar 

  • Salvadori G, Michele CD, Durante F (2011) On the return period and design in a multivariate framework. Hydrol Earth Syst Sci 15:3293–3305

    Google Scholar 

  • Schaak H, Mußhoff O (2018) Understanding the adoption of grazing practices in German dairy farming. Agric Syst 165:230–239

    Google Scholar 

  • Schlather M, Tawn JA (2003) A dependence measure for multivariate and spatial extreme values: properties and inference. Biometrika 90:139–156

    MATH  Google Scholar 

  • Senga Kiessé T, Corson MS, Eugène M, Aubin J, Wilfart A (2019) Analysis of enteric methane emissions due to extreme variations in management practices of dairy-production systems. Agric Syst 173:449–457

    Google Scholar 

  • Senga Kiessé T, Corson MS, Le Galludec G, Wilfart A (2020) Sensitivity of greenhouse gas emissions to extreme differences in forage production of dairy farms. Livest Sci 232:103906

    Google Scholar 

  • Senga Kiessé T, Corson M, Wilfart A (2022a) Analysis of milk production and greenhouse gas emissions as a function of extreme variations in forage production among French dairy farms. J Environ Manag 307:114537

    Google Scholar 

  • Senga Kiessé T, Corson MS, Eugène M (2022b) The potential of kernel density estimation for modelling relations among dairy farm characteristics. Agric Syst 199:103406

    Google Scholar 

  • Senga Kiessé T, Heijungs R, Corson MS (2022c) Modeling production efficiency and greenhouse gas objectives as a function of forage production of dairy farms using copula models. Environ Model Assess 27:413-424

    Google Scholar 

  • Shine P, Scully T, Upton J, Murphy MD (2018) Multiple linear regression modelling of on-farm direct water and electricity consumption on pasture based dairy farms. Comput Electron Agric 148:337–346

    Google Scholar 

  • Sklar A (1959) Fonctions de répartition à n dimensions et leurs marges. Publ L’inst Stat L’univ Paris 8:229–231

    MATH  Google Scholar 

  • Soteriades AD, Stott AW, Moreau S, Charroin T, Blanchard M, Liu J, Faverdin P (2016) The relationship of dairy farm eco-efficiency with intensification and self-sufficiency. Evidence from the French dairy sector using life cycle analysis, data envelopment analysis and partial least squares structural equation modelling. PLoS ONE 11:e0166445

    Google Scholar 

  • Sutton MA, Bleeker A, Howard C, Erisman J, Abrol Y, Bekunda M, Datta A, Davidson E, De Vries W, Oenema O (2013) Our nutrient world. The challenge to produce more food & energy with less pollution. Centre for Ecology & Hydrology

    Google Scholar 

  • Terranova MV, Molina R, Sánchez H, Campos R, Perilla S (2021) Influence of climatic conditions on tympanic temperature and milk production in grazing cows. J Anim Behav Biometeorol 9:2132–2132

    Google Scholar 

  • Todde G, Murgia L, Caria M, Pazzona A (2016) A multivariate statistical analysis approach to characterize mechanization, structural and energy profile in Italian dairy farms. Energy Rep 2:129–134

    Google Scholar 

  • Tsybakov AB (2004) Introduction à l'estimation non-paramétrique. Mathématiques & Applications (Vol 41). Springer

  • Wand MP, Jones MC (1995) Kernel smoothing. Chapman and Hall

    MATH  Google Scholar 

  • Wolf CA, Sumner DA (2001) Are farm size distributions bimodal? evidence from kernel density estimates of dairy farm size distributions. Am J Agric Econ 83:77–88

    Google Scholar 

Download references

Acknowledgements

The authors thank the French Livestock Institute (IDELE) for providing the dataset and Françoise Vertès, Samuel Le Féon and Hayo van der Werf for their careful reading of and helpful discussions about the manuscript, which helped to improve it greatly.

Funding

The authors declare that they received no funds, grants or other financial support for preparing this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

Both TSK and MSC contributed to the structure of the review. TSK wrote the first draft, and both authors revised the manuscript. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Tristan Senga Kiesse.

Ethics declarations

Conflict of interest

The authors declare that they have no financial interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Senga Kiesse, T., Corson, M.S. The utility of less-common statistical methods for analyzing agricultural systems: focus on kernel density estimation, copula modeling and extreme value theory. Behaviormetrika 50, 491–508 (2023). https://doi.org/10.1007/s41237-022-00190-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41237-022-00190-y

Keywords

Navigation