Abstract
A variety of statistical methods have been developed for multivariate analysis of agricultural systems. Some statistical methods are rarely used to study these systems, although they can contribute to issues such as identifying atypical farms, modeling relations among variables and describing farms with common characteristics. To address these issues, we reviewed studies that applied kernel density estimation (KDE), copula modeling and extreme value theory (EVT) to French dairy farm data. KDE helped identify joint value ranges of forage production and milk production or greenhouse gas emissions that most farms in specific French region were likely to have. Copula modeling formalized the shapes of relations among farm characteristics, while EVT distinguished production strategies and management practices of farms that produced extreme amounts of forage. The present study reviews studies that applied these three methods, recommends when to use the latter and discusses their contribution to improving the understanding of dairy farms.
Similar content being viewed by others
Data availability statement
Data sharing is not applicable to this article since no new data were created in this study.
Abbreviations
- EVT:
-
Extreme value theory
- GHG:
-
Greenhouse gas emissions
- KDE:
-
Kernel density estimation
- cdf:
-
Cumulative distribution function
- pdf:
-
Probability density function
- LU:
-
Livestock unit
References
Aas K, Czado C, Frigessi A, Bakken H (2009) Pair-copula constructions of multiple dependence. Insur Math Econ 44:182–198
Beirlant J, Teugels JL, Vynckier P (1998) Some thoughts on extreme values. In: Accardi L, Heyde CC (eds) Probability towards 2000. Springer New York, New York, pp 58–73
Beirlant J, Goegebeur Y, Teugels JL, Segers J (2004) Statistics of extremes: theory and applications (Vol 558). John Wiley & Sons
Brisson N, Ruget F, Gate P, Lorgeou J, Nicoullaud B, Tayot X, Plenet D, Jeuffroy M-H, Bouthier A, Ripoche D (2002) STICS: a generic model for simulating crops and their water and nitrogen balances. II. Model validation for wheat and maize. Agronomie 22:69–92
Chardon X, Rigolot C, Baratte C, Martin-Clouaire R, Rellier J-P, Raison C, Gall AL, Dourmad J-Y, Poupa J-C, Delaby L (2011) A whole farm-model to simulate the environmental impacts of animal farming systems: MELODIE. Modelling nutrient digestion and utilisation in farm animals. Springer, pp 403–411
Charpentier A, Fermanian J-D, Scaillet O (2007) The estimation of copulas: theory and practice. In: Rank J (ed) Copulas: from theory to application in finance. Risk Books, London, pp 35–64
Chen SX, Huang T-M (2007) Nonparametric estimation of copula functions for dependence modelling. Can J Ournal of Stat 35:265–282
Coles S (2001) An introduction to statistical modeling of extreme values. Springer, London
Coles S, Heffernan J, Tawn J (1999) Dependence measures for extreme values analyses. Extremes 24:339–365
Corson MS, Rotz AC, Skinner HR, Sanderson MA (2007) Adaptation and evaluation of the integrated farm system model to simulate temperate multiple-species pastures. Agric Syst 94:502–508
Drees H, Sabourin A (2021) Principal component analysis for multivariate extremes. Electron J Stat 15:908–943
Drews J, Czycholl I, Junge W, Krieter J (2018) An evaluation of efficiency in dairy production using structural equation modelling. J Agric Sci 156:996–1004
Duong T, Hazelton ML (2005) Cross-validation bandwidth matrices for multivariate kernel density estimation. Scand J Stat 32:485–506
Embrechts P, Klüppelberg C, Mikosch T (1996) Modelling extremal events for insurance and finance. In: Rozovskii B, Glynn PW (eds) Stochastic modelling and applied probability. Springer
Embrechts P, Mcneil A, Straumann D (2002). Correlation and dependence in risk management: properties and pitfalls. In: Dempster, M (ed) Risk management: value at risk and beyond. Cambridge: Cambridge University Press, pp 176–223
FAO (2017) Livestock solutions for climate change. FAO Rome
Genest C, Favre A-C (2007) Everything you always wanted to know about copula modeling but were afraid to ask. J Hydrol Eng 12:347–368
Gerber PJ, Steinfeld H, Henderson B, Mootet A, Opio C, Dijkman J, Falcucci A, Tempio A (2013) Tackling climate change though livestock—a global assessment of Emissions and mitigation opportunities. Food and Agriculture Organization of the United Nations (FAO), Rome
Gil R, Bojacá CR, Schrevens E (2021) Accounting for correlational structures in stochastic comparative life cycle assessments through copula modeling. Int J Life Cycle Assess 26:604–615
Gobin A, Van de Vyver H (2021) Spatio-temporal variability of dry and wet spells and their influence on crop yields. Agric for Meteorol 308–309:108565
Gudendorf G, Segers J (2010) Extreme-value copulas. In: Jaworski, P, Durante, F, Härdle, W K, Rychlik, T (eds) Copula theory and its applications. Springer, Berlin, Heideberg, pp. 127-145
Hubert M, Vandervieren E (2008) An adjusted boxplot for skewed distributions. Comput Stat Data Anal 52:5186–5201
Hyndman RJ (1996) Computing and graphing highest density regions. Am Stat 50:120–126
Jayasundara S, Worden D, Weersink A, Wright T, VanderZaag A, Gordon R, Wagner-Riddle C (2019) Improving farm profitability also reduces the carbon footprint of milk production in intensive dairy production systems. J Clean Prod 229:1018–1028
Jouany J-P, Thivend P (2008) La production de méthane d’origine digestive chez les ruminants et son impact sur le réchauffement climatique. Manag Avenir 20:259
Kokonendji CC, Senga Kiessé T (2011) Discrete associated kernels method and extensions. Stat Methodol 8:497–516
Micha E, Heanue K, Hyland JJ, Hennessy T, Dillon EJ, Buckley C (2017) Sustainability levels in Irish dairy farming: a farm typology according to sustainable performance indicators. Stud Agric Econ 119:62–69
Muñoz-Tamayo R, Giger-Reverdin S, Sauvant D (2016) Mechanistic modelling of in vitro fermentation and methane production by rumen microbiota. Anim Feed Sci Technol 220:1–21
Nadarajah S, Afuecheta E, Chan S (2018) A compendium of copulas. Statistica (bologna) 77:279–328
Nagler T (2014) Kernel methods for vine copula estimation. Department of Mathematics. Technische Universität München
Nguyen TTH, Corson MS, Doreau M, Eugène M, van der Werf HMG (2013) Consequential LCA of switching from maize silage-based to grass-based dairy systems. Int J Life Cycle Assess 18:1470–1484
Niu M, Kebreab E, Hristov AN, Oh J, Arndt C, Bannink A, Bayat AR, Brito AF, Boland T, Casper D, Crompton LA, Dijkstra J, Eugene MA, Garnsworthy PC, Haque MN, Hellwing ALF, Huhtanen P, Kreuzer M, Kuhla B, Lund P, Madsen J, Martin C, McClelland SC, McGee M, Moate PJ, Muetzel S, Munoz C, O’Kiely P, Peiren N, Reynolds CK, Schwarm A, Shingfield KJ, Storlien TM, Weisbjerg MR, Yanez-Ruiz DR, Yu Z (2018) Prediction of enteric methane production, yield, and intensity in dairy cattle using an intercontinental database. Glob Change Biol 24:3368–3389
Opio C, Gerber P, Mottet A, Falcucci A, Tempio G, MacLeod M, Vellinga T, Henderson B, Steinfeld H (2013) Greenhouse gas emissions from ruminant supply chains—a global life cycle assessment. Food and Agriculture Organization of the United Nations (FAO), Rome
Ouatahar L, Bannink A, Lanigan G, Amon B (2021) Modelling the effect of feeding management on greenhouse gas and nitrogen emissions in cattle farming systems. Sci Total Environ 776:145932
Racine JS, Li Q, Yan KX (2020) Kernel smoothed probability mass functions for ordered datatypes. J Nonparametr Stat 32:563–586
Rosenblatt M (1956) Remarks on some nonparametric estimates of a density function. Ann Math Stat 27:832–837
Rotz CA (2004) The integrated farm system model: a tool for developing more economically and environmentally sustainable farming systems for the Northeast. In NABEC Papers (p. 1). American Society of Agricultural and Biological Engineers
Rotz CA (2018) Modeling greenhouse gas emissions from dairy farms. J Dairy Sci 101:6675–6690
Rousseeuw PJ, Ruts I, Tukey JW (1999) The bagplot: a bivariate boxplot. Am Stat 53:382–387
Salvadori G, Michele CD, Durante F (2011) On the return period and design in a multivariate framework. Hydrol Earth Syst Sci 15:3293–3305
Schaak H, Mußhoff O (2018) Understanding the adoption of grazing practices in German dairy farming. Agric Syst 165:230–239
Schlather M, Tawn JA (2003) A dependence measure for multivariate and spatial extreme values: properties and inference. Biometrika 90:139–156
Senga Kiessé T, Corson MS, Eugène M, Aubin J, Wilfart A (2019) Analysis of enteric methane emissions due to extreme variations in management practices of dairy-production systems. Agric Syst 173:449–457
Senga Kiessé T, Corson MS, Le Galludec G, Wilfart A (2020) Sensitivity of greenhouse gas emissions to extreme differences in forage production of dairy farms. Livest Sci 232:103906
Senga Kiessé T, Corson M, Wilfart A (2022a) Analysis of milk production and greenhouse gas emissions as a function of extreme variations in forage production among French dairy farms. J Environ Manag 307:114537
Senga Kiessé T, Corson MS, Eugène M (2022b) The potential of kernel density estimation for modelling relations among dairy farm characteristics. Agric Syst 199:103406
Senga Kiessé T, Heijungs R, Corson MS (2022c) Modeling production efficiency and greenhouse gas objectives as a function of forage production of dairy farms using copula models. Environ Model Assess 27:413-424
Shine P, Scully T, Upton J, Murphy MD (2018) Multiple linear regression modelling of on-farm direct water and electricity consumption on pasture based dairy farms. Comput Electron Agric 148:337–346
Sklar A (1959) Fonctions de répartition à n dimensions et leurs marges. Publ L’inst Stat L’univ Paris 8:229–231
Soteriades AD, Stott AW, Moreau S, Charroin T, Blanchard M, Liu J, Faverdin P (2016) The relationship of dairy farm eco-efficiency with intensification and self-sufficiency. Evidence from the French dairy sector using life cycle analysis, data envelopment analysis and partial least squares structural equation modelling. PLoS ONE 11:e0166445
Sutton MA, Bleeker A, Howard C, Erisman J, Abrol Y, Bekunda M, Datta A, Davidson E, De Vries W, Oenema O (2013) Our nutrient world. The challenge to produce more food & energy with less pollution. Centre for Ecology & Hydrology
Terranova MV, Molina R, Sánchez H, Campos R, Perilla S (2021) Influence of climatic conditions on tympanic temperature and milk production in grazing cows. J Anim Behav Biometeorol 9:2132–2132
Todde G, Murgia L, Caria M, Pazzona A (2016) A multivariate statistical analysis approach to characterize mechanization, structural and energy profile in Italian dairy farms. Energy Rep 2:129–134
Tsybakov AB (2004) Introduction à l'estimation non-paramétrique. Mathématiques & Applications (Vol 41). Springer
Wand MP, Jones MC (1995) Kernel smoothing. Chapman and Hall
Wolf CA, Sumner DA (2001) Are farm size distributions bimodal? evidence from kernel density estimates of dairy farm size distributions. Am J Agric Econ 83:77–88
Acknowledgements
The authors thank the French Livestock Institute (IDELE) for providing the dataset and Françoise Vertès, Samuel Le Féon and Hayo van der Werf for their careful reading of and helpful discussions about the manuscript, which helped to improve it greatly.
Funding
The authors declare that they received no funds, grants or other financial support for preparing this manuscript.
Author information
Authors and Affiliations
Contributions
Both TSK and MSC contributed to the structure of the review. TSK wrote the first draft, and both authors revised the manuscript. Both authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no financial interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Senga Kiesse, T., Corson, M.S. The utility of less-common statistical methods for analyzing agricultural systems: focus on kernel density estimation, copula modeling and extreme value theory. Behaviormetrika 50, 491–508 (2023). https://doi.org/10.1007/s41237-022-00190-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41237-022-00190-y