Abstract
Mustard is India's second most important edible oilseed after groundnut. Adverse weather drastically reduces mustard yield, and weather variables affect the crop differently at different stages of development. The influence of weather on crop yield depends not only on the magnitude of the weather variables but also on their distribution over the crop growing period. Developing models that use weather variables for accurate and timely crop yield prediction is therefore of foremost importance for crop management and for planning decisions regarding storage, import, export, etc. Machine learning plays a significant role as a decision support tool for crop yield prediction. Models for mustard yield prediction were developed using long-term weather data for the crop growing period together with mustard yield data. Six techniques were used: variable selection by stepwise multiple linear regression (SMLR) followed by an artificial neural network (SMLR-ANN), a support vector machine (SMLR-SVM), or a random forest (SMLR-RF); and variable extraction by principal component analysis (PCA) followed by ANN (PCA-ANN), SVM (PCA-SVM), or RF (PCA-RF). The developed models were then optimally combined to improve the accuracy of mustard yield prediction. On the basis of the model accuracy parameters nRMSE, RMSE, and RPD, the PCA-SVM model performed best among the six models developed for mustard yield prediction in the study areas. Yield predictions from the optimum combinations of models were better than those from the individual models.
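The accuracy parameters named in the abstract can be illustrated with a minimal sketch. It assumes the usual definitions: RMSE as root mean square error, nRMSE as RMSE expressed as a percentage of the mean observed yield, and RPD as the sample standard deviation of the observations divided by RMSE. The abstract does not say how the optimum combination was formed, so the inverse-squared-error weighting below is only one simple possibility and should not be read as the authors' method. All function names and yield values are hypothetical.

```python
import math
from statistics import mean, stdev


def rmse(observed, predicted):
    """Root mean square error, in the same units as the yield."""
    return math.sqrt(mean((o - p) ** 2 for o, p in zip(observed, predicted)))


def nrmse(observed, predicted):
    """RMSE as a percentage of the mean observed yield (assumed definition)."""
    return 100.0 * rmse(observed, predicted) / mean(observed)


def rpd(observed, predicted):
    """Ratio of performance to deviation: sample SD of observations / RMSE."""
    return stdev(observed) / rmse(observed, predicted)


def inverse_mse_weights(observed, forecasts):
    """Weight each model by 1 / RMSE^2, normalised so the weights sum to 1.

    Assumption: a simple weighting that ignores correlation between the
    models' errors. It is not taken from the paper.
    """
    inverse = [1.0 / rmse(observed, f) ** 2 for f in forecasts]
    total = sum(inverse)
    return [w / total for w in inverse]


def combine(forecasts, weights):
    """Weighted average of several models' predictions, season by season."""
    n_seasons = len(forecasts[0])
    return [
        sum(w * f[i] for w, f in zip(weights, forecasts))
        for i in range(n_seasons)
    ]


# Hypothetical yields (kg/ha) for four seasons; not data from the study.
observed = [1200.0, 1350.0, 1100.0, 1500.0]
model_a = [1250.0, 1300.0, 1150.0, 1450.0]
model_b = [1100.0, 1450.0, 1000.0, 1600.0]

weights = inverse_mse_weights(observed, [model_a, model_b])
combined = combine([model_a, model_b], weights)

for name, pred in [("A", model_a), ("B", model_b), ("combined", combined)]:
    print(name, round(rmse(observed, pred), 2),
          round(nrmse(observed, pred), 2), round(rpd(observed, pred), 2))
```

The toy values were chosen so that the two models' errors partly cancel, which is why the combined forecast scores better here. With real data the weights would normally be estimated on a calibration period and evaluated on held-out seasons.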
Data Availability
Weather data are available at NDC IMD Pune.
Acknowledgements
The authors are highly grateful to the editor-in-chief and reviewers of the International Journal of Biometeorology for their constructive comments and suggestions, which improved the content of the paper. The authors acknowledge the Director, ICAR-Indian Agricultural Research Institute, New Delhi, India, for providing the facilities.
Funding
ICAR-IARI, New Delhi.
Ethics declarations
The contents and views expressed in this study are those of the authors. The paper is not currently being considered for publication elsewhere and reflects the authors’ own research and analysis in a truthful and complete manner.
Conflict of interest
The authors declare no competing interests.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Vashisth, A., Goyal, A. Prediction of mustard yield using different machine learning techniques: a case study of Rajasthan, India. Int J Biometeorol 67, 539–551 (2023). https://doi.org/10.1007/s00484-023-02434-2
DOI: https://doi.org/10.1007/s00484-023-02434-2