Skip to main content
Log in

Prediction of mustard yield using different machine learning techniques: a case study of Rajasthan, India

  • Original Paper
  • Published:
International Journal of Biometeorology Aims and scope Submit manuscript

Abstract

Mustard is the second most important edible oilseed after groundnut for India. Adverse weather drastically reduces the mustard yield. Weather variables affect the crop differently during different stages of development. Weather influence on crop yield depends not only on the magnitude of weather variables but also on weather distribution pattern over the crop growing period. Hence, developing models using weather variables for accurate and timely crop yield prediction is foremost important for crop management and planning decisions regarding storage, import, export, etc. Machine learning plays a significant role as it has a decision support tool for crop yield prediction. The models for mustard yield prediction was developed using long-term weather data during the crop growing period along with mustard yield data. Techniques used for developing the model were variable selection using stepwise multiple linear regression (SMLR) and artificial neural network (SMLR-ANN), variable selection using SMLR and support vector machine (SMLR-SVM), variable selection using SMLR and random forest (SMLR-RF), variable extraction using principal component analysis (PCA) and ANN (PCA-ANN), variable extraction using PCA and SVM (PCA-SVM), and variable extraction using PCA and RF (PCA-RF). Optimal combinations of the developed models were done for improving the accuracy of mustard yield prediction. Results showed that, on the basis of model accuracy parameters nRMSE, RMSE, and RPD, the PCA-SVM model performed best among all the six models developed for mustard yield prediction of study areas. Performance of mustard yield prediction done by optimum combinations of the models was better than the individual model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data Availability

Weather data are availble at NDC IMD Pune

References

  • Aggarwal PK, Bandyopadhyay SK, Pathak H, Kalra N, Chandra S, Kumar S (2000) Analysis of yield trend of the rice-wheat system in north-western India. Outlook Agric 29(4):259–268

    Article  Google Scholar 

  • Ahmad MW, Mourshed M, Rezgui Y (2017) Trees vs neurons: comparison between random forest and ANN for high-resolution prediction of building energy consumption. Energy Build 147:77–89

    Article  Google Scholar 

  • Aravind KS, Ananta V, Krishanan P, DAS B, (2022) Wheat yield prediction based on weather parameters using multiple linear, neural network and penalised regression models. J Agrometeorol 24(1):18–25

    Article  Google Scholar 

  • Azfar M, Sisodia BVS, Rai VN, Devi M (2015) Pre-harvest forecast models for rapeseed & mustard yield using principal component analysis of weather variables. Mausam 4:761–766

    Article  Google Scholar 

  • Balakrishnan N, Muthukumarasamy G (2016) Crop production-ensemble machine learning model for prediction. Int J Comput Sci Soft Eng 5(7):148–153

    Google Scholar 

  • Bates JM, Granger CW (1969) The combination of forecasts. J Opera Res Soc 20(4):451–468

    Article  Google Scholar 

  • Boomiraj K, Chakrabarti B, Aggarwal PK, Choudhary R, Chander S (2010) Assessing the vulnerability of Indian mustard to climate change. Agric Ecosyst Environ 138:265–273

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Kluwer Academic Publishers. Manufactured Neth 45:5–32

    Google Scholar 

  • DACFW (2021) Annual report 2020–21. Department of Agriculture Cooperation & Farmer’s Welfare, Ministry of Agriculture & Farmer’s Welfare, Government of India:1–298

  • Deryng D, Conway D, Ramankutty N, Price J, Warren R (2014) Global crop yield response to extreme heat stress under multiple climate change futures. Environ Res Lett 9:1–13. https://doi.org/10.1088/1748-9326/9/3/034011

    Article  Google Scholar 

  • Everingham Y, Sexton J, Skocaj D, Bamber GI (2016) Accurate prediction of sugarcane yield using a random forest algorithm. Agron Sustain Dev 36(2):1–9. https://doi.org/10.1007/s13593-016-0364-z

    Article  Google Scholar 

  • Fukuda S, Spreer W, Yasunaga E, Yuge K, Sardsud V, Muller J (2013) Random forests modelling for the estimation of mango (Mangifera indica L. cv. Chok Anan) fruit yields under different irrigation regimes. Agric Water Manag 116:142–150

    Article  Google Scholar 

  • Gandhi N, Armstrong LJ, Petkar O, Tripathy AK (2016) Rice crop yield prediction in India using support vector machines. In Proceedings of the 2016 13th international joint conference on computer science and software engineering Khon Kaen Thailand, IEEE, Piscataway, NJ:1–5 13–15 July 2016

  • Goyal A, Ananta V (2021) Mustard yield prediction using machine learning approach. J Agric Phys 21(2):445–456

    Google Scholar 

  • Gromping U (2009) Variable importance assessment in regression: linear regression versus random forest. Am Stat 63(4):308–319

    Article  Google Scholar 

  • Hsiao C, Wan SK (2014) Is there an optimal forecast combination? J Econ 178:294–309

    Article  Google Scholar 

  • IPCC (2022) Climate change impacts, adaptation and vulnerability contribution of working group II to the sixth assessment report of the Intergovernmental Panel on Climate Change. In: Portner HO, Roberts DC, Tignor M, Poloczanska ES, Mintenbeck K, Alegría A, Craig M, Langsdorf S, Loschke S, Moller V, Okem A, Rama B (eds) Cambridge University Press, Cambridge, UK and New York, pp 1–3056. https://doi.org/10.1017/9781009325844

  • Jamieson PD, Porter JR, Wilson DR (1991) A test of computer simulation model ARC-WHEAT1 on wheat crops grown in New Zealand. Field Crops Res 27:337–350

    Article  Google Scholar 

  • Jeong JH, Resop JP, Mueller ND, Fleisher DH, Yun K, Butler EE, Timlin DJ, Shim KM, Gerber JS, Reddy VR, Kim SH (2016) Random forests for global and regional crop yield predictions. Plos One 1–15. https://doi.org/10.1371/journal.pone.0156571

  • Johnson M, Hsie W, Cannon A, Davidson A, Bedard F (2016) Crop yield forecasting on the Canadian Prairies by remotely sensed vegetation indices and machine learning methods. Agric For Meteorol 3:74–84

    Article  Google Scholar 

  • Kalra N, Chakraborty D, Sharma A, Rai HK, Jolly M, Chander S, Kumar PR, Bhandrary S, Barman D, Mittal RB, Lal M, Sehgal M (2008) Effect of increasing temperature on yield of some winter crops in Northwest India. Curr Sci 94:82–88

    Google Scholar 

  • Karimi Y, Prasher SO, Madani A, Kim S (2008) Application of support vector machine technology for the estimation of crop biophysical parameters using aerial hyperspectral observations. Can Biosyst Eng 50:7.14–7.20

    Google Scholar 

  • Kuhn M (2008) Building predictive models in R using caret package. J Stat Soft 28:1–6

    Article  Google Scholar 

  • Palanivel K, Surianarayanan C (2019) An approach for prediction of crop yield using machine learning and big data techniques. Int J Comp Eng Tech 10(3):110–118

    Google Scholar 

  • Pandey PC, Dadhwal VK, Sahai B, Kale PP (1992) An optimal estimation technique for increasing the accuracy of crop forecasts by combining remotely sensed and conventional forecast results. Int J Remote Sens 13(14):2735–2741

    Article  Google Scholar 

  • Parviz L (2018) Assessing accuracy of barley yield forecasting with integration of climate variables and support vector regression. Anna Biologia 73(1):19–30

    Google Scholar 

  • Smola AJ, Scholkopf B (2004) A tutorial on support vector regression. Stat Comp 14:199–222

    Article  Google Scholar 

  • Song H, Zhang R, Zhang Y, Xia F, Miao Q (2011) Energy consumption combination forecast of Hebei province based on the IOWA operator. Energy Procedia 5:2224–2229

    Article  Google Scholar 

  • SRD (2022) Statista Research Department (2022) Gross value added from rapeseed and mustard in India FY 2012–2020

  • Su YX, Xu H, Yan LJ (2017) Support vector machine-based open crop model (SBOCM): case of rice production in China. Saudi J Biologi Sci 24(3):537–547

    Article  Google Scholar 

  • Suzuki M, Shibahara T, Muragaki Y (2020) A method to extract feature variables contributed in nonlinear machine learning prediction. Methods Infor Med 59(1):1–8

    Article  Google Scholar 

  • Vashisth A, Singh R, Choudary M (2014) Crop yield forecast at different growth stage of wheat crop using statistical model under semi arid region. J Agroecosyst Nat Resour Manag 1(1):1–3

    Google Scholar 

  • Vashisth A, Goyal A, Roy D (2018) Pre harvest maize crop yield forecast at different growth stage using different model under semi arid region of India. Int J Tropi Agric 36(4):915–920

    Google Scholar 

  • Vashisth A, Aravind KS (2020) Multistage mustard yield estimation based on weather variables using multiple linear, LASSO and elastic net models for semi arid region of India. J Agric Phys 20(2):213–223

    Google Scholar 

  • ViscarraRossel RA, Walvoort DJJ, Mcbratney AB, Janik LJ, Skjemstad JO (2006) Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma 131:59–75

    Article  CAS  Google Scholar 

  • Zhang Z (2016) Variable selection with stepwise and best subset approaches. Ann Transl Med 4(7):126

    Article  Google Scholar 

Download references

Acknowledgements

The authors are highly grateful and thankful to the editor in chief and reviewers of International Journal of Biometeorology for their fruitful, constructive comments and suggestions, which improved the content of the paper. Authors acknowledge Director, ICAR-Indian Agricultural Research Institute, New Delhi, India, for providing the facilities.

Funding

ICAR-IARI, New Delhi.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ananta Vashisth.

Ethics declarations

The contents and view expressed in this study are the views of the author. The paper is not currently being considered for publication elsewhere and reflects the authors’ own research and analysis in a truthful and complete manner.

Conflict of interest

The authors declare no competing interests.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vashisth, A., Goyal, A. Prediction of mustard yield using different machine learning techniques: a case study of Rajasthan, India. Int J Biometeorol 67, 539–551 (2023). https://doi.org/10.1007/s00484-023-02434-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00484-023-02434-2

Keywords

Navigation