Skip to main content

Advertisement

Log in

Prediction of crop yield in India using machine learning and hybrid deep learning models

  • Research Article - Atmospheric & Space Sciences
  • Published:
Acta Geophysica Aims and scope Submit manuscript

Abstract

Crop yield prediction is one of the burgeoning research areas in the agriculture domain. The crop yield forecasting models are developed to enhance productivity with improved decision-making strategies. The highly efficient crop yield forecasting model assists farmers in determining when, what and how much to plant on their cultivable land. The main objective of the proposed research work is to build a high efficacious crop yield prediction model based on the data available for the period of 21 years from 1997 to 2017 using machine learning and hybrid deep learning approaches. Two prediction models have been proposed in this research work to predict the crop yield accurately. The first model is a machine learning-based model which uses the CatBoost regression model and its hyperparameters are tuned which improves the performance of the yield prediction using the Optuna framework. The second model is the hybrid deep learning model which uses spatio-temporal attention-based convolutional neural network (STACNN) for extracting the features and the bidirectional long short-term memory (BiLSTM) model for predicting the crop yield effectively. The proposed models are evaluated using the error metrics and compared with the latest contemporary models. From the evaluation results, it is shown that the proposed models significantly outperform all other existing models and CatBoost regression model slightly performs better than the STACNN-BiLSTM model, with the R-squared value of 0.99.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Availability of data and materials

The dataset for crop yield is collected from Kaggle website (The source of the dataset used in this research work is https://www.kaggle.com/datasets/abhinand05/crop-production-in-india/data or https://www.kaggle.com/code/ anjali21/indian-production-analysis-and-prediction/data) and Tata-Cornell Institute (TCI) website. The Kaggle website crop yield data are 646 districts of 33 Indian states and consists of historical information of crop yields between the years 1997–2015. Seven instances are present in the Kaggle crop yield dataset namely State name, District name, Crop year, Season, Crop name, Area and Production with 246091different attributes. The Kaggle crop yield dataset consists of nearly 124 types of crops grown all over in India. The crop yield dataset for district level Indian agriculture from TCI website is created by International Crops Research Institute for the Semi-Arid Tropics and TCI. The crop yield TCI dataset is collected for the years 2016 and 2017 which has 18,009 different attributes. Both the datasets are used for the proposed work by merging the data according to the common instances.

References

Download references

Funding

No funding has been claimed for this research work.

Author information

Authors and Affiliations

Authors

Contributions

KSS contributed to the conceptualization, methodology, resources, data curation, writing—original draft preparation, writing—review and editing, investigation and validation. VB was involved in the methodology, resources, supervision and validation.

Corresponding author

Correspondence to Krithikha Sanju Saravanan.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval

Since this research work deals with text data, ethical approval is not applicable.

Additional information

Edited by Dr. Ahmad Sharafati (ASSOCIATE EDITOR) / Prof. Theodore Karacostas (CO-EDITOR-IN-CHIEF).

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saravanan, K.S., Bhagavathiappan, V. Prediction of crop yield in India using machine learning and hybrid deep learning models. Acta Geophys. (2024). https://doi.org/10.1007/s11600-024-01312-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11600-024-01312-8

Keywords

Navigation