Skip to main content

Advertisement

Log in

An automatic crop yield prediction framework designed with two-stage classifiers: a meta-heuristic approach

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Agriculture is the main source of income for 75% of the Indian people. Weather, climate, and other natural factors have a higher influence on agricultural productivity. Forecasting crop yields is a very challenging issue for global food security. Forecasting crop yield based on the soil, environment, water, as well as crop parameters, is an important research field. Recently, many existing methods are presented to solve crop yield prediction. But, the performances of those models are inaccurate and suffered from errors. Hence, in this article, a deep learning-based system for agricultural production prediction based on environmental data is established. This research includes three stages, like pre-processing, feature extraction, and as well as prediction. The obtained raw data (environmental data) is pre-processed using a data cleaning technique to improve data quality prediction performance. The most dependable properties, such as statistical features, improved correlation, and mutual information-based features, are then retrieved from the pre-processed data. The yield predictors in the yield prediction phase are trained using these characteristics. Two-stage classifiers, stage-1 pre-prediction, and stage-2 final classification are used to represent the yield prediction phase. “Deep Belief Networks (DBN), Long Short Term Memory Networks (LSTM), and Recurrent Neural Networks (RNN)” are all used in the pre-prediction stage. The DBN, LSTM, and RNN outputs are sent into the final classification step, and it contains an improved Convolutional Neural Network (CNN). The CNN's weight function is fine-tuned via the novel Dingo Optimized Sand Piper (DOSP) model as it makes final judgments about crop production. Eventually, the efficiency of the anticipated model (two-stage classifier with DOSP) is validated by a comparative examination in terms of various measures. This research showed the efficiency of the proposed work in different types of datasets. In particular, for dataset 1, the MAE of the developed method at the 70th training rate is 50%, 62.5%, 57.1%, and 40% improved over the existing SOA, DOX, BOA, BMVO, GWO and PRO, respectively. For dataset 2, the mean of the suggested work is 2.76%, 2.57%, 2.57%, and 2.85% better than existing SOA, DOX, BOA, BMVO, GWO and PRO, schemes respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The data were generated by using https://www.kaggle.com/prasadkevin/crops-prediction-indian-dataset and https://www.kaggle.com/prasadkevin/crops-prediction-indian-dataset

Abbreviations

ANN:

Artificial Neural Network

Bi-GRU:

Bidirectional Gated Recurrent Unit

Bi-LSTM:

Bidirectional Long Short-Term Memory

BP:

Back Propagation

BPNN:

Backpropagation neural networks

BR:

Bayesian regularization

CNN:

Convolutional Neural Network

DBN-FNN:

Deep Belief Network—Fuzzy Neural Networks

DOSP:

Dingo Optimized Sand Piper

DOX:

Dingo Optimizer

FAO:

Food and Agriculture Organization

FDEA:

Fuzzy Data Envelopment Analysis

GWR:

Geographically weighted regression

IndRNN:

Independently Recurrent Neural Network

LM:

Levenberg–Marquardt

LR:

Logistic Regression

LSTM:

Long Short Term Memory

MAE:

Mean Absolute Error

ML:

Machine Learning

MLR:

Multiple Linear Regression

MSE:

Mean Square Error

OBL:

Opposition-based Learning

OLS:

Ordinary Least Squares

PCA:

Principal Component Analysis

PRO:

Poor and Rich Optimization

RF:

Random Forest

RL-RF:

Reinforcement Learning – Random Forest

RMSE:

Root Mean Square Error

RNN:

Recurrent Neural Network

RQ:

Research Question

SCG:

Scaled Conjugated Gradient

SOA:

Sandpiper Optimization Algorithm

SVM:

Support Vector Machine

SVR:

Support Vector Regression

WHO:

World Health Organization

References

  1. Zhao S, Zheng H, Chi M, Chai X, Liu Y (2019) Rapid yield prediction in paddy fields based on 2D image modelling of rice panicles. Comput Electron Agric 162:759–766

    Article  Google Scholar 

  2. Guo Y, Fu Y, Hao F, Zhang X, Wu W, Jin X, Senthilnath J (2021) Integrated phenology and climate in rice yields prediction using machine learning methods. Ecol Ind 120:106935

    Article  Google Scholar 

  3. Chu Z, Yu J (2020) An end-to-end model for rice yield prediction using deep learning fusion. Comput Electron Agric 174:105471

    Article  Google Scholar 

  4. Elavarasan D, Durai Raj Vincent PM (2021) Fuzzy deep learning-based crop yield prediction model for sustainable agronomical frameworks. Neural Comput Appli, pp. 1–20

  5. Nandy A, Singh PK. Application of fuzzy DEA and machine learning algorithms in efficiency estimation of paddy producers of rural Eastern India. Benchmarking: An International Journal. 2020.

  6. Amaratunga V, Wickramasinghe L, Perera A, Jayasinghe J, Rathnayake U (2020) Artificial neural network to estimate the paddy yield prediction using climatic data. Mathematic Probl Eng

  7. Shiu YS, Chuang YC (2019) Yield estimation of paddy rice based on satellite imagery: Comparison of global and local regression models. Remote Sensing 11(2):111

    Article  ADS  Google Scholar 

  8. Elavarasan D, Vincent PD (2020) Crop yield prediction using deep reinforcement learning model for sustainable agrarian applications. IEEE Access 8:86886–86901

    Article  Google Scholar 

  9. Rashid M, Bari BS, Yusup Y, Kamaruddin MA, Khan N (2021) A comprehensive review of crop yield prediction using machine learning approaches with special emphasis on palm oil yield prediction. IEEE Access 9:63406–63439

    Article  Google Scholar 

  10. Jiang D, Lin W, Raghavan N (2020) A novel framework for semiconductor manufacturing final test yield classification using machine learning techniques. IEEE Access 8:197885–197895

    Article  Google Scholar 

  11. Das B, Nair B, Reddy VK, Venkatesh P (2018Oct) Evaluation of multiple linear, neural network and penalised regression models for prediction of rice yield based on weather parameters for west coast of India. Int J Biometeorol 62(10):1809–1822

    Article  PubMed  Google Scholar 

  12. Shahhosseini M, Martinez-Feria RA, Hu G, Archontoulis SV (2019Dec 4) Maize yield and nitrate loss prediction with machine learning algorithms. Environ Res Lett 14(12):124026

    Article  ADS  Google Scholar 

  13. Pantazi XE, Moshou D, Alexandridis T, Whetton RL, Mouazen AM (2016Feb) Wheat yield prediction using machine learning and advanced sensing techniques. Comput Electron Agric 1(121):57–65

    Article  Google Scholar 

  14. Elavarasan D, Vincent PD (2021Nov) A reinforced random forest model for enhanced crop yield prediction by integrating agrarian parameters. J Ambient Intell Humaniz Comput 1:1–4

    Google Scholar 

  15. Zhang J, Feng F, Zhang QJ (2021) Rapid yield estimation of microwave passive components using model-order reduction based neuro-transfer function models. IEEE Microwave Wirel Compon Lett 31(4):333–336

    Article  Google Scholar 

  16. Jiang S, Zhang Z, Zhao H, Li J, Yang Y, Lu BL, Xia N (2021) When SMILES smiles, practicality judgment and yield prediction of chemical reaction via deep chemical language processing. IEEE Access 9:85071–85083

    Article  Google Scholar 

  17. Qiao M, He X, Cheng X, Li P, Luo H, Tian Z, Guo H (2021) Exploiting Hierarchical Features for Crop Yield Prediction Based on 3-D Convolutional Neural Networks and Multikernel Gaussian Process. IEEE J Select Top Appl Earth Observ Remote Sensing 14:4476–4489

    Article  ADS  Google Scholar 

  18. Yuan T, Bae SJ, Kuo Y (2020) Statistical models of overdispersed spatial defects for predicting the yield of integrated circuits. IEEE Trans Reliab 69(2):510–521

    Article  Google Scholar 

  19. Sun J, Lai Z, Di L, Sun Z, Tao J, Shen Y (2020) Multilevel deep learning network for county-level corn yield estimation in the us corn belt. IEEE J Selec Topics Appl Earth Observ Remote Sens 13:5048–5060

    Article  ADS  Google Scholar 

  20. Cui C, Liu K, Zhang Z (2020) Chance-constrained and yield-aware optimization of photonic ICs with non-Gaussian correlated process variations. IEEE Trans Comput Aided Des Integr Circuits Syst 39(12):4958–4970

    Article  Google Scholar 

  21. Coviello L, Cristoforetti M, Jurman G, Furlanello C (2020Jul 16) GBCNet: In-field grape berries counting for yield estimation by dilated CNNs. Appl Sci 10(14):4870

    Article  CAS  Google Scholar 

  22. Lin T, Zhong R, Wang Y, Xu J, Jiang H, Xu J, Ying Y, Rodriguez L, Ting KC, Li H (2020Feb 19) DeepCropNet: a deep spatial-temporal learning framework for county-level corn yield estimation. Environ Res Lett 15(3):034016

    Article  ADS  Google Scholar 

  23. Maimaitijiang M, Sagan V, Sidike P, Hartling S, Esposito F, Fritschi FB (2020Feb) Soybean yield prediction from UAV using multimodal data fusion and deep learning. Remote Sens Environ 1(237):111599

    Article  Google Scholar 

  24. Yoon D, Kim E, Choi I, Han SW, Yang S (2020) Prediction of voluntary motion using decomposition-and-ensemble framework with deep neural networks. IEEE Access 8:201555–201565

    Article  Google Scholar 

  25. Jiang D, Lin W, Raghavan N (2021) Semiconductor Manufacturing Final Test Yield Optimization and Wafer Acceptance Test Parameter Inverse Design Using Multi-Objective Optimization Algorithms. IEEE Access 9:137655–137666

    Article  Google Scholar 

  26. Arami A, Poulakakis-Daktylidis A, Tai YF, Burdet E (2019) Prediction of gait freezing in Parkinsonian patients: a binary classification augmented with time series prediction. IEEE Trans Neural Syst Rehabil Eng 27(9):1909–1919

    Article  PubMed  Google Scholar 

  27. Wu S, Yang J, Cao G, Qiu Y, Cheng G, Yao M, Dong J (2020) Elevating Prediction Performance for Mechanical Properties of Hot-Rolled Strips by Using Semi-Supervised Regression and Deep Learning. IEEE Access 8:134124–134136

    Article  Google Scholar 

  28. Jiang J, Xing F, Zeng X, Zou Q (2019) Investigating maize yield-related genes in multiple omics interaction network data. IEEE Trans Nanobiosci 19(1):142–151

    Article  Google Scholar 

  29. Anderson C, Vasudevan R, Johnson-Roberson M (2020) Off the beaten sidewalk: Pedestrian prediction in shared spaces for autonomous vehicles. IEEE Robot Automa Lett 5(4):6892–6899

    Article  Google Scholar 

  30. Luciani R, Laneve G, JahJah M (2019Jun 25) Agricultural monitoring, an automatic procedure for crop mapping and yield estimation: The great rift valley of Kenya case. IEEE J Selec Topics Appl Earth Observ Remote Sens 12(7):2196–2208

    Article  ADS  Google Scholar 

  31. Hans R, Kaur H (2020) Binary Multi-Verse Optimization (BMVO) Approaches for Feature Selection. International Journal Of Interactive Multimedia And Artificial Intelligence, 6(Special Issue on Soft Computing), pp. 91–106 https://doi.org/10.9781/ijimai.2019.07.004

  32. Wang X, Yuan Y, Mu X, Sun W, Song X (2019) Sensitivity of TBM’s Performance to Structural, Control and Geological Parameters Under Different Prediction Models. IEEE Access 7:19738–19751

    Article  Google Scholar 

  33. Goli A, Zare, HK, Tavakkoli-Moghaddam R, Sadeghieh A (2019). An Improved Artificial Intelligence Based on Gray Wolf Optimization and Cultural Algorithm to Predict Demand for Dairy Products: A Case Study. International Journal Of Interactive Multimedia And Artificial Intelligence, 5(Special Issue on Use Cases of Artificial Intelligence, Digital Marketing and Neuroscience), pp. 15–22 https://doi.org/10.9781/ijimai.2019.03

  34. Kaur A, Jain S, Goel S (2019) Sandpiper optimization algorithm: a novel approach for solving real-life engineering problems. Appl Intell 50(2):582–619

    Article  Google Scholar 

  35. Bairwa AK, Joshi S, Singh D (2021) Dingo optimizer: A nature-inspired metaheuristic approach for engineering problems. Mathematical Problems in Engineering

  36. Bao S, Cao C, Ni X, Xu M, Ju H, He Q, Zhou S (2017 ) Crop yield variation trend and distribution pattern in recent ten years. In: 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp. 6150–6153. IEEE

  37. Dataset1, from : https://www.kaggle.com/prasadkevin/crops-prediction-indian-dataset

  38. Dataset 2: from https://www.kaggle.com/prasadkevin/crops-prediction-indian-dataset

  39. Salehinejad H, Sankar S, Barfett J, Colak E, Valaee S (2017) Recent advances in recurrent neural networks. arXiv preprint arXiv:1801.01078

  40. Mohan P, Patil KK (2017) Crop production rate estimation using parallel layer regression with deep belief network. In2017 International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT)

  41. Graves A, Graves A (2012) Long short-term memory. Supervised sequence labelling with recurrent neural networks, pp. 37–45

  42. Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J, Chen T (2018May) Recent advances in convolutional neural networks. Pattern Recogn 1(77):354–377

    Article  ADS  Google Scholar 

  43. Jadhav AN, Gomathi N (2019Jul) DIGWO: Hybridization of dragonfly algorithm with improved grey wolf optimization algorithm for data clustering. Multimed Res 2(3):1–1

    Google Scholar 

  44. Ishwarya R, Nagapooja BN, Raghavi R (2022) CROP YIELD PREDICTION USING MACHINE LEARNING ALGORITHM. Int Res J Moderniza Eng Technol Sci 04(07)

  45. Ilyas QM, Ahmad M, Mehmood A (2023) Automated Estimation of Crop Yield Using Artificial Intelligence and Remote Sensing Technologies. Bioengineering 10(2):125

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Funding

No specific funding is received for this research.

Author information

Authors and Affiliations

Authors

Contributions

All authors have made substantial contributions to the conception and design, revising the manuscript, and the final approval of the version to be published. Additionally, it was agreed that each author would be responsible for all aspects of the work and ensure that any concerns about the truthfulness or integrity of any part of the work would be duly investigated and addressed.

Corresponding author

Correspondence to Venkata Rama Rao Kolipaka.

Ethics declarations

Conflict of Interest

The authors declare no conflict of interest.

Informed consent

Not Applicable.

Ethical approval

Not Applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kolipaka, V.R.R., Namburu, A. An automatic crop yield prediction framework designed with two-stage classifiers: a meta-heuristic approach. Multimed Tools Appl 83, 28969–28992 (2024). https://doi.org/10.1007/s11042-023-16612-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16612-2

Keywords

Navigation