Abstract
Interest in semiarid climate forecasting has prominently grown due to risks associated with above average levels of precipitation amount. Longer-lead forecasts in semiarid watersheds are difficult to make due to short-term extremes and data scarcity. The current research is a new application of classification and regression trees (CART) model, which is rule-based algorithm, for prediction of the precipitation over a highly complex semiarid climate system using climate signals. We also aimed to compare the accuracy of the CART model with two most commonly applied models including time series modeling (ARIMA), and adaptive neuro-fuzzy inference system (ANFIS) for prediction of the precipitation. Various combinations of large-scale climate signals were considered as inputs. The results indicated that the CART model had a better results (with Nash–Sutcliffe efficiency, NSE > 0.75) compared to the ANFIS and ARIMA in forecasting precipitation. Also, the results demonstrated that the ANFIS method can predict the precipitation values more accurately than the time series model based on various performance criteria. Further, fall forecasts ranked “very good” for the CART method, while the ANFIS and the time series model approximately indicated “satisfactory” and “unsatisfactory” performances for all stations, respectively. The forecasts from the CART approach can be helpful and critical for decision makers when precipitation forecast heralds a prolonged drought or flash flood.
Similar content being viewed by others
References
Afshin S, Fahmi H, Alizadeh A, Sedghi H, Kaveh F (2011) Long term rainfall forecasting by integrated artificial neural network-fuzzy logic-wavelet model in Karoon basin. Sci Res Essays 6(6):1200–1208. https://doi.org/10.5897/SRE10.448
Akaike H (1998) Information theory and an extension of the maximum likelihood principle. In: Selected papers of Hirotugu Akaike. Springer, New York, pp 199–213. https://doi.org/10.1007/978-1-4612-1694-0_15
Azadi S, Sepaskhah AR (2012) Annual precipitation forecast for west, southwest, and south provinces of Iran using artificial neural networks. Theoret Appl Climatol 109:175–189. https://doi.org/10.1007/s00704-011-0575-9
Berg N, Hall A, Capps SB, Hughes M (2013) El Nino-Southern Oscillation impacts on winter winds over Southern California. Clim Dyn 40(1–2):109–121. https://doi.org/10.1007/s00382-012-1461-6
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth and Brooks/Cole, Monterey
Choubin B, Malekian A (2017) Combined gamma and M-test-based ANN and ARIMA models for groundwater fluctuation forecasting in semiarid regions. Environ Earth Sci 76(15):538. https://doi.org/10.1007/s12665-017-6870-8
Choubin B, Khalighi-Sigaroodi S, Malekian A, Ahmad S, Attarod P (2014) Drought forecasting in a semi-arid watershed using climate signals: a neuro-fuzzy modeling approach. J Mt Sci 11(6):1593–1605. https://doi.org/10.1007/s11629-014-3020-6
Choubin B, Khalighi-Sigaroodi S, Malekian A, Kişi Ö (2016a) Multiple linear regression, multi-layer perceptron network and adaptive neuro-fuzzy inference system for forecasting precipitation based on large-scale climate signals. Hydrol Sci J 61(6):1001–1009. https://doi.org/10.1080/02626667.2014.966721
Choubin B, Malekian A, Golshan M (2016b) Application of several data-driven techniques to predict a standardized precipitation index. Atmósfera 29(2):121–128. https://doi.org/10.20937/ATM.2016.29.02.02
Choubin B, Solaimani K, Habibnejad Roshan M, Malekian A (2017a) Watershed classification by remote sensing indices: a fuzzy c-means clustering approach. J Mt Sci. https://doi.org/10.1007/s11629-017-4357-4
Choubin B, Malekian A, Samadi S, Khalighi-Sigaroodi S, Sajedi-Hosseini F (2017b) An ensemble forecast of semi-arid rainfall using large-scale climate predictors. Meteorol Appl. https://doi.org/10.1002/met.1635
Choubin B, Darabi H, Rahmati O, Sajedi-Hosseini F, Kløve B (2018) River suspended sediment modelling using the CART model: a comparative study of machine learning techniques. Sci Total Environ 615:272–281. https://doi.org/10.1016/j.scitotenv.2017.09.293
De'ath AG, Fabricius KE (2000) Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology 81:3178–3192
El-Shafie A, Jaafer O, Seyed A (2011) Adaptive neuro-fuzzy inference system based model for rainfall forecasting in Klang River, Malaysia. Int J Phys Sci 6(12):2875–2888. https://doi.org/10.5897/IJPS11.515
Fallah-Ghalhary GA, Habibi-Nokhandan M, Mousavi-Baygi M, Khoshhal J, Shaemi Barzoki A (2010) Spring rainfall prediction based on remote linkage controlling using adaptive neuro-fuzzy inference system (ANFIS). Theoret Appl Climatol 101:217–233. https://doi.org/10.1007/s00704-009-0194-x
Farokhnia A, Morid S, Byun HR (2011) Application of global SST and SLP data for drought forecasting on Tehran plain using data mining and ANFIS techniques. Theoret Appl Climatol 104:71–81. https://doi.org/10.1007/s00704-010-0317-4
Gamiz-Fortis SR, Esteban-Parra MJ, Trigo RM, Castro-Diez Y (2010) Potential predictability of an Iberian river flow based on its relationship with previous winter global SST. J Hydrol 385:143–149. https://doi.org/10.1016/j.jhydrol.2010.02.010
Gaughan AE, Waylen PR (2012) Spatial and temporal precipitation variability in the Okavangoe–Kwandoe–Zambezi catchment, southern Africa. J Arid Environ 82:19–30. https://doi.org/10.1016/j.jaridenv.2012.02.007
Hurvich CM, Tsai C-L (1989) Regression and time series model selection in small samples. Biometrika 76:297–307. https://doi.org/10.1093/biomet/76.2.297
Jang JSR (1993) ANFIS: adaptive network-based fuzzy inference systems. IEEE Trans Syst Man Cybern 23(3):665–685. https://doi.org/10.1109/21.256541
Khalili A (1997) Integrated water plan of Iran. Vol. 4: meteorological studies, ministry of energy, Iran. Lecha, L. and P. Shackleford, 1997. Climate services for tourism and recreation. WMO Bull 46:46–47
Kisi O, Haktanir T, Ardiclioglu M, Ozturk O, Yalcin E, Uludag S (2009) Adaptive neuro-fuzzy computing technique for suspended sediment estimation. Adv Eng Softw 40(6):438–444. https://doi.org/10.1016/j.advengsoft.2008.06.004
Kurtulus B, Razack M (2010) Modeling daily discharge responses of a large karstic aquifer using soft computing methods: artificial neural network and neuro-fuzzy. J Hydrol 381(1–2):101–111. https://doi.org/10.1016/j.jhydrol.2009.11.029
Lemons J (2003) Conserving biodiversity in arid regions: best practices in developing nations. Springer, New York
Loh WY (2011) Classification and regression trees. Wiley Interdiscip Rev Data Min Knowl Discov 1:14–23. https://doi.org/10.1002/widm.8
Moriasi DN, Arnold JG, Van Liew MW, Binger RL, Harmel RD, Veith T (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50(3):885–900. https://doi.org/10.13031/2013.23153
Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models: part 1-A discussion of principles. J Hydrol 10(3):282–290. https://doi.org/10.1016/0022-1694(70)90255-6
Nazemosadat MJ, Cordey I (2000) On the relationship between ENSO and autumn rainfall in Iran. Int J Climatol 20(1):47–61. https://doi.org/10.1002/(sici)1097-0088(200001)20:1<47::aid-joc461>3.0.co;2-p
Rezaeian-Zadeh M, Tabari H, Abghari H (2012) Prediction of monthly discharge volume by different artificial neural network algorithms in semi-arid regions. Arab J Geosci 6(7):2529–2537. https://doi.org/10.1007/s12517-011-0517-y
Ruigar H, Golian S (2016) Prediction of precipitation in Golestan dam watershed using climate signals. Theoret Appl Climatol 123(3–4):671–682
Samadi S, Wilson CA, Moradkhani H (2013) Uncertainty analysis of statistical downscaling models using Hadley Centre Coupled Model. Theoret Appl Climatol. https://doi.org/10.1007/s00704-013-0844-x
Sanikhani H, Kisi O (2012) River flow estimation and forecasting by using two different adaptive neuro-fuzzy approaches. Water Resour Manag 26(6):1715–1729. https://doi.org/10.1007/s11269-012-9982-7
Seera M, Lim CP, Ishak D, Singh H (2012) Fault detection and diagnosis of induction motors using motor current signature analysis and a hybrid FMM–CART model. IEEE Trans Neural Netw Learn Syst 23(1):97–108
Sen P, Roy M, Pal P (2016) Application of ARIMA for forecasting energy consumption and GHG emission: a case study of an Indian pig iron manufacturing organization. Energy 116:1031–1038
Sharifi F, Samadi SZ, Wilson CAME (2012) Causes and consequences of recent floods in the Golestan catchments and Caprecipitationan Sea regions of Iran. Nat Hazards 61(2):533–550. https://doi.org/10.1007/s11069-011-9934-1
Sigaroodi SK, Chen Q, Ebrahimi S, Nazari A, Choobin B (2014) Long-term precipitation forecast for drought relief using atmospheric circulation factors: a study on the Maharloo Basin in Iran. Hydrol Earth Syst Sci 18(5):1995–2006. https://doi.org/10.5194/hess-18-1995-2014
Singh J, Knapp HV, Arnold JG, Demissie M (2005) Hydrologic modeling of the Iroquois River watershed using HSPF and SWAT. J Am Water Resour Assoc 41(2):343–360. https://doi.org/10.1111/j.1752-1688.2005.tb03740.x
Singh R, Wagener T, Crane R, Mann ME, Ning L (2014) A vulnerability driven approach to identify adverse climate and land use change combinations for critical hydrologic indicator thresholds: application to a watershed in Pennsylvania, USA. Water Resour Res 50:3409–3427. https://doi.org/10.1002/2013WR014988
Sutton CD (2005) Classification and regression trees, bagging, and boosting. Handb Stat 24:303–329. https://doi.org/10.1016/S0169-7161(04)24011-1
Timofeev R (2004) Classification and regression trees (CART) theory and applications. In: Master Thesis. Center of Applied Statistics and Economics, Humboldt University, Berlin
Wei M, Bai B, Sung AH, Liu Q, Wang J, Cather ME (2007) Predicting injection profiles using ANFIS. Inf Sci 177(20):4445–4461. https://doi.org/10.1016/j.ins.2007.03.021
Yatheendradas S, Wagener T, Gupta H, Unkrich C, Goodrich D, Schaffner M, Stewart A (2008) Understanding uncertainty in distributed flash flood forecasting for semiarid regions. Water Resour Res 44(5):W05S19. https://doi.org/10.1029/2007wr005940
Acknowledgements
This study was partially funded by University of Tehran (grant number 7401001/1/4).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Choubin, B., Zehtabian, G., Azareh, A. et al. Precipitation forecasting using classification and regression trees (CART) model: a comparative study of different approaches. Environ Earth Sci 77, 314 (2018). https://doi.org/10.1007/s12665-018-7498-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12665-018-7498-z