Abstract
This paper presents a hybrid model to better predict Indian summer monsoon rainfall. The algorithm considers suitable techniques for processing dense datasets. The proposed three-step algorithm comprises closed itemset generation-based association rule mining for feature selection, cluster membership for dimensionality reduction, and simple logistic function for prediction. The application of predicting rainfall into flood, excess, normal, deficit, and drought based on 36 predictors consisting of land and ocean variables is presented. Results show good accuracy in the considered study period of 37years (1969–2005).
Similar content being viewed by others
References
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: 1993 ACM SIGMOD international conference on management of data - SIGMOD’93, pp 207–216
Angell JK (1981) Comparison of variations in atmospheric quantities with sea surface temperature variations in the equatorial pacific. Mon Weather Rev 109:230–243
Chakravarty SC (2011) Sea surface temperature (SST) and the Indian summer monsoon. InTech
Charlotte BV, Dhanya CT, Mathew B (2012) Equinoo: the entity and validity of this oscillation to Indian monsoon. Int J Eng Sci 1:45–54
Daellenbach H, McNickle D, Dye S (2012) Management science: decision-making through systems thinking, Palgrave Macmillan, chap Uncertainty, p 428
DelSole T, Shukla J (2002) Linear prediction of Indian monsoon rainfall. J Clim 15(24):3645–3658
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc 39:1–38
Dhanya CT, Nagesh Kumar D (2009) Data mining for evolution of association rules for droughts and floods in India using climate inputs. J Geophys Res:114
Dwivedi S, Pandey AC (2011) Forecasting the Indian summer monsoon intraseasonal oscillations using genetic algorithm and neural network. Geophys Res Lett:38
Francis PA, Gadgil S (2013) A note on new indices for the equatorial Indian ocean oscillation. J Earth Syst Sci 122(4):1005–1011
Gadgil S (2003) The Indian monsoon and its variability. Ann Rev Earth Planet Sci 31:429–467
Gago P, Bento C (1998) Principles of data mining and knowledge discovery: second European symposium, PKDD’98 nantes. Springer, Berlin, pp 19–27
Goswami P, Srividya P (1996) A neural network design for long range prediction of rainfall pattern. Curr Sci 70:447–457
Gowariker V, Thapliyal V, Sarker RP, Mandal GS, Sikka DR (1989) Parametric and power regression models: new approach to long range forecasting of monsoon rainfall in India. Mausam 40:115–122
Gowariker V, Thapliyal V, Kulshrestha SM, Mandal GS, Sen Roy N, Sikka DR (1991) A power regression model for long range forecast of southwest monsoon rainfall over India. Mausam 42:125–130
Guhathakurta P, Rajeevan M, Thapliyal V (1999) Long range forecasting Indian summer monsoon rainfall by a hybrid principal component neural network model. Meteorog Atmos Phys 71:255–266
Holcombe M, Paton R (1997) Information processing in cells and tissues. In: Proceedings of the international workshop on information processing in cells and tissues
Ihara C, Kushnir Y, Mark Cane A, Victor H (2006) Indian summer monsoon rainfall and its link with ENSO and Indian ocean climate indices. Int J Climatol 27:179–187
Iyengar RN, Raghu Kanth SGT (2005) Intrinsic mode functions and a strategy for forecasting Indian monsoon rainfall. Meteorog Atmos Phys 90:17–36
Khandekar ML, Neralla VR (1984) On the relationship between the sea surface temperatures in the equatorial pacific and the Indian monsoon rainfall. Geophys Res Lett 11:1137–1140
Kothawale RD, Munot AA, Borgaonkar PH (2008) Temperature variability over the Indian ocean and its relationship with Indian summer monsoon rainfall. Theor Appl Climatol 92(1):31–45
Krishna K, Soman M, Kumar K (1995) Seasonal forecasting of Indian summer monsoon rainfall: a review. Weather 50:449–467
Krishnamurti TN, Kishtawal CM (2000) A pronounced continental-scale diurnal mode of the Asian summer monsoon. Mon Weather Rev:462–473
Kumar KK, Soman MK, Roopa Kumar K (1995) Seasonal forecasting of Indian summer monsoon rainfall: a review. Weather:449–467
Lockyer N, Lockyer W (1904) The behavior of the short-period atmospheric pressure variation over the earth’s surface. Proc R Soc Lond:457–470
Mooley DA, Parthasarathy B (1984) Fluctuations in all-India summer monsoon rainfall during 1871–1978. Clim Chang 6:287–301
Niels L, Mark H, Eibe F (2005) Logistic model trees. Mach Learn 59(1):161–205
Ordonez C (2006) Association rule discovery with the train and test approach for heart disease prediction. IEEE Trans Inf Technol Biomed 10(2):334–343
Ordonez C, Zhao K (2011) Evaluating association rules and decision trees to predict multiple target attributes. Intell Data Anal 15(2):173–192
Parthasarathy B, Rupa kumar K, Sontakke NA (1990) Surface and upper air temperatures over Indian in relation to monsoon rainfall. Theor Appl Climatol 42:93–110
Parthasarathy B, Rupa Kumar K, Munot AA (1991) Evidence of secular variations in Indian monsoon rainfall-cerculation relationships. J Clim 4(9):927–938
Parthasarathy B, Rupa Kumar K, Munot AA (1992) Surface pressure and summer monsoon rainfall over India. Adv Atmos Sci 9(3):359–366
Prabhu A, Mahajan PN, Khaladkar RM, Chipade MD (2010) Role of Antarctic circumpolar wave in modulating the extremes of Indian summer monsoon rainfall. Geophys Res Lett 37(14)
Rajeevan M, Guhathakurta P, Thapliyal V (2000) New models for long range forecasts of summer monsoon rainfall over northwest and peninsular India. Meteorog Atmos Phys 73:211–225
Rajeevan M, Pai DS, Dikshit SK, Kelkar RR (2004) IMD’S new operational models for long range forecast of south-west monsoon rainfall over India and their verification for 2003. Curr Sci 86:422–431
Rajeevan M, Pai DS, Anil Kumar R (2005) New statistical models for long range forecasting of southwest monsoon rainfall over India. NCC research report no 1/2005. India Meteorological Department, Pune
Rajeevan M, Bhate J, Kale JD, Lal B (2006) High resolution daily gridded rainfall data for the Indian region: analysis of break and active monsoon spells. Curr Sci 91:296–306
Rajeevan M, Pai DS, Anil Kumar R, Lal B (2007) New statistical models for long-range forecasting of southwest monsoon rainfall over India. Clim Dyn 28(7):813–828
Rasmusson EM, Thomas Carpenter H (1982) Variations in tropical sea surface temperature and surface wind fields associated with the southern oscillation/El niño. Mon Weather Rev 110:354–384
Ravi S, Nanjundiah PA, Francis VM, Gadgil S (2013) Predicting the extremes of Indian summer monsoon rainfall with coupled ocean-atmosphere models. Curr Sci 104(10):1380–1393
Rayner NA, Parker DE, Horton EB, Folland CK, Alexander LV, Rowell DP, Kent EC, Kaplan A (2003) Global analyses of sst, sea ice and night marine air temperature since the late nineteenth century. J Geophys Res 108:4407
Satishkumar S, Kashid RM (2012) Prediction of monthly rainfall on homogeneous monsoon regions of India based on large scale circulation patterns using genetic programming. J Hydrol 454:26–41
Shukla J, Daniel AP (1983) The southern oscillation and long-range forecasting of the summer monsoon rainfall over India. Mon Weather Rev 111:1830–1837
Singh P, Borah B (2013) Indian summer monsoon rainfall prediction using artificial neural network. Stoch Env Res Risk A 27:1585–1599
Sperber KR, Palmer TN (1996) Interannual tropical rainfall variability in general circulation model simulations associated with the atmospheric model intercomparison project. J Clim 9:2727–2750
Sulochana G, Vinayachandran PN, Francis PA, Sidharata G (2004) Extremes of the Indian summer monsoon rainfall, ENSO and the equatorial Indian ocean oscillation. Geophys Res Lett:31
Sumner M, Frank E, Hall M (2005) Speeding up logistic model tree induction. In: 9th European conference on principles and practice of knowledge discovery in databases, pp 675–683
Taniguchi K, Rajan D, Koike T (2010) Erratum to: effect of the variation in the lower tropospheric temperature on the wind onset of the Indian summer monsoon. Meteorog Atmos Phys 106(3–4):227–227
Wilks DS (1995) Statistical methods in atmospheric sciences. Academic, Waltham
Xi J, Zhou L, Murtugudde R, Jiang L (2015) Impacts of intraseasonal sst anomalies on precipitation during Indian summer monsoon. J Clim 28:4561–4575
Xie J, Wu J, Qian Q (2009) Feature selection algorithm based on association rules mining method. In: Eighth IEEE/ACIS international conference on computer and information science, pp 357–362
Acknowledgments
We thank Prof. Ravi Nanjundiah, CAOS, IISc and Dr. Gayatri Vani Kanchibhotla, HPC-S&EA, CDAC, Pune for their valuable inputs given during the current work.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
H, V., Koolagudi, S.G. Long-range prediction of Indian summer monsoon rainfall using data mining and statistical approaches. Theor Appl Climatol 130, 19–33 (2017). https://doi.org/10.1007/s00704-016-1862-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-016-1862-2