Abstract
The weather forecasting process is used to predict the future atmospheric condition of a specified location. The evolution of the big data era gives the chances to significantly increase the prediction accuracy of weather conditions. In this paper, a deep learning-based stacked sparse autoencoder (DSSAE) has been proposed for predicting the weather condition of a particular area. This model requires a pre-processing approach to obtain essential data from big weather data and increase the prediction model’s speed. For this, the principal component analysis (PCA) is utilized to reduce dimensionality and extraction of features with more significant variance. Also, it integrates the feature selection algorithm based on Binary Butterfly Optimization Algorithm (BBOA) along with a deep stack autoencoder to improve prediction accuracy. The proposed model is validated using the weather data taken from the division of weather underground for short term and long term weather prediction. The simulation consequences illustrate that the proposed model overtakes the existing models in terms of computation time, accuracy and error rate.
Similar content being viewed by others
References
Aksu D, Üstebay S, Aydin MA, Atmaca T (2018) Intrusion detection with comparative analysis of supervised learning techniques and fisher score feature selection algorithm. In: International symposium on computer and information sciences. Springer, Cham, pp 141–149
Anjali T, Chandini K, Anoop K, Lajish VL (2019) Temperature prediction using machine learning approaches. In International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT), 1, pp. 1264-1268. IEEE
Arora S, Anand P (2019) Binary butterfly optimization approaches for feature selection. Expert Syst Appl 116:147–160
Bagirov AM, Mahmood A (2018) A comparative assessment of models to predict monthly rainfall in Australia. Water Resour Manag 32(5):1777–1794
Bayatvarkeshi M, Mohammadi K, Kisi O, Fasihi R (2020) A new wavelet conjunction approach for estimation of relative humidity: wavelet principal component analysis combined with ANN. Neural Comput & Applic 32(9):4989–5000
Booz J, Yu W, Xu G, Griffith D, Golmie N (2019) A deep learning-based weather forecast system for data volume and Recency analysis. In International Conference on Computing, Networking, and Communications (ICNC), IEEE, 697–701
Buszta A, Mazurkiewicz J (2015) Climate changes prediction system based on weather big data visualisation. DepCoS-RELCOMEX. Advances in Intelligent Systems and Computing, 365, Springer.
Dalto M, Matusko J, Vasak M (2015) Deep neural networks for ultra-short-term wind forecasting. In Proceedings of the IEEE International Conference on Industrial Technology. 1657–1663.
Dargan S, Kumar M, Ayyagari MR, Kumar G. (2019) A survey of deep learning and its applications: a new paradigm to machine learning. Arch Computational Methods Eng
Deo RC, Şahin M (2015) Application of the artificial neural network model for prediction of monthly standardized precipitation and evapotranspiration index using hydrometeorological parameters and climate indices in eastern Australia. Atmos Res 161–162:65–81
Deo RC, Şahin M (2015) Application of the extreme learning machine algorithm to predict monthly effective drought index in eastern Australia. Atmos Res 153:512–525
Doreswamy GI, Manjunatha BR (2018) Multi-label Classification of Big NCDC Weather Data Using Deep Learning Model.ICSCS 2018. In: Communications in Computer and Information Science,837. Springer, Singapore
Haidar A, Verma B (2018) Monthly rainfall forecasting using one-dimensional deep convolutional neural network. IEEE Access 6:69053–69063
Hardwinarto S, Aipassa M (2015) Rainfall monthly prediction based on artificial neural network: a case study in Tenggarong Station, East Kalimantan-Indonesia. Procedia Comput Sci 59:142–151
Hassani H, Silva ES (2015) Forecasting with big data: a review. Ann Data Sci 2:5–19. https://doi.org/10.1007/s40745-015-0029-9.6
Hegazy AE, Makhlouf MA, El-Tawel GS (2020) Improved salp swarm algorithm for feature selection. J King Saud Univ-Comput Inform Sci 32(3):335–344
Holmstrom M, Liu D, Vo C (2016) Machine learning applied to weather forecasting.
Huda RK, Haider Banka (2019) Efficient feature selection and classification algorithm based on PSO and rough sets. Neural Comput & Applic 31(8):4287–4303
Iraji MS (2019) Deep stacked sparse auto-encoders for prediction of post-operative survival expectancy in thoracic lung cancer surgery. J Appl Biomed 17:75–75. https://doi.org/10.32725/jab.2018.007
Ismail KA, Majid MA, Zain JM, Bakar NAA (2016) Big Data prediction framework for weather Temperature based on MapReduce algorithm. In 2016 IEEE Conference on Open Systems (ICOS), IEEE, pp 13–17
Jiang S, Chin K-S, Wang L, Gang Q, Tsui KL (2017) Modified genetic algorithm-based feature selection combined with pre-trained deep neural network for demand forecasting in outpatient department. Expert Syst Appl 82:216–230
Jiang L, Ge Z, Song Z (2017) Semi-supervised fault classification based on dynamic sparse stacked auto-encoders model. Chemom Intell Lab Syst 168:72–83
Karunakaran V, Iwin Joseph S, Teja R, Suganthi M, Rajasekar V (2019) A wrapper based feature selection approach using bees algorithm for extreme rainfall prediction via weather pattern recognition through svm classifier. Intl J Civil Eng Technol (IJCIET) 10.
Khalili N, Khodashenas SR, Davary K, Baygi MM, Karimaldini F (2016) Prediction of rainfall using artificial neural networks for synoptic station of Mashhad: a case study. Arab J Geosci 9(13):624
Kourou K, Exarchos TP (2015) Machine learning applications in cancer prognosis and prediction. Comput Struct biotechnol J 3:8–17
Larraondo PR et al. (2017) Automating weather forecasts based on convolutional networks.
Lefkovits S, Lefkovits L (2017) Gabor feature selection based on information gain. Procedia Eng 181:892–898
Li J, Liu H (2016) Challenges of feature selection for big data analytics. IEEE Intell Syst 32:9–15. https://doi.org/10.1109/MIS.2017.38
Liu H, Chen C (2019) Multi-objective data-ensemble wind speed forecasting model with stacked sparse autoencoder and adaptive decomposition-based error correction. Appl Energy 254:113686
Liu JNK, Hu Y, You JJ, Chan PW (2014) Deep neural network-based feature representation for weather forecasting. In Proceedings of the International Conference on Artificial Intelligence (ICAI)
Liu JN, Hu Y, He Y, Chan PW, Lai L (2015) Deep neural network modeling for big data weather forecasting. In Information Granularity, Big Data, and Computational Intelligence 389–408.
Mani K, Kalpana P (2016) An efficient feature selection based on bayes theorem, self-information and sequential forward selection. Intl J Inform Eng Electronic Business 8(6):46–54
Mcgovern A, Elmore K, Gagne D, Haupt S, Karstens C, Lagerquist R, Smith T, Williams J (2017) Using artificial intelligence to improve real-time decision-making for high-impact weather. Bull Am Meteorol Soc 98:2073–2090. https://doi.org/10.1175/BAMS-D-16-0123.1
Mehrkanoon S (2019) Deep shared representation, learning for weather elements forecasting. Knowl-Based Syst 179:120–128
Mekanik F, Imteaz MA, Talei A (2016) Seasonal rainfall forecasting by adaptive network-based fuzzy inference system (ANFIS) using large scale climate signals. Climate Dyn 46(9–10):3097–3111
Moradi E, Pepen A (2015) Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects. Neuroimages 104:398–412
Najafabadi MM, Villanustre F (2015) Deep learning applications and challenges in big data analytics. J Big Data 2:1
Feiping Nie, Yizhen Huang, Xiaoqian Wang, and Heng Huang. (2014) New primal SVM solver with the linear computational cost for big data classifications. In Proceedings of the 31st International Conference on International Conference on Machine Learning, 32.
Olshannikova E, Visualizing AO (2015) Big data with augmented and virtual reality: challenges and research agenda. J Big Data 2(1):22
Palma-Mendoza R-J, Rodriguez D, De-Marcos L (2018) Distributed ReliefF-based feature selection in spark. Knowl Inf Syst 57(1):1–20
Palvanov A, Cho YI. (2019). VisNet: deep convolutional neural networks for forecasting atmospheric visibility. Sensors (Basel)DOI: https://doi.org/10.3390/s19061343, 19.
Patel J, Shah S, Thakkar P (2015) Predicting stock market index using a fusion of machine learning techniques. Expert Syst Appl 42:2162–2172
Poornima S, Pushpalatha M (2019) Prediction of rainfall using intensified lSTM based recurrent neural network with weighted linear units. Atmosphere 10(11):668
Qi G, Zhu Z, Erqinhu K (2018) Fault-diagnosis for reciprocating compressors using big data and machine learning. Simul Model Pract Theory 80:104–127
Roesch I, Günther T (2019) Visualization of neural network predictions for weather forecasting. Comput Graphics Forum 38:209–220
Rolnik D et al (2019) Tackling Climate Change with Machine Learning. arXiv:1906.05433
Rong M, Gong D, Gao XZ (2019) Feature Selection And Its Use In Big Data: Challenges, Methods, and Trends. IEEE Access,1–1. https://doi.org/10.1109/ACCESS.2019.2894366.
Saha M, Santara A, Mitra P, Chakraborty A, Nanjundiah RS (2020) Prediction of the Indian summer monsoon using a stacked autoencoder and ensemble regression model. Int J Forecast
Singh K, Guntuku SC (2014) Big data analytics framework for peer-to-peer botnet detection using random forests. Inf Sci 278:488–497
Sivarajah U, Kamal MM, Irani Z, Weerakkody V (2017) Critical analysis of big data challenges and analytical methods. J Bus Res 70:263–286
Tang J, Liu J. (2016) Visualizing large-scale and high-dimensional data. In Proceedings of the 25th International Conference on World Wide Web, 287–297.
Thaseen IS, Kumar CA (2017) Intrusion detection model using fusion of chi-square feature selection and multi class SVM. J King Saud Univ-Comput Inform Sci 29(4):462–472
Vathsala H, Koolagudi SG (2017) Prediction model for peninsular Indian summer monsoon rainfall using data mining and statistical approaches. Comput Geosci 98:55–63
Voyant C, Notton G, Kalogirou S, Nivet M-L, Paoli C, Motte F, Fouilloy A (2017) Machine learning methods for solar radiation forecasting: a review. Renew Energy 105:569–582
Wan C, Xu Z, Pinson P (2014) Probabilistic forecasting of wind power generation using extreme learning machine. IEEE Trans Power Syst 29(3):1033–1044
Wang X-W, Nie D, Bao-Liang L (2014) Emotional state classification from EEG data using machine learning approach. Neurocomputing 129:94–106
Wu W-C, Hung S-H (2014) DroidDolphin: a dynamic Android malware detection framework using big data and machine learning. In Proceedings of the 2014 Conference on research in adaptive and convergent systems, ACM, 247–252
Xin J, Wang Z, Luxuan Q, Guoren (2015) Wang elastic extreme learning machine for big data classification. Neurocomputing 149:464–471
Yonekura K, Hattori H, Suzuki T (2018) Short-term local weather forecast using dense weather station by the deep neural network. In IEEE International Conference on Big Data (Big Data), IEEE, 1683–1690
Zhai J, Zang L (2018) Ensemble dropout extreme learning machine via fuzzy integral for data classification. Neurocomputing 275:1043–1052
Zhang L, Ma W, Zhang D (2016) Stacked sparse autoencoder in PolSAR data classification using local spatial information. IEEE Geosci Remote Sens Lett 13:1–5. https://doi.org/10.1109/LGRS.2016.2586109
Acknowledgments
This work is supported by the Indian Institute of Technology (Indian School of Mines), Dhanbad, Govt. of India. The authors wish to express their gratitude and heartiest thanks to the Department of Computer Science & Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, India, for providing their continuous research support.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors do not have a conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
G, M.R., Dharavath, R. DSSAE-BBOA: deep learning-based weather big data analysis and visualization. Multimed Tools Appl 80, 27471–27493 (2021). https://doi.org/10.1007/s11042-021-11059-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11059-9