Abstract
Tornadoes, which are one of the most feared natural phenomena, present a significant challenge to forecasters who strive to provide adequate warnings of the imminent danger. Forecasters recognize the general environmental conditions within which a tornadic thunderstorm, called a supercell thunderstorm, will form. They also recognize a supercell thunderstorm with its rotating updraft, or mesocyclone, when it appears on radar. However, only a minority of supercell storms produce tornadoes. Although most tornadoes are warned in advance, the majority of the tornado warnings are false alarms. In this chapter, we discuss the development of novel spatiotemporal data mining techniques for discriminating between supercell storms that produce tornadoes and those that do not. To test the novel techniques, we initially applied them to numerical models having coarse 500 meter horizontal grid spacing that did not resolve tornadoes but that did resolve the parent mesocyclones.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Allen JF (1991) Time and time again: the many ways to represent time. Int J Intell Syst 6(4): 341–355
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Chiu B, Keogh E, Lonardi S (2003) Probabilistic discovery of time series motifs. In: In the 9th ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC, pp 493–498
Das G, Lin K, Mannila H, Renganathan G, Smyth P (1998) Rule discovery from time series. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, New York, pp 16–22
Donaldson Jr RJ, Dyer RM, Kraus MJ (1975) An objective evaluator of techniques for predicting severe weather events. In: Preprints: ninth conference on severe local storms, American Meteorological Society, Norman, pp 321–326
Dwyer K, Holte R (2007) Decision tree instability and active learning. In: ECML ’07: proceedings of the 18th European conference on machine learning, Warsaw. Springer-Verlag, Berlin, pp 128–139
Jensen D (2005) Proximity knowledge discovery system. http://kdl.cs.umass.edu/proximity
Johnson JT, MacKeen PL, Witt A, Mitchell ED, Stumpf GJ, Eilts MD, Thomas KW (1998) The storm cell identification and tracking algorithm: an enhanced WSR-88D algorithm. Weather Forecast 13(2):263–276
Keogh E, Lin J, Fu A (2005) HOT SAX: efficiently finding the most unusual time series subsequence. In: Proceedings of the 5th IEEE international conference on data mining (ICDM 2005), Houston, pp 226–233
Lin J, Keogh E, Lonardi S, Chiu B (2003) A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of the 8th ACM SIGMOD workshop on research issues in data mining and knowledge discovery, San Diego, pp 2–11
Lin J, Keogh E, Li W, Lonardi S (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 15(2):107–144
McGovern A, Jensen D (2008) Optimistic pruning for multiple instance learning. Pattern Recognit Lett 29(9):1252–1260
McGovern A, Rosendahl DH, Kruger A, Beaton MG, Brown RA, Droegemeier KK (2007) Anticipating the formation of tornadoes through data mining. In: Preprints of the fifth conference on artificial intelligence and its applications to environmental sciences at the American Meteorological Society annual meeting, American Meteorological Society, San Antonio, Paper 4.3A
McGovern A, Hiers N, Collier M, Gagne II DJ, Brown RA (2008) Spatiotemporal relational probability trees. In: Proceedings of the 2008 IEEE international conference on data mining, Pisa, pp 935–940
McGovern A, Supinie T, Gagne II DJ, Troutman N, Collier M, Brown RA, Basara J, Williams J (2010) Understanding severe weather processes through spatiotemporal relational random forests. In: Proceedings of the 2010 NASA conference on intelligent data understanding, Mountain View, pp 213–227
McGovern A, Gagne II DJ, Troutman N, Brown RA, Basara J, Williams J (2011a) Using spatiotemporal relational random forests to improve our understanding of severe weather processes. Stat Anal Data Min 4(4):407–429
McGovern A, Rosendahl DH, Brown RA, Droegemeier KK (2011b) Identifying predictive multi-dimensional time series motifs: an application to understanding severe weather. Data Min Knowl Discov 22(1):232–258
McGovern A, Troutman N, Brown RA, Williams JK, Abernethy J (2013) Enhanced spatiotemporal relational probability trees and forests. Data Min Knowl Discov 26(2):398–433
Minnen D, Isbell C, Essa I, Starner T (2007) Detecting subdimensional motifs: an efficient algorithm for generalized multivariate pattern discovery. In: Proceedings of the 2007 seventh IEEE international conference on data mining (ICDM ’07), Omaha, pp 601–606
Mueen A, Keogh E, Zhu Q, Cash S, Westover B (2009) Exact discovery of time series motifs. In: Proceedings of the SIAM international conference on data mining, Sparks, pp 473–484
Neville J, Jensen D (2004) Dependency networks for relational data. In: Proceedings of the fourth IEEE international conference on data mining, Brighton, pp 170–177
Neville J, Jensen D, Friedland L, Hay M (2003) Learning relational probability trees. In: Proceedings of the ninth ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp 625–630
Noda A, Niino H (2005) Genesis and structure of a major tornado in a numerically–simulated supercell storm: importance of vertical vorticity in a gust front. Sci Online Lett Atmos 1:5–8
NWS (2009) Service assessment, Mother’s Day weekend tornado in Oklahoma and Missouri, May 10, 2008. http://www.nws.noaa.gov/os/assessments/pdfs/mothers_day09.pdf
NWS (2011) Nws central region service assessment, Joplin, Missouri, tornado – May 22, 2011. http://www.nws.noaa.gov/os/assessments/pdfs/Joplin_tornado.pdf
Oates T, Cohen PR (1996) Searching for structure in multiple streams of data. In: Proceedings of the thirteenth international conference on machine learning. Morgan Kaufmann, Bari, pp 346–354
Pérez JM, Muguerza J, Arbelaitz O, Gurrutxaga I, Martìn JI (2005) Consolidated trees: classifiers with stable explanation. A model to achieve the desired stability in explanation. In: Lecture notes in computer science, Springer Berlin, Heidelberg, pp 99–17
Quinlan JR (1993) C4.5: programs for Machine Learning. Morgan Kaufmann, San Francisco
Rosendahl DH (2008) Identifying precursors to strong low-level rotation within numerically simulated supercell thunderstorms: a data mining approach. Master’s thesis, School of Meteorology, University of Oklahoma
Schaefer JT (1990) The critical success index as an indicator of warning skill. Weather Forecast 5(4):570–575
Shieh J, Keogh E (2009) iSAX: indexing and mining terabyte sized time series. In: Proceedings of the IEEE international conference on data mining
Simmons KM, Sutter D (2011) Economic and societal impacts of tornadoes. American Meteorological Society, Boston
Supinie T, McGovern A, Williams J, Abernethy J (2009) Spatiotemporal relational random forests. In: Proceedings of the IEEE international conference on data mining (ICDM) workshop on spatiotemporal data mining.p electronically published
Vahdatpour A, Amini N, Sarrafzadeh M (2009) Toward unsupervised activity discovery using multi-dimensional motif detection in time series. In: Proceedings of the 21st international joint conference on artificial intelligence (IJCAI’09), Pasadena, pp 1261–1266
Webb GI (1995) OPUS: an efficient admissible algorithm for unordered search. J Artif Intell Res 3:431–465
Wicker LJ, Wilhelmson RB (1995) Simulation and analysis of tornado development and decay within a three–dimensional supercell thunderstorm. J Atmos Sci 52(15):2675–2703
Xue M, Droegemeier KK, Wong V (2000) The Advanced Regional Prediction System (ARPS) - a multiscale nonhydrostatic atmospheric simulation and prediction model. Part I: model dynamics and verification. Meteorol Atmos Phys 75:161–193
Xue M, Droegemeier KK, Wong V, Shapiro A, Brewster K, Carr F, Weber D, Liu Y, Wang D (2001) The Advanced Regional Prediction System (ARPS) – a multiscale nonhydrostatic atmospheric simulation and prediction tool. Part II: model physics and applications. Meteorol Atmos Phys 76:143–165
Xue M, Wang D, Gao J, Brewster K, Droegemeier KK (2003) The Advanced Regional Prediction System (ARPS), storm-scale numerical weather prediction and data assimilation. Meteorol Atmos Phys 82:139–170
Xue M, Droegemeier KK, Weber D (2007) Numerical prediction of high-impact local weather: a driver for petascale computing. In: Petascale computing: algorithms and applications. Chapman and Hall/CRC, Boca Raton, chap 18, pp 103–125
Zaki MJ (2001) Spade: an efficient algorithm for mining frequent sequences. Mach Learn 42(1/2):31–60. special issue on unsupervised learning
Acknowledgements
This material is based upon work supported by the National Science Foundation under IIS/CAREER/0746816 and corresponding REU Supplements IIS/0840956, 0938138, 1036023, 1129292, and the NSF ERC Center for Collaborative Adaptive Sensing of the Atmosphere (CASA, NSF ERC 0313747).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media New York
About this chapter
Cite this chapter
McGovern, A., Rosendahl, D.H., Brown, R.A. (2014). Toward Understanding Tornado Formation Through Spatiotemporal Data Mining. In: Cervone, G., Lin, J., Waters, N. (eds) Data Mining for Geoinformatics. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-7669-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4614-7669-6_2
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-7668-9
Online ISBN: 978-1-4614-7669-6
eBook Packages: Computer ScienceComputer Science (R0)