Abstract
As many sensor networks are currently being deployed for environmental monitoring, there is a growing need to develop systems and applications for managing, processing and retrieving the massive amounts of data generated by those networks. In this research, a query answering system with pattern mining techniques is investigated specifically for marine sensor data. We consider three applications of pattern mining: similar pattern search, predictive query and query by clustering. In pattern mining for query answering, we adopt the dynamic time warping (DTW) method for similarity measurement. We also propose the use of a query relaxation approach that recommends that users change the parameters of a given query to obtain an answer. Finally, we show implementation results of pattern query answering in a marine sensor network deployed in the south-east of Tasmania, Australia. A pattern query answering system helps users access and discover knowledge in sensor data for decision-making purposes.
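To make the two central ideas of the abstract concrete, the following is a minimal Python sketch of DTW-based similar pattern search combined with a simple form of query relaxation. It is an illustration under stated assumptions, not the system described in the paper: the function names (`dtw_distance`, `similar_windows`, `relaxed_search`), the absolute-difference cost, the fixed-length sliding window and the rule of widening the distance threshold in fixed steps are all hypothetical choices made here, and the sample readings are invented.

```python
import math
from typing import List, Optional, Sequence, Tuple


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Textbook dynamic time warping distance with absolute-difference cost."""
    n, m = len(a), len(b)
    # cost[i][j] holds the cheapest cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # repeat b[j-1]
                                 cost[i][j - 1],      # repeat a[i-1]
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def similar_windows(series: Sequence[float], query: Sequence[float],
                    threshold: float) -> List[Tuple[int, float]]:
    """Return (start_index, distance) for every window within the threshold."""
    w = len(query)
    hits = []
    for start in range(len(series) - w + 1):
        dist = dtw_distance(series[start:start + w], query)
        if dist <= threshold:
            hits.append((start, dist))
    return hits


def relaxed_search(series: Sequence[float], query: Sequence[float],
                   threshold: float, step: float = 0.25,
                   max_threshold: float = 2.0
                   ) -> Tuple[Optional[float], List[Tuple[int, float]]]:
    """Hypothetical relaxation: widen the threshold until a match appears.

    Returns the threshold that produced matches (to be recommended to the
    user) and the matches, or (None, []) if max_threshold is exceeded.
    """
    t = threshold
    while t <= max_threshold:
        hits = similar_windows(series, query, t)
        if hits:
            return t, hits
        t += step
    return None, []


if __name__ == "__main__":
    # Invented temperature-like readings, for illustration only.
    readings = [10.0, 10.5, 11.0, 14.0, 15.0, 14.5]
    pattern = [14.2, 15.1, 14.4]
    print(relaxed_search(readings, pattern, threshold=0.0))
```

With a threshold of zero the query above has no exact match, so the sketch widens the threshold step by step and reports the first value at which a window qualifies. A recommendation of this kind is one plausible reading of "recommends that users change the parameters"; the relaxation strategy actually used in the paper may differ.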
Acknowledgements
The Tasmanian ICT Centre is jointly funded by the Australian Government through the Intelligent Island Program and CSIRO. The Intelligent Island Program is administered by the Tasmanian Department of Economic Development, Tourism and the Arts. This research was conducted as part of the CSIRO Wealth from Oceans National Research Flagship and the Sensors and Sensor Networks Transformational Capability Platform (SSN-TCP). We thank Aidan O’Mara for providing improved prediction using clustering.
Additional information
Communicated by: H. A. Babaie
About this article
Cite this article
Shahriar, M.S., Souza, P.d. & Timms, G. On retrieving patterns in environmental sensor data. Earth Sci Inform 5, 43–59 (2012). https://doi.org/10.1007/s12145-012-0095-x
DOI: https://doi.org/10.1007/s12145-012-0095-x