Efficiently Mining Constrained Subsequence Patterns

Albarrak, Abdullah; Al-Maskari, Sanad; Ibrahim, Ibrahim A.; Almars, Abdulqader M.

doi:10.1007/978-3-030-05090-0_1

Abdullah Albarrak¹⁶,
Sanad Al-Maskari¹⁷,
Ibrahim A. Ibrahim^18,19 &
…
Abdulqader M. Almars¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11323))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

1459 Accesses

Abstract

Big time series data are generated daily by various application domains such as environment monitoring, internet of things, health care, industry and science. Mining this massive data is a very challenging task because conventional data mining algorithms are unable to scale effectively with massive time series data. Moreover, applying a global classification approach to a highly similar and noisy data will hinder the classification performance. Therefore, utilizing constrained subsequence patterns in data mining applications increases the efficiency, accuracy, and could provide useful insight into the data.

To address the above mentioned limitations, we propose an efficient subsequence processing technique with preferences constraints. Then, we introduce a sub-patterns analysis for time series data. The sub-pattern analysis objective is to maximize the interclass separability using a localization approach. Furthermore, we make use of the deviation from a correlation constraint as an objective to minimize in our problem, and we include users preferences as an objective to maximize in proportion to users’ preferred time intervals. We experimentally validate the efficiency and effectiveness of our proposed algorithm using real data to demonstrate its superiority and efficiency when compared to recently proposed correlation-based subsequence search algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Accelerating pattern-based time series classification: a linear time and space string mining approach

Article 12 July 2019

Towards Efficiently Mining Frequent Interval-Based Sequential Patterns in Time Series Databases

Parallel Variable-Length Motif Discovery in Time Series Using Subsequences Correlation

References

Al-Maskari, S., Bélisle, E., Li, X., Le Digabel, S., Nawahda, A., Zhong, J.: Classification with quantification for air quality monitoring. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J.Z., Wang, R. (eds.) PAKDD 2016. LNCS (LNAI), vol. 9651, pp. 578–590. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31753-3_46
Chapter Google Scholar
Al-Maskari, S., Guo, W., Zhao, X.: Biologically inspired pattern recognition for e-nose sensors. In: Li, J., Li, X., Wang, S., Li, J., Sheng, Q.Z. (eds.) ADMA 2016. LNCS, vol. 10086, pp. 142–155. Springer International Publishing, Cham (2016). https://doi.org/10.1007/978-3-319-49586-6_10
Chapter Google Scholar
Al-Maskari, S., Ibrahim, I.A., Li, X., Abusham, E., Almars, A.: Feature extraction for smart sensing using multi-perspectives transformation. In: Wang, J., Cong, G., Chen, J., Qi, J. (eds.) ADC 2018. LNCS, vol. 10837, pp. 236–248. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-92013-9_19
Chapter Google Scholar
Al-Maskari, S., Li, X., Liu, Q.: An effective approach to handling noise and drift in electronic noses. In: Wang, H., Sharaf, M.A. (eds.) ADC 2014. LNCS, vol. 8506, pp. 223–230. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08608-8_21
Chapter Google Scholar
Fu, T.C.: A review on time series data mining. Eng. Appl. Artif. Intell. 24(1), 164–181 (2011)
Article Google Scholar
Gavrilov, M., Anguelov, D., Indyk, P., Motwani, R.: Mining the stock market (extended abstract): which measure is best? In: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 20–23 August 2000, Boston, MA, USA, pp. 487–496 (2000)
Google Scholar
Ghazavi, S.N., Liao, T.W.: Medical data mining by fuzzy modeling with selected features. Artif. Intell. Med. 43(3), 195–206 (2008)
Article Google Scholar
Ibrahim, I.A., Albarrak, A.M., Li, X.: Constrained recommendations for query visualizations. Knowl. Inf. Syst. 51(2), 499–529 (2017)
Article Google Scholar
Keogh, E.J., Kasetty, S.: On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min. Knowl. Discov. 7(4), 349–371 (2003)
Article MathSciNet Google Scholar
Li, Y., U, L.H., Yiu, M.L., Gong, Z.: Discovering longest-lasting correlation in sequence databases. PVLDB 6(14), 1666–1677 (2013)
Article Google Scholar
Mueen, A., Hamooni, H., Estrada, T.: Time series join on subsequence correlation. In: 2014 IEEE International Conference on Data Mining, ICDM 2014, 14–17 December 2014, Shenzhen, China, pp. 450–459 (2014)
Google Scholar
Mueen, A., Nath, S., Liu, J.: Fast approximate correlation for massive time-series data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2010, 6–10 June 2010, Indianapolis, Indiana, USA, pp. 171–182 (2010)
Google Scholar
Raghupathi, W., Raghupathi, V.: Big data analytics in healthcare: promise and potential. Health Inf. Sci. Syst. 2(1), 1 (2014)
Article Google Scholar
Rakthanmanon, T., et al.: Searching and mining trillions of time series subsequences under dynamic time warping. In: The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, 12–16 August 2012, Beijing, China, pp. 262–270 (2012)
Google Scholar
Sakurai, Y., Papadimitriou, S., Faloutsos, C.: BRAID: stream mining through group lag correlations. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, 14–16 June 2005, Baltimore, Maryland, USA, pp. 599–610 (2005)
Google Scholar
Utomo, C., Li, X., Wang, S.: Classification based on compressive multivariate time series. In: Cheema, M.A., Zhang, W., Chang, L. (eds.) ADC 2016. LNCS, vol. 9877, pp. 204–214. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46922-5_16
Chapter Google Scholar
Nahar, V., Al-Maskari, S., Li, X., Pang, C.: Semi-supervised learning for cyberbullying detection in social networks. In: Wang, H., Sharaf, M.A. (eds.) ADC 2014. LNCS, vol. 8506, pp. 160–171. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08608-8_14
Chapter Google Scholar
Zhu, Y., Shasha, D.: Statstream: statistical monitoring of thousands of data streams in real time. In: Proceedings of 28th International Conference on Very Large Data Bases, VLDB 2002, 20–23 August 2002, Hong Kong, China, pp. 358–369 (2002)
Google Scholar

Download references

Acknowledgments

We would like to thank Lemma solutions (www.lemma.com.au) for their help during the production of this paper.

Author information

Authors and Affiliations

Al Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
Abdullah Albarrak
Sohar University, Sohar, Oman
Sanad Al-Maskari
University of Queensland, Brisbane, Australia
Ibrahim A. Ibrahim & Abdulqader M. Almars
Minia University, Minya, Egypt
Ibrahim A. Ibrahim

Authors

Abdullah Albarrak
View author publications
You can also search for this author in PubMed Google Scholar
Sanad Al-Maskari
View author publications
You can also search for this author in PubMed Google Scholar
Ibrahim A. Ibrahim
View author publications
You can also search for this author in PubMed Google Scholar
Abdulqader M. Almars
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abdullah Albarrak .

Editor information

Editors and Affiliations

University of Connecticut, Storrs, CT, USA
Guojun Gan
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Bohan Li
The University of Queensland, Brisbane, QLD, Australia
Xue Li
Beijing Institute of Technology, Beijing, China
Shuliang Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Albarrak, A., Al-Maskari, S., Ibrahim, I.A., Almars, A.M. (2018). Efficiently Mining Constrained Subsequence Patterns. In: Gan, G., Li, B., Li, X., Wang, S. (eds) Advanced Data Mining and Applications. ADMA 2018. Lecture Notes in Computer Science(), vol 11323. Springer, Cham. https://doi.org/10.1007/978-3-030-05090-0_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-05090-0_1
Published: 29 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05089-4
Online ISBN: 978-3-030-05090-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Efficiently Mining Constrained Subsequence Patterns

Abstract

Access this chapter

Similar content being viewed by others

Accelerating pattern-based time series classification: a linear time and space string mining approach

Towards Efficiently Mining Frequent Interval-Based Sequential Patterns in Time Series Databases

Parallel Variable-Length Motif Discovery in Time Series Using Subsequences Correlation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Efficiently Mining Constrained Subsequence Patterns

Abstract

Access this chapter

Similar content being viewed by others

Accelerating pattern-based time series classification: a linear time and space string mining approach

Towards Efficiently Mining Frequent Interval-Based Sequential Patterns in Time Series Databases

Parallel Variable-Length Motif Discovery in Time Series Using Subsequences Correlation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation