Novelty Detection from Evolving Complex Data Streams with Time Windows

Ceci, Michelangelo; Appice, Annalisa; Loglisci, Corrado; Caruso, Costantina; Fumarola, Fabio; Malerba, Donato

doi:10.1007/978-3-642-04125-9_59

Michelangelo Ceci²³,
Annalisa Appice²³,
Corrado Loglisci²³,
Costantina Caruso²³,
Fabio Fumarola²³ &
…
Donato Malerba²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5722))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

1253 Accesses
9 Citations

Abstract

Novelty detection in data stream mining denotes the identification of new or unknown situations in a stream of data elements flowing continuously in at rapid rate. This work is a first attempt of investigating the anomaly detection task in the (multi-)relational data mining. By defining a data block as the collection of complex data which periodically flow in the stream, a relational pattern base is incrementally maintained each time a new data block flows in. For each pattern, the time consecutive support values collected over the data blocks of a time window are clustered, clusters are then used to identify the novelty patterns which describe a change in the evolving pattern base. An application to the problem of detecting novelties in an Internet packet stream is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Blockeel, H., Sebag, M.: Scalability and efficiency in multi-relational data mining. SIGKDD Explorations Newsletter 5(1), 17–30 (2003)
Article Google Scholar
Brenna, L., Demers, A., Gehrke, J., Hong, M., Ossher, J., Panda, B., Riedewald, M., Thatte, M., White, W.: Cayuga: a high-performance event processing engine. In: International Conference on Management of Data, pp. 1100–1102. ACM, New York (2007)
Google Scholar
Domingos, P., Hulten, G.: Mining high-speed data streams. In: the 6th International Conference on Knowledge Discovery and Data Mining, KDD 2000, pp. 71–80. ACM, New York (2000)
Google Scholar
Džeroski, S., Lavrač, N.: Relational Data Mining. Springer, Heidelberg (2001)
Book MATH Google Scholar
Gaber, M.M., Zaslavsky, A., Krishnaswamy, S.: Mining data streams: a review. SIGMOD Record 34(2), 18–26 (2005)
Article MATH Google Scholar
Gama, J.: Issues and challenges in learning from data streams. In: Kargupta, H., Han, J., Yu, P.S., Motwani, R., Kumar, V. (eds.) Data Mining and Knowledge Discovery Series on Next Generation of Data Mining, pp. 209–222. Chapman and Hall, CRC Press, Taylor and Francis Group (2009)
Google Scholar
Ganti, V., Gehrke, J., Ramakrishnan, R.: Mining data streams under block evolution. SIGKDD Explorations 3(2), 1–10 (2002)
Article Google Scholar
Guha, S., Koudas, N., Shim, K.: Data-streams and histograms. In: the 33th Symposium on Theory of Computing, STOC 2001, pp. 471–475. ACM, New York (2001)
Google Scholar
http://www.streambase.com/
Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: the 7th International Conference on Knowledge Discovery and Data Mining, KDD 2001, pp. 97–106. ACM, New York (2001)
Google Scholar
Lisi, F.A., Malerba, D.: Inducing multi-level association rules from multiple relations. Machine Learning 55(2), 175–210 (2004)
Article MATH Google Scholar
Ma, J., Perkins, S.: Online novelty detection on temporal sequences. In: the 9th International Conference on Knowledge Discovery and Data Mining, KDD 2003, pp. 613–618. ACM, New York (2003)
Google Scholar
Mannila, H., Toivonen, H.: Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery 1(3), 241–258 (1997)
Article Google Scholar
Mitchell, T.: Machine Learning. McGraw Hill, New York (1997)
MATH Google Scholar
Plotkin, G.D.: A note on inductive generalization. Machine Intelligence 5, 153–163 (1970)
MathSciNet MATH Google Scholar
Sander, J., Ester, M., Kriegel, H.-P., Xu, X.: Density-based clustering in spatial databases: The algorithm gdbscan and its applications. Data Mining and Knowledge Discovery 2(2), 169–194 (1998)
Article Google Scholar
Spinosa, E.J., de Carvalho, A.P.d.L.F., Gama, J.: Cluster-based novel concept detection in data streams applied to intrusion detection in computer networks. In: The Symposium on Applied Computing, SAC 2008, pp. 976–980. ACM, New York (2008)
Chapter Google Scholar
Zhang, X., Dong, G., Kotagiri, R.: Exploring constraints to efficiently mine emerging patterns from large high-dimensional datasets. In: Knowledge Discovery and Data Mining, pp. 310–314 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Università degli Studi di Bari, via Orabona, 4, 70126, Bari, Italy
Michelangelo Ceci, Annalisa Appice, Corrado Loglisci, Costantina Caruso, Fabio Fumarola & Donato Malerba

Authors

Michelangelo Ceci
View author publications
You can also search for this author in PubMed Google Scholar
Annalisa Appice
View author publications
You can also search for this author in PubMed Google Scholar
Corrado Loglisci
View author publications
You can also search for this author in PubMed Google Scholar
Costantina Caruso
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Fumarola
View author publications
You can also search for this author in PubMed Google Scholar
Donato Malerba
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics and Statistics, University of Economics, W. Churchill Sq. 4, 130 67, Prague 3, Czech Republic
Jan Rauch
Department of Computer Science, University of North Carolina, NC 27599-3175, Charlotte, USA
Zbigniew W. Raś
Faculty of Informatics and Statics, University of Economics, W. Churchill Sq. 4, 130 67, Prague, Czech Republic
Petr Berka
Institute of Software Systems, Tampere University of Technology, P. O. Box 553, 33101, Tampere, Finland
Tapio Elomaa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ceci, M., Appice, A., Loglisci, C., Caruso, C., Fumarola, F., Malerba, D. (2009). Novelty Detection from Evolving Complex Data Streams with Time Windows. In: Rauch, J., Raś, Z.W., Berka, P., Elomaa, T. (eds) Foundations of Intelligent Systems. ISMIS 2009. Lecture Notes in Computer Science(), vol 5722. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04125-9_59

Download citation

DOI: https://doi.org/10.1007/978-3-642-04125-9_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04124-2
Online ISBN: 978-3-642-04125-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics