Abstract
Streaming graph analysis is gaining importance in various fields due to the natural dynamicity in many real graph applications. Prior subgraph discovery problem over streaming graphs mostly focuses on characteristics like frequency and burstiness. Persistence, as a new characteristic, is getting increasing attention. Persistent subgraph discovery highlights behaviors where a subgraph appears recurrently in many time windows, which is vital for many real-world applications (e.g., anomaly detection). While persistent subgraph discovery enjoys many interesting real-life applications, there is no off-the-shelf solution to compute the persistent pattern efficiently. In this work, we are the first to study the persistent subgraph pattern discovering problem over the streaming graph. We devise an auxiliary data structure called \(\textsf {TFD} \) to detect the persistent subgraph patterns in real-time with limited memory usage. \(\textsf {TFD} \) maps each subgraph into the corresponding bucket based on hash functions to compute the persistence of each pattern. Then we introduce optimizations to separate persistent and non-persistent patterns, further improving the effectiveness and throughput in space-scarce scenarios. Extensive experiments confirm the superiority of our proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdelhamid, E., Canim, M., Sadoghi, M., Bhattacharjee, B., Chang, Y., Kalnis, P.: Incremental frequent subgraph mining on large evolving graphs. In: ICDE 2018, pp. 1767–1768 (2018)
Aslay, Ç., Nasir, M.A.U., Morales, G.D.F., Gionis, A.: Mining frequent patterns in evolving graphs. In: CIKM 2018. pp. 923–932 (2018)
Belth, C., Zheng, X., Koutra, D.: Mining persistent activity in continually evolving networks. In: KDD 2020, pp. 934–944 (2020)
Chen, Z., Wang, X., Wang, C., Li, J.: Explainable link prediction in knowledge hypergraphs. In: CIKM 2022, pp. 262–271 (2022)
Choudhury, S., Holder, L.B., Jr., G.C., Agarwal, K., Feo, J.: A selectivity based approach to continuous pattern detection in streaming graphs. In: EDBT 2015, pp. 157–168 (2015)
Dai, H., Shahzad, M., Liu, A.X., Zhong, Y.: Finding persistent items in data streams. Proc. VLDB Endow. 10(4), 289–300 (2016)
Hellmann, S., Stadler, C., Lehmann, J., Auer, S.: DBpedia live extraction. In: Meersman, R., Dillon, T., Herrero, P. (eds.) OTM 2009. LNCS, vol. 5871, pp. 1209–1223. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-05151-7_33
Li, R., Su, J., Qin, L., Yu, J.X., Dai, Q.: Persistent community search in temporal networks. In: ICDE 2018, pp. 797–808 (2018)
Li, Y., Zou, L., Özsu, M.T., Zhao, D.: Time constrained continuous subgraph search over streaming graphs. In: ICDE 2019, pp. 1082–1093 (2019)
Li, Z., Liu, X., Wang, X., Liu, P., Shen, Y.: Transo: a knowledge-driven representation learning method with ontology information constraints. World Wide Web 26, 297–319 (2023)
Ma, Z., Yang, J., Li, K., Liu, Y., Zhou, X., Hu, Y.: A parameter-free approach for lossless streaming graph summarization. In: Jensen, C.S., et al. (eds.) DASFAA 2021. LNCS, vol. 12681, pp. 385–393. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73194-6_26
Min, S., Park, S.G., Park, K., Giammarresi, D., Italiano, G.F., Han, W.: Symmetric continuous subgraph matching with bidirectional dynamic programming. Proc. VLDB Endow. 14(8), 1298–1310 (2021)
Nasir, M.A.U., Aslay, Ç., Morales, G.D.F., Riondato, M.: Tiptap: approximate mining of frequent k-subgraph patterns in evolving graphs. ACM Trans. Knowl. Discov. Data 15(3), 48:1-48:35 (2021)
Pacaci, A., Bonifati, A., Özsu, M.T.: Regular path query evaluation on streaming graphs. In: SIGMOD Conference 2020, pp. 1415–1430 (2020)
Ray, A., Holder, L., Choudhury, S.: Frequent subgraph discovery in large attributed streaming graphs. In: Proceedings of the 3rd International Workshop on Big Data, vol. 36, pp. 166–181 (2014)
Yang, X., Song, C., Yu, M., Gu, J., Liu, M.: Distributed triangle approximately counting algorithms in simple graph stream. ACM Trans. Knowl. Discov. Data 16(4), 79:1–79:43 (2022)
Zhang, L., Jiang, H., Wang, F., Feng, D., Xie, Y.: T-sample: a dual reservoir-based sampling method for characterizing large graph streams. In: ICDE 2019, pp. 1674–1677 (2019)
Zhang, Q., Guo, D., Zhao, X.: Discovering bursting patterns over streaming graphs. In: DASFAA 2022, pp. 441–458 (2022)
Zhang, Y., et al.: On-off sketch: a fast and accurate sketch on persistence. Proc. VLDB Endow. 14(2), 128–140 (2020)
Acknowledgements
This work is partially supported by National Natural Science Foundation of China under Grant No. U19B2024, National Natural Science Foundation of China under Grant No.6227246 and Postgraduate Scientific Research Innovation Project of Hunan Province under Grant No. CX20210038.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Huang, C., Zhang, Q., Guo, D., Zhao, X. (2023). Discovering Persistent Subgraph Patterns over Streaming Graphs. In: Wang, X., et al. Database Systems for Advanced Applications. DASFAA 2023. Lecture Notes in Computer Science, vol 13945. Springer, Cham. https://doi.org/10.1007/978-3-031-30675-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-30675-4_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-30674-7
Online ISBN: 978-3-031-30675-4
eBook Packages: Computer ScienceComputer Science (R0)