Abstract
Most existing network embedding based anomalous link detection methods regard network embedding and anomalous link detection as two independent tasks. However, removing anomalous links from the original network can reduce the data noise, thus hopefully improving the performance of network embedding models and anomalous link detection. In this paper, we propose an Anomaly Aware Network Embedding (AANE) framework by simultaneously learning node embedding and detecting anomalous links in a unified way. To instantiate the AANE framework, we propose a heuristic anomalous link selection based model AANE-H and an embedding disentangling based model AANE-D on Graph Auto-Encoder (GAE). In AANE-H, we adopt an anomalous link selector to iteratively select significant anomalous links based on a heuristic rule during model training, while in AANE-D the normal and anomalous links are generated by disentangled normal and anomalous embedding respectively. For the evaluation purpose, we propose a heuristic anomalous link generation algorithm to inject synthetic anomalous links into six real-world network datasets used in our experiments. Experimental results show that AANE outperforms both the state-of-the-art network embedding models and anomalous node detection models in terms of anomalous link detection performance. As a general network embedding model, AANE can also improve other downstream tasks like node classification.
Similar content being viewed by others
References
Akoglu L, Tong H, Koutra D (2015) Graph based anomaly detection and description: a survey. Data Min Knowl Disc 29(3):626–688
Bandyopadhyay S, Lokesh N, Murty MN (2019) Outlier aware network embedding for attributed networks. In: AAAI, pp 12–19
Bandyopadhyay S, N L, Vivek SV, Murty MN (2020) Outlier resistant unsupervised deep architectures for attributed network embedding. In: WSDM, pp 25–33
Bruna J, Zaremba W, Szlam A, Lecun Y (2014) Spectral networks and locally connected networks on graphs. In: ICLR
Cen Y, Zou X, Zhang J, Yang H, Zhou J, Tang J (2019) Representation learning for attributed multiplex heterogeneous network. In: KDD, pp 1358–1368. https://doi.org/10.1145/3292500.3330964
Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM Comput Surv 41(3):15–158
Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. In: NIPS, pp 3844–3852
Ding Q, Katenka N, Barford P, Kolaczyk ED, Crovella M (2012) Intrusion as (anti)social communication: characterization and detection. In: Knowledge Discovery and Data Mining, pp 886–894
Ding K, Li J, Bhanushali R, Liu H (2019) Deep anomaly detection on attributed networks. In: SDM
Ding K, Li J, Liu H (2019) Interactive anomaly detection on attributed networks. In: WSDM, pp 357–365. https://doi.org/10.1145/3289600.3290964
Dong Y, Chawla N, Swami A (2017) metapath2vec: Scalable representation learning for heterogeneous networks. In: KDD, pp 135–144. https://doi.org/10.1145/3097983.3098036
Duan D, Tong L, Li Y, Lu J, Shi L, Zhang C (2020) Aane: anomaly aware network embedding for anomalous link detection. In: ICDM, pp 1002–1007
Fang L, Feng K, Gui J, Feng S, Hu A (2023) Anonymous edge representation for inductive anomaly detection in dynamic bipartite graph. Proc VLDB Endow 16(5):1154–1167
Fan H, Zhang F, Li Z (2020) Anomalydae: Dual autoencoder for anomaly detection on attributed networks. In: ICASSP, pp 5685–5689
Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41
Gao Y, Wang X, He X, Liu Z, Feng H, Zhang Y (2023) Alleviating structural distribution shift in graph anomaly detection. In: WSDM, pp 357–365
Grover A, Leskovec J (2016) node2vec: Scalable feature learning for networks. In: KDD, pp 855–864. https://doi.org/10.1145/2939672.2939754
Gupte M, Eliassirad T (2012) Measuring tie strength in implicit social networks. In: Web Science, pp 109–118
Hamilton W, Ying R, Leskovec J (2017) Inductive representation learning on large graphs. In: NIPS, pp 1024–1034
Ioannidis VN, Berberidis D, Giannakis GB (2019) Graphsac: Detecting anomalies in large-scale graphs. arXiv: Learning
Kingma DP, Ba J (2015) A method for stochastic optimization. In: ICLR
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: ICLR
Kipf T, Welling M (2016) Variational graph auto-encoders. arXiv: Machine Learning
Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. J Am Soc Inform Sci Technol 58(7):1019–1031
Li J, Dani H, Xia H, Liu H (2017) Radar: Residual analysis for anomaly detection in attributed networks. In: IJCAI
Li Y, Huang X, Li J, Du M, Zou N (2019) Specae: spectral autoencoder for anomaly detection in attributed networks. arXiv: Learning
Liu Y, Pan S, Wang YG, Xiong F, Wang L, Chen Q, Lee VC (2021) Anomaly detection in dynamic graphs via transformer. IEEE Transactions on Knowledge and Data Engineering 1–1 . https://doi.org/10.1109/TKDE.2021.3124061
Luo X, Wu J, Beheshti A, Yang J, Zhang X, Wang Y, Xue S (2022) Comga: Community-aware attributed graph anomaly detection. In: WSDM, pp 657–665
Mikolov T, Corrado Gs, Chen K, Dean J (2013) Efficient estimation of word representations in vector space. In: ICLR
Noh JD, Rieger H (2004) Random walks on complex networks. Phys Rev Lett 92(11):118701
Pei1 Y, Huang T, van Ipenburg W, Pechenizkiy M, (2022) Resgcn: attention-based deep residual modeling for anomaly detection on attributed networks. Mach Learn 111:519–541
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: KDD, pp 701–710. https://doi.org/10.1145/2623330.2623732
Skillicorn D (2007) Detecting anomalies in graphs. In: IEEE Intelligence and Security Informatics, pp 209–216. https://doi.org/10.1109/ISI.2007.379473
Song X, Wu M, Jermaine C, Ranka S (2007) Conditional anomaly detection. IEEE Trans Knowl Data Eng 19(5):631–645. https://doi.org/10.1109/TKDE.2007.1009
Veličković P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y (2017) Graph attention networks. In: ICLR
Vincent P, Larochelle H, Bengio Y, Manzagol P-A (2008) Extracting and composing robust features with denoising autoencoders. In: ICML, pp 1096–1103
Wang D, Cui P, Zhu W (2016) Structural deep network embedding. In: KDD, pp 1225–1234
Watts DJ, Strogatz SH (1998) Collective dynamics of small world networks. Nature 393(6684):440–442
Xu Z, Li L, Li H, Sun Q, Hu S, Ji R (2023) Self-supervised graph representation learning for black market account detection. In: WSDM, pp 330–338
Xu B, Shen H, Cao Q, Qiu Y, Cheng X (2019) Graph wavelet neural network. In: ICLR
Xu X, Yuruk N, Feng Z, Schweiger TAJ (2007) Scan: a structural clustering algorithm for networks. Data Mining and Knowledge Discovery, pp 824–833
Yu W, Cheng W, Aggarwal C, Zhang K, Chen H, Wang W (2018) Netwalk: A flexible deep embedding approach for anomaly detection in dynamic networks. In: Data Mining and Knowledge Discovery, pp 2672–2681. https://doi.org/10.1145/3219819.3220024
Zheng L, Li Z, Li J, Li Z, Gao J (2019) Addgraph: Anomaly detection in dynamic graph using attention-based temporal gcn. In: IJCAI, pp 4419–4425
Acknowledgements
This work is supported by National Natural Science Foundation of China under Grants 62272125, 62192785, U1836111, U1936110.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
We declare that we have no conflict of interest.
Additional information
Responsible editor: Aristides Gionis.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Duan, D., Zhang, C., Tong, L. et al. An anomaly aware network embedding framework for unsupervised anomalous link detection. Data Min Knowl Disc 38, 501–534 (2024). https://doi.org/10.1007/s10618-023-00960-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10618-023-00960-6