Mining Outliers in Spatial Networks

Jin, Wen; Jiang, Yuelong; Qian, Weining; Tung, Anthony K. H.

doi:10.1007/11733836_13

Wen Jin¹⁹,
Yuelong Jiang¹⁹,
Weining Qian²⁰ &
…
Anthony K. H. Tung²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3882))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

1057 Accesses
10 Citations

Abstract

Outlier analysis is an important task in data mining and has attracted much attention in both research and applications. Previous work on outlier detection involves different types of databases such as spatial databases, time series databases, biomedical databases, etc. However, few of the existing studies have considered spatial networks where points reside on every edge. In this paper, we study the interesting problem of distance-based outliers in spatial networks. We propose an efficient mining method which partitions each edge of a spatial network into a set of length d segments, then quickly identifies the outliers in the remaining edges after pruning those unnecessary edges which cannot contain outliers. We also present algorithms that can be applied when the spatial network is updating points or the input parameters of outlier measures are changed. The experimental results verify the scalability and efficiency of our proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aggarwal, C., Yu, P.: Outlier detection for high dimensional data. In: SIGMOD (2001)
Google Scholar
Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: SIGMOD (2000)
Google Scholar
Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)
MATH Google Scholar
Chakrabarti, D.: AutoPart: Parameter-Free Graph Partitioning and Outlier Detection. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, Springer, Heidelberg (2004)
Google Scholar
Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases. In: KDD (1996)
Google Scholar
Guha, S., Rastogi, R., Shim, K.: Cure: An efficient clustering algorithm for large databases. In: SIGMOD (1998)
Google Scholar
Hawkins, D.: Identification of Outliers. Chapman and Hall, London (1980)
Book MATH Google Scholar
Hautamki, V., Krkkinen, I., Frnti, P.: Outlier detection using k-nearest neighbour graph. In: ICPR (2004)
Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco
Google Scholar
Jagadish, H., Koudas, N., Muthukrishnan, S.: Mining deviants in a time series database. In: VLDB 1999 (1999)
Google Scholar
Jin, W., Tung, A.K.H., Han, J.W.: Mining Top-n Local Outliers in Large Databases. In: KDD (2001)
Google Scholar
Edwin, M., Knorr, R.T.: Ng: Algorithms for Mining Distance-Based Outliers in Large Datasets. In: VLDB (1998)
Google Scholar
Knorr, E., Ng, R.: Finding Intensional Knowledge of Distance-Based Outliers. In: VLDB (1999)
Google Scholar
Muthukrishnan, S.: Rahul Shah, Jeffrey Scott Vitter: Mining Deviants in Time Series Data Streams. In: SSDBM (2004)
Google Scholar
Ng, R., Han, J.: Efficient and effective clustering method for spatial data mining. In: VLDB (1994)
Google Scholar
Papadimitriou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast Outlier Detection Using the Local Correlation Integral. In: ICDE (2003)
Google Scholar
Papadimitriou, S., Faloutsos, C.: Cross-Outlier Detection. In: Hadzilacos, T., Manolopoulos, Y., Roddick, J.F., Theodoridis, Y. (eds.) SSTD 2003. LNCS, vol. 2750, Springer, Heidelberg (2003)
Chapter Google Scholar
Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: SIGMOD (1995)
Google Scholar
Ramaswamy, S., Rastogi, R., Shim, K.: Efficient Algorithms for Mining Outliers from Large Data Sets. In: SIGMOD (2000)
Google Scholar
Shekhar, S., Lu, C.-T., Zhang, P.: Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In: KDD (2001)
Google Scholar
Sander, J., Ng, R.T., Sleumer, M.C., Yuen, M.S., Jones, S.J.: A methodology for analyzing SAGE libraries for cancer profiling. ACM Trans. Inf. Syst. 23(1), 35–60 (2005)
Article Google Scholar
Wong, W.-K., Moore, A.W., Cooper, G.F., Wagner, M.: Rule-Based Anomaly Pattern Detection for Detecting Disease Outbreaks. In: AAAI (2002)
Google Scholar
Yiu, M.L., Mamoulis, N.: Clustering Objects on a Spatial Network. In: SIGMOD (2004)
Google Scholar
Yiu, M.L., Mamoulis, N., Papadias, D.: Aggregate Nearest Neighbor Queries in Road Networks. IEEE Trans. Knowl. Data Eng 17(6), 820–833 (2005)
Article Google Scholar
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: SIGMOD (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing Science, Simon Fraser University, Canada
Wen Jin & Yuelong Jiang
Department of Computer Science, Fudan University, China
Weining Qian
Department of Computer Science, National University of Singapore, Singapore
Anthony K. H. Tung

Authors

Wen Jin
View author publications
You can also search for this author in PubMed Google Scholar
Yuelong Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Weining Qian
View author publications
You can also search for this author in PubMed Google Scholar
Anthony K. H. Tung
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, National University of Singapore, Singapore
Mong Li Lee
School of Computing, National University of Singapore, Singapore
Kian-Lee Tan
School of Engineering and Technology, Asian Institute of Technology, P.O. Box 4, 12120, Klong Luang, Pathum Thani, Thailand
Vilas Wuwongse

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jin, W., Jiang, Y., Qian, W., Tung, A.K.H. (2006). Mining Outliers in Spatial Networks. In: Li Lee, M., Tan, KL., Wuwongse, V. (eds) Database Systems for Advanced Applications. DASFAA 2006. Lecture Notes in Computer Science, vol 3882. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11733836_13

Download citation

DOI: https://doi.org/10.1007/11733836_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33337-1
Online ISBN: 978-3-540-33338-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics