Skip to main content

Mining Outliers in Spatial Networks

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3882))

Included in the following conference series:

Abstract

Outlier analysis is an important task in data mining and has attracted much attention in both research and applications. Previous work on outlier detection involves different types of databases such as spatial databases, time series databases, biomedical databases, etc. However, few of the existing studies have considered spatial networks where points reside on every edge. In this paper, we study the interesting problem of distance-based outliers in spatial networks. We propose an efficient mining method which partitions each edge of a spatial network into a set of length d segments, then quickly identifies the outliers in the remaining edges after pruning those unnecessary edges which cannot contain outliers. We also present algorithms that can be applied when the spatial network is updating points or the input parameters of outlier measures are changed. The experimental results verify the scalability and efficiency of our proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aggarwal, C., Yu, P.: Outlier detection for high dimensional data. In: SIGMOD (2001)

    Google Scholar 

  2. Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: SIGMOD (2000)

    Google Scholar 

  3. Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)

    MATH  Google Scholar 

  4. Chakrabarti, D.: AutoPart: Parameter-Free Graph Partitioning and Outlier Detection. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, Springer, Heidelberg (2004)

    Google Scholar 

  5. Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases. In: KDD (1996)

    Google Scholar 

  6. Guha, S., Rastogi, R., Shim, K.: Cure: An efficient clustering algorithm for large databases. In: SIGMOD (1998)

    Google Scholar 

  7. Hawkins, D.: Identification of Outliers. Chapman and Hall, London (1980)

    Book  MATH  Google Scholar 

  8. Hautamki, V., Krkkinen, I., Frnti, P.: Outlier detection using k-nearest neighbour graph. In: ICPR (2004)

    Google Scholar 

  9. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco

    Google Scholar 

  10. Jagadish, H., Koudas, N., Muthukrishnan, S.: Mining deviants in a time series database. In: VLDB 1999 (1999)

    Google Scholar 

  11. Jin, W., Tung, A.K.H., Han, J.W.: Mining Top-n Local Outliers in Large Databases. In: KDD (2001)

    Google Scholar 

  12. Edwin, M., Knorr, R.T.: Ng: Algorithms for Mining Distance-Based Outliers in Large Datasets. In: VLDB (1998)

    Google Scholar 

  13. Knorr, E., Ng, R.: Finding Intensional Knowledge of Distance-Based Outliers. In: VLDB (1999)

    Google Scholar 

  14. Muthukrishnan, S.: Rahul Shah, Jeffrey Scott Vitter: Mining Deviants in Time Series Data Streams. In: SSDBM (2004)

    Google Scholar 

  15. Ng, R., Han, J.: Efficient and effective clustering method for spatial data mining. In: VLDB (1994)

    Google Scholar 

  16. Papadimitriou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast Outlier Detection Using the Local Correlation Integral. In: ICDE (2003)

    Google Scholar 

  17. Papadimitriou, S., Faloutsos, C.: Cross-Outlier Detection. In: Hadzilacos, T., Manolopoulos, Y., Roddick, J.F., Theodoridis, Y. (eds.) SSTD 2003. LNCS, vol. 2750, Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  18. Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: SIGMOD (1995)

    Google Scholar 

  19. Ramaswamy, S., Rastogi, R., Shim, K.: Efficient Algorithms for Mining Outliers from Large Data Sets. In: SIGMOD (2000)

    Google Scholar 

  20. Shekhar, S., Lu, C.-T., Zhang, P.: Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In: KDD (2001)

    Google Scholar 

  21. Sander, J., Ng, R.T., Sleumer, M.C., Yuen, M.S., Jones, S.J.: A methodology for analyzing SAGE libraries for cancer profiling. ACM Trans. Inf. Syst. 23(1), 35–60 (2005)

    Article  Google Scholar 

  22. Wong, W.-K., Moore, A.W., Cooper, G.F., Wagner, M.: Rule-Based Anomaly Pattern Detection for Detecting Disease Outbreaks. In: AAAI (2002)

    Google Scholar 

  23. Yiu, M.L., Mamoulis, N.: Clustering Objects on a Spatial Network. In: SIGMOD (2004)

    Google Scholar 

  24. Yiu, M.L., Mamoulis, N., Papadias, D.: Aggregate Nearest Neighbor Queries in Road Networks. IEEE Trans. Knowl. Data Eng 17(6), 820–833 (2005)

    Article  Google Scholar 

  25. Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: SIGMOD (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jin, W., Jiang, Y., Qian, W., Tung, A.K.H. (2006). Mining Outliers in Spatial Networks. In: Li Lee, M., Tan, KL., Wuwongse, V. (eds) Database Systems for Advanced Applications. DASFAA 2006. Lecture Notes in Computer Science, vol 3882. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11733836_13

Download citation

  • DOI: https://doi.org/10.1007/11733836_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33337-1

  • Online ISBN: 978-3-540-33338-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics