Density-Based Anomaly Detection in the Maritime Domain
Detecting anomalies in the maritime domain is a complex task. The complexity is due to the many information sources in the maritime domain and the knowledge required to interpret the information. Operators that monitor vessel movements may be supported by systems that help them to identify abnormal behaviors and events. The uncertain nature of some aspects of the maritime domain (e.g., weather predictions), suggests the use of statistical computer methods, rather than deterministic rule-based methods. In these methods, maritime objects and events (e.g., vessels and vessel behaviors) are represented as points in a (potentially high-dimensional) space in which the dissimilarities of pairs of points are represented by distances. Anomalies manifest themselves as points that are distant from all other points. We present the Stochastic Outlier Selection (SOS) method that takes an unlabeled set of points and automatically selects the outliers. In a comparative evaluation involving a wide range of complex (non-maritime) tasks, the SOS method is shown to outperform state-of-the-art outlier detection methods. We conclude that the SOS method is suitable to be applied to complex tasks.
This research has been carried out as a part of the Poseidon project at Thales under the responsibilities of the Embedded Systems Institute (ESI). This project is partially supported by the Dutch Ministry of Economic Affairs under the BSIK program.
- 1.Barnett V, Lewis T (1994) Outliers in statistical data. Wiley series in probability and mathematical statistics, 3rd edn. Wiley, ChichesterGoogle Scholar
- 2.Bishop CM (2006) Pattern recognition and machine learning. Series in information science and statistics. Springer, New YorkGoogle Scholar
- 3.Breunig MM, Kriegel HP, Ng RT, Sander J (2000) LOF: identifying density-based local outliers. ACM SIGMOD Rec. Dallas, TX 29(2):93–104Google Scholar
- 4.Fayyad UM, Piatetsky-Shapiro G, Smyth P (1996) Knowledge discovery and data mining: towards a unifying framework. Knowl Discov Data Min 82–88Google Scholar
- 5.Janssens JHM, Huszar F, Postma EO, van den Herik HJ (2012) Stochastic outlier selection. Technical report TiCC TR 2012-001, Tilburg University, Tilburg Center for Cognition and Communication, Tilburg, The NetherlandsGoogle Scholar
- 7.Nemenyi P (1963) Distribution-free multiple comparisons. Ph.D. thesis, PrincetonGoogle Scholar
- 8.Papadimitriou S, Kitagawa H, Gibbons PB, Faloutsos C (2003) LOCI: fast outlier detection using the local correlation integral. In: Proceedings of the 19th international conference on data engineering, Bangalore, India, pp 315–326Google Scholar
- 10.Riveiro M, Falkman G, Ziemke T (2008) Improving maritime anomaly detection and situation awareness through interactive visualization. In: 11th international conference on information fusion, 2008. IEEE, Piscataway, pp 1–8Google Scholar
- 12.Tax DMJ (2001) One-class classification: concept-learning in the absence of counter-examples. Ph.D. thesis, Delft University of Technology, Delft, The NetherlandsGoogle Scholar