Outlier detection with partial information: application to emergency mapping

D’Alimonte, Davide; Cornford, Dan

doi:10.1007/s00477-007-0164-8

Original Paper
Published: 30 June 2007

Volume 22, pages 613–620, (2008)
Stochastic Environmental Research and Risk Assessment

Davide D’Alimonte¹ &
Dan Cornford²

Abstract

This paper addresses the problem of novelty detection when the observed data are a mixture of a known ‘background’ process and an unknown contaminating process, which generates the outliers, or novel observations. The framework we describe is quite general: it employs univariate classification with incomplete information, based on knowledge of the distribution (the probability density function, pdf) of the data generated by the ‘background’ process. The relative proportion of this ‘background’ component (the prior ‘background’ probability), and the pdfs and prior probabilities of all other components, are assumed unknown. The main contribution is a new classification scheme that identifies the maximum proportion of the observed data that follows the known ‘background’ distribution. The method exploits the Kolmogorov–Smirnov test to estimate the proportions, after which the data are Bayes-optimally separated. Results on synthetic data show that this approach can be more reliable than a standard novelty detection scheme. The classification algorithm is then applied to the problem of identifying outliers in the SIC2004 data set, in order to detect the radioactive release simulated in the ‘joker’ data set. We propose this method as a reliable means of novelty detection in emergency situations, which can also be used to identify outliers before a more general automatic mapping algorithm is applied.
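The idea of finding the largest background proportion that the data can support may be illustrated with a minimal sketch. This is not the authors' algorithm, whose details are not given in the abstract; it is a simple KS-style heuristic under stated assumptions. If a fraction pi of the sample follows the known background CDF F0, then the residual G(x) = F_n(x) - pi * F0(x) should be non-decreasing up to sampling noise, so the sketch scans pi downward from 1 and accepts the first value for which no drop in G exceeds a tolerance. The tolerance (twice the asymptotic 5% Kolmogorov–Smirnov critical value), the standard normal background, the synthetic N(6, 1) outliers, and all function names are assumptions made for illustration.

```python
import math
import random

KS_C_05 = 1.358  # asymptotic two-sided KS critical constant at alpha = 0.05


def normal_cdf(x, mu=0.0, sigma=1.0):
    """CDF of N(mu, sigma^2); stands in for the known background CDF (assumed)."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def max_residual_drop(sorted_x, background_cdf, pi):
    """Largest drop of G(x) = F_n(x) - pi * F0(x) between any two points a < b.

    G falls between sample points and jumps up by 1/n at each one, so its
    local minima sit just before a jump and its local maxima just after.
    """
    n = len(sorted_x)
    running_max = 0.0  # G(-inf) = 0
    worst = 0.0
    for i, x in enumerate(sorted_x):
        scaled = pi * background_cdf(x)
        worst = max(worst, running_max - (i / n - scaled))    # just before the jump
        running_max = max(running_max, (i + 1) / n - scaled)  # just after the jump
    return max(worst, running_max - (1.0 - pi))  # G(+inf) = 1 - pi


def estimate_background_proportion(x, background_cdf, steps=100):
    """Largest grid value of pi whose residual drop stays within the tolerance."""
    sorted_x = sorted(x)
    # Hypothetical tolerance: if |F_n - F| <= d everywhere, any drop is <= 2d.
    tol = 2.0 * KS_C_05 / math.sqrt(len(sorted_x))
    for k in range(steps, -1, -1):
        pi = k / steps
        if max_residual_drop(sorted_x, background_cdf, pi) <= tol:
            return pi
    return 0.0


# Synthetic check: 80% background N(0, 1), 20% outliers N(6, 1) (both assumed).
rng = random.Random(0)
data = [rng.gauss(0.0, 1.0) for _ in range(800)]
data += [rng.gauss(6.0, 1.0) for _ in range(200)]
pi_hat = estimate_background_proportion(data, normal_cdf)
print("estimated background proportion:", pi_hat)
```

Because the scan accepts any proportion that the tolerance cannot rule out, the estimate is an upper bound and tends to sit somewhat above the true value for moderate sample sizes. Given such an estimate and some estimate of the mixture density f, the posterior background probability pi * f0(x) / f(x) could then be thresholded to separate the two classes; that density-estimation step is left out of this sketch.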

Notes

Notice how this step directly exploits the generative nature of the GMM algorithm.

Acknowledgments

This work was partially supported by BBSRC contract 92/EGM17737. The SIC2004 data were obtained from http://www.ai-geostats.org. This work is partially funded by the European Commission, under the Sixth Framework Programme, by Contract No. 033811 with DG INFSO, Action Line IST-2005-2.5.12 ICT for Environmental Risk Management. The views expressed herein are those of the authors and are not necessarily those of the European Commission.

Author information

Authors and Affiliations

NASA/Goddard Space Flight Center, Greenbelt, MD, USA
Davide D’Alimonte
Neural Computing Research Group, Aston University, Birmingham, UK
Dan Cornford

Corresponding author

Correspondence to Dan Cornford.

About this article

Cite this article

D’Alimonte, D., Cornford, D. Outlier detection with partial information: application to emergency mapping. Stoch Environ Res Risk Assess 22, 613–620 (2008). https://doi.org/10.1007/s00477-007-0164-8

Published: 30 June 2007
Issue Date: August 2008
DOI: https://doi.org/10.1007/s00477-007-0164-8
