Fault Mining Using Peer Group Analysis
There has been increasing interest in deploying data mining methods for fault detection. For the case where we have potentially large numbers of devices to monitor, we propose to use peer group analysis to identify faults. First, we identify the peer group of each device. This consists of other devices that have behaved similarly. We then monitor the behaviour of a device by measuring how well the peer group tracks the device. Should the device’s behaviour deviate strongly from its peer group we flag the behaviour as an outlier. An outlier is used to indicate the potential occurrence of a fault. A device exhibiting outlier behaviour from its peer group need not be an outlier to the population of devices. Indeed a device exhibiting behaviour typical for the population of devices might deviate sufficiently far from its peer group to be flagged as an outlier. We demonstrate the usefulness of this property for detecting faults by monitoring the data output from a collection of privately run weather stations across the UK.
KeywordsTime Series Fault Detection Mahalanobis Distance Multiple Time Series Plastic Card
We would like to express appreciation to Weather Underground, Inc. for use of their data. The work of David Weston was supported by grant number EP/C532589/1 from the UK Engineering and Physical Sciences Research Council. The work of Yoonseong Kim was supported by the Korea Research Foundation Grant funded by the Korean Government (MOEHRD) (KRF-2006-612-D00100). The work of David Hand was partially supported by a Royal Society Wolfson Research Merit Award.
- Bolton RJ, Hand DJ (2001) Unsupervised profiling methods for fraud detection. In: Conference on Credit Scoring and Credit Control 7, Edinburgh, UK, 5-7 SeptGoogle Scholar