Discovering Non-compliant Window Co-Occurrence Patterns: A Summary of Results
Given a set of trajectories annotated with measurements of physical variables, the problem of Non-compliant Window Co-occurrence (NWC) pattern discovery aims to determine temporal signatures in the explanatory variables that are highly associated with windows of undesirable behavior in a target variable. NWC discovery is important for societal applications such as eco-friendly transportation (e.g., identifying engine signatures leading to high greenhouse gas emissions). Challenges in designing a scalable algorithm for NWC discovery include the non-monotonicity of popular spatio-temporal statistical interest measures of association, such as the cross-K function. This non-monotonicity renders anti-monotone pruning-based algorithms (e.g., Apriori) inapplicable. To address this limitation, we propose two novel upper bounds for the cross-K function that help in filtering uninteresting candidate patterns. Using these bounds, we also propose a Multi-Parent Tracking approach (MTNMiner) for mining NWC patterns. A case study with real-world engine data demonstrates the ability of the proposed approach to discover patterns that are interesting to engine scientists. Experimental evaluation on real-world data shows that MTNMiner results in substantial computational savings over the naive approach.
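To illustrate why non-monotonicity rules out Apriori-style pruning, the following minimal sketch evaluates a generic one-dimensional cross-K estimate, T / (n_p · n_w) times the number of (pattern, window) pairs within a given lag. This generic form is an assumption made for illustration and is not necessarily the exact measure used in the paper; the function name `cross_k` and all data values are hypothetical.

```python
from itertools import product


def cross_k(pattern_times, window_times, lag, total_length):
    """Generic 1-D cross-K estimate (assumed form, for illustration only):
    T / (n_p * n_w) * #{(p, w) : |p - w| <= lag}."""
    if not pattern_times or not window_times:
        return 0.0
    close_pairs = sum(
        1 for p, w in product(pattern_times, window_times) if abs(p - w) <= lag
    )
    return total_length * close_pairs / (len(pattern_times) * len(window_times))


# Hypothetical time indices on a trajectory of length 16.
noncompliant = [5, 6]        # start times of non-compliant windows
general = [1, 5, 9, 13]      # occurrences of a short (general) pattern
specific = [5]               # occurrences of a longer pattern that extends it

k_general = cross_k(general, noncompliant, lag=1, total_length=16)
k_specific = cross_k(specific, noncompliant, lag=1, total_length=16)
print(k_general, k_specific)  # 4.0 16.0
```

In this toy case the more specific pattern occurs less often yet scores higher than the general pattern it extends. A threshold on the measure therefore cannot be used to discard a pattern's extensions once the pattern itself falls below the threshold, which is the property Apriori relies on.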
Keywords: Dioxide, Torque, Transportation
This material is based upon work supported by the National Science Foundation under Grant No. 1029711, IIS-1320580, 0940818 and IIS-1218168, the USDOD under Grant No. HM1582-08-1-0017 and HM0210-13-1-0005, and the University of Minnesota under the OVPR U-Spatial. We are particularly grateful to Kim Koffolt and the members of the University of Minnesota Spatial Computing Research Group for their valuable comments.