Discovering Non-compliant Window Co-Occurrence Patterns: A Summary of Results

  • Reem Y. Ali
  • Venkata M.V. Gunturi
  • Andrew J. Kotz
  • Shashi Shekhar
  • William F. Northrop
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9239)

Abstract

Given a set of trajectories annotated with measurements of physical variables, the problem of Non-compliant Window Co-occurrence (NWC) pattern discovery aims to determine temporal signatures in the explanatory variables which are highly associated with windows of undesirable behavior in a target variable. NWC discovery is important for societal applications such as eco-friendly transportation (e.g. identifying engine signatures leading to high greenhouse gas emissions). Challenges of designing a scalable algorithm for NWC discovery include the non-monotonicity of popular spatio-temporal statistical interest measures of association such as the cross-K function. This challenge renders the anti-monotone pruning based algorithms (e.g. Apriori) inapplicable. To address this limitation, we propose two novel upper bounds for the cross-K function which help in filtering uninteresting candidate patterns. Using these bounds, we also propose a Multi-Parent Tracking approach (MTNMiner) for mining NWC patterns. A case study with real world engine data demonstrates the ability of the proposed approach to discover patterns which are interesting to engine scientists. Experimental evaluation on real-world data show that MTNMiner results in substantial computational savings over the naive approach.

Keywords

Dioxide Torque Transportation 

Notes

Acknowledgement

This material is based upon work supported by the National Science Foundation under Grant No. 1029711, IIS-1320580, 0940818 and IIS-1218168, the USDOD under Grant No. HM1582-08-1-0017 and HM0210-13-1-0005, and the University of Minnesota under the OVPR U-Spatial. We are particularly grateful to Kim Koffolt and the members of the University of Minnesota Spatial Computing Research Group for their valuable comments.

References

  1. 1.
    Heavy-Duty Onroad Engines. https://www.dieselnet.com/standards/us/hd.php (2015)
  2. 2.
  3. 3.
    Wang, J., He, Q.P.: Multivariate statistical process monitoring based on statistics pattern analysis. Ind. Eng. Chem. Res. 49(17), 7858–7869 (2010)CrossRefGoogle Scholar
  4. 4.
    Vijayaraghavan, K., et al.: Effects of light duty gasoline vehicle emission standards in the united states on ozone and particulate matter. Atmos. Environ. 60, 109–120 (2012)MATHMathSciNetCrossRefGoogle Scholar
  5. 5.
    US EPA: Ground level ozone health effects (2014)Google Scholar
  6. 6.
    Misra, C., et al.: In-use nox emissions from model year 2010 and 2011 heavy-duty diesel engines equipped with aftertreatment devices. Env. Sci. tech. 47(14), 7892–7898 (2013)CrossRefGoogle Scholar
  7. 7.
    Office of Transportation & Air Quality: Mpg: Label values vs. corporate average fuel economy (cafe) values label mpg (2014)Google Scholar
  8. 8.
    Diggle, P.J., Chetwynd, A.G., Häggkvist, R., Morris, S.E.: Second-order analysis of space-time clustering. Stat. Methods Med. Res. 4(2), 124–136 (1995)CrossRefGoogle Scholar
  9. 9.
    Gabriel, E., Diggle, P.J.: Second-order analysis of inhomogeneous spatio-temporal point process data. Stat. Neerl. 63(1), 43–51 (2009)MathSciNetCrossRefGoogle Scholar
  10. 10.
    Schluter, T., Conrad, S.: About the analysis of time series with temporal association rule mining. In: IEEE Symposium on Computational Intelligence and Data Mining, pp. 325–332 (2011)Google Scholar
  11. 11.
    Harms, S.K., Deogun, J.S.: Sequential association rule mining with time lags. J. Intell. Inf. Syst. 22(1), 7–22 (2004)CrossRefGoogle Scholar
  12. 12.
    Sacchi, L., Larizza, C., Combi, C., Bellazzi, R.: Data mining with temporal abstractions: learning rules from time series. Data Min. Knowl. Disc. 15(2), 217–247 (2007)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Das, G., et al.: Rule discovery from time series. In: Proceedings of the ACM International Conference on Knowledge and Data Discovery, pp. 16–22 (1998)Google Scholar
  14. 14.
    Daw, C.S., Finney, C.E.A., Tracy, E.R.: A review of symbolic analysis of experimental data. Rev. Sci. Instrum. 74(2), 915–930 (2003)CrossRefGoogle Scholar
  15. 15.
    Kotsiantis, S., Kanellopoulos, D.: Discretization techniques: a recent survey. GESTS Intl. Trans. Computer Sci. Eng. 32(1), 47–58 (2006)Google Scholar
  16. 16.
    Lin, J., Keogh, E., Wei, L., Lonardi, S.: Experiencing sax: a novel symbolic representation of time series. Data Min. Knowl. Disc. 15(2), 107–144 (2007)MathSciNetCrossRefGoogle Scholar
  17. 17.
    Dixon, P.M.: Ripley’s k function. Encyclopedia of environmetrics. Wiley, New York (2002)Google Scholar
  18. 18.
    Turns, S.R.: An Introduction to Combustion: Concepts and Applications, vol. 287, 3rd edn. McGraw-hill, New York (2012)Google Scholar
  19. 19.
    Cohen, E., et al.: Finding interesting associations without support pruning. IEEE Trans. Knowl. Data Eng. 13(1), 64–78 (2001)CrossRefGoogle Scholar
  20. 20.
    McIntosh, T., Chawla, S.: High confidence rule mining for microarray analysis. IEEE/ACM Trans. Comput. Biol. Bioinf. 4(4), 611–623 (2007)CrossRefGoogle Scholar
  21. 21.
    Zakaria, W., Kotb, Y., Ghaleb, F.: Mcr-miner: maximal confident association rules miner algorithm for up/down-expressed genes. Appl. Math 8(2), 799–809 (2014)Google Scholar
  22. 22.
    Huang, Y., et al.: Mining confident co-location rules without a support threshold. In: Proceedings of the 2003 ACM symposium on Applied computing, pp. 497–501 (2003)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Reem Y. Ali
    • 1
  • Venkata M.V. Gunturi
    • 1
  • Andrew J. Kotz
    • 2
  • Shashi Shekhar
    • 1
  • William F. Northrop
    • 2
  1. 1.Department of Computer Science and EngineeringUniversity of MinnesotaMinneapolisUSA
  2. 2.Department of Mechanical EngineeringUniversity of MinnesotaMinneapolisUSA

Personalised recommendations