Skip to main content

A Concept for a Tree-Based Log Parser Generator

  • Chapter
  • First Online:
Smart Log Data Analytics

Abstract

A key source of information describing a system’s current state is log data. However, accessing this information for further analysis is often complicated. Usually, log data is available in form of unstructured text lines and there exists no common standard for the appearance of logs. Hence, log parsers are required to pre-process log lines and structure their information for further analysis. State of the art log parsers still apply pre-defined lists of regular expressions, which are linearly processed and thus hinder online log analysis. Furthermore, defining log parsers manually is a cumbersome and time consuming task. Therefore, in this chapter we propose AECID-PG, a novel log parser generator. AECID-PG implements a density-based approach to automatically generate a tree-like parser, which reduces the complexity of log parsing significantly.

Parts of this chapter have been published in [120].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A more exhaustive list of model elements can be found in the AMiner (which is an agent that can apply the parser) documentation at: https://github.com/ait-aecid/logdata-anomaly-miner/blob/V2.2.3/source/root/usr/share/doc/logdata-anomaly-miner/aminer/ParsingModel.txt.

  2. 2.

    https://github.com/ait-aecid/logdata-anomaly-miner/.

  3. 3.

    https://github.com/ait-aecid/aecid-parsergenerator.

  4. 4.

    Note that the implementation of the aecid-parsergenerator demonstrated in this try-it-out includes new features that go beyond the theoretical discussions of the previous sections. In particular, the third rule stated in Sect. 7.3.3 is updated to support combinations of static and variable nodes, and a branch similarity metric is used to merge nodes followed by similar sub-trees. Further details can be found in the repository of the aecid-parsergenerator.

References

  1. R Gerhards. The syslog protocol: Rfc 5424. IETF Trust: Reston, VA, USA, 2009.

    Google Scholar 

  2. Risto Vaarandi. A data clustering algorithm for mining patterns from event logs. In Proceedings of the 3rd Workshop on IP Operations & Management (IPOM 2003), pages 119–126. IEEE, 2003.

    Google Scholar 

  3. Markus Wurzenberger, Max Landauer, Florian Skopik, and Wolfgang Kastner. Aecid-pg: A tree-based log parser generator to enable log analysis. In 2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), pages 7–12. IEEE, 2019.

    Google Scholar 

  4. Markus Wurzenberger, Florian Skopik, Giuseppe Settanni, and Roman Fiedler. Aecid: A self-learning anomaly detection approach based on light-weight log parser models. In ICISSP, pages 386–397, 2018.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Skopik, F., Wurzenberger, M., Landauer, M. (2021). A Concept for a Tree-Based Log Parser Generator. In: Smart Log Data Analytics. Springer, Cham. https://doi.org/10.1007/978-3-030-74450-2_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-74450-2_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-74449-6

  • Online ISBN: 978-3-030-74450-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics