A Unified Construction of the Glushkov, Follow, and Antimirov Automata
A number of different techniques have been introduced in the last few decades to create ε-free automata representing regular expressions such as the Glushkov automata, follow automata, or Antimirov automata. This paper presents a simple and unified view of all these construction methods both for unweighted and weighted regular expressions. It describes simpler algorithms with time complexities at least as favorable as that of the best previously known techniques, and provides a concise proof of their correctness. Our algorithms are all based on two standard automata operations: epsilon-removal and minimization. This contrasts with the multitude of complicated and special-purpose techniques previously described in the literature, and makes it straightforward to generalize these algorithms to the weighted case. In particular, we extend the definition and construction of follow automata to the case of weighted regular expressions over a closed semiring and present the first algorithm to compute weighted Antimirov automata.
KeywordsRegular Expression Weighted Case Empty String Unweighted Case Alphabet Symbol
Unable to display preview. Download preview PDF.
- 1.Aho, A.V., Sethi, R., Ullman, J.D.: Compilers, Principles, Techniques and Tools. Addison-Wesley, Reading (1986)Google Scholar
- 10.Giammarresi, D., Ponty, J.-L., Wood, D.: Glushkov and Thompson constructions: a synthesis (1998), http://www.cs.ust.hk/tcsc/RR/1998-11.ps.gz
- 13.Kleene, S.C.: Representations of events in nerve sets and finite automata. In: Shannon, C.E., McCarthy, J., Ashby, W.R. (eds.) Automata Studies, pp. 3–42. Princeton University Press, Princeton (1956)Google Scholar
- 17.Mohri, M.: Finite-State Transducers in Language and Speech Processing. Computational Linguistics 23, 2 (1997)Google Scholar
- 22.Ponty, J.-L., Ziadi, D., Champarnaud, J.-M.: A new quadratic algorithm to convert a regular expression into automata. In: Raymond, D.R., Yu, S., Wood, D. (eds.) WIA 1996. LNCS, vol. 1260, pp. 109–119. Springer, Heidelberg (1997)Google Scholar