Abstract
We present a framework for learning DFA from simple examples. We show that efficient PAC learning of DFA is possible if the class of distributions is restricted to simple distributions where a teacher might choose examples based on the knowledge of the target concept. This answers an interesting variant of an open research question posed in Pitt's seminal paper: Are DFA's PAC-identifiable if examples are drawn from the uniform distribution, or some other known simple distribution? Our approach uses the RPNI algorithm for learning DFA from labeled examples. In particular, we describe an efficient learning algorithm for exact learning of the target DFA with high probability when a bound on the number of states (N) of the target DFA is known in advance. When N is not known, we show how this algorithm can be used for efficient PAC learning of DFAs.
This research was partially supported by grants from the National Science Foundation (IRI-9409580 and IRI-9643299) and the John Deere Foundation to Vasant Honavar.
Preview
Unable to display preview. Download preview PDF.
References
Angluin, D. (1981). A Note on the Number of Queries Needed to Identify Regular Languages. Information and Control, 51, 76–87.
Angluin, D. (1987). Learning Regular Sets from Queries and Counterexamples. Information and Computation, 75, 87-106.
Chomsky, N. (1956). Three Models for the Description of Language. PGIT, 2(3), 113–124.
Denis, F., D'Halluin, C., & Gilleron, R. (1996). PAC Learning with Simple Examples. STACS'96 — Proceedings of the 13th Annual Symposium on the Theoretical Aspects of Computer Science, 231–242.
Dupont, P. (1996a). Incremental Regular Inference. Pages 222–237 of: Miclet, L., & Higuera, C. (eds), Proceedings of the Third ICGI-96, Lecture Notes in Artificial Intelligence 1147. Montpellier, France: Springer.
Dupont, P. (1996b). Utilisation et Apprentissage de Modees de Language pour la Reconnaissance de la Parole Continue. Ph.D. thesis, Ecole Normale Supérieure des Télécommunications, Paris, France.
Dupont, P., Miclet, L., & Vidal, E. (September 1994). What is the Search Space of the Regular Inference? Pages 25–37 of: Proceedings of the Second International Colloquium on Grammatical Inference (ICGI'94).
Gold, E. M. (1978). Complexity of Automaton Identification from Given Data. Information and Control, 37(3), 302–320.
Hopcroft, J., & Ullman, J. (1979). Introduction to Automata Theory, Languages, and Computation. Addison Wesley.
Kearns, M., & Valiant, L. G. (May 1989). Cryptographic Limitations on Learning Boolean Formulae and Finite Automata. Pages 433–444 of: Proceedings of the 21 st Annual ACM Symposium on Theory of Computing. ACM, New York.
Lang, K. J. (1992). Random DFA's can be approximately learned from sparse uniform sample. Pages 45–52 of., Proceedings of the 5th ACM workshop on Computational Learning Theory.
Li, M., & Vitányi, P. (1991). Learning Simple Concepts under Simple Distributions. SIAM Journal of Computing, 20, 911–935.
Li, M., & Vitányi, P. (1997). An Introduction to Kolmogorov Complexity and its Applications, 2nd edition. New York: Springer Verlag.
Oncina, J., & GarcÃa, P. (1992). Inferring Regular Languages in Polynomial Update Time. Pages 49–61 of:, Pérez, N. et al (ed), Pattern Recognition and Image Analysis. World Scientific.
Pao, T., & Carr, J. (1978). A Solution of the Syntactic Induction-Inference Problem for Regular Languages. Computer Languages, 3, 53–64.
Parekh, R.G., & Honavar, V. G. (August 1993). Efficient Learning of Regular Languages using Teacher Supplied Positive Examples and Learner Generated Queries. Pages 195–208 of: Proceedings of the Fifth UNB Conference on AI.
Pitt, L. (1989). Inductive Inference, DFAs and Computational Complexity. Pages 18–44 of: Analogical and Inductive Inference, Lecture Notes in Artificial Intelligence 397. Springer-Verlag.
Pitt, L., & Warmuth, M. K. (1988). Reductions among prediction problems: on the difficulty of predicting automata. Pages 60–69 of: Proceedings of the 3 rd I.E.E.E. Conference on Structure in Complexity Theory.
Pitt, L., & Warmuth, M. K. (1989). The minimum consistency DFA problem cannot be approximated within any polynomial. Pages 421–432 of Proceedings of the 21 st ACM Symposium on the Theory of Computing. ACM.
Trakhtenbrot, B., & Barzdin, Ya. (1973). Finite Automata: Behavior and Synthesis. Amsterdam: North Holland Publishing Company.
Valiant, L. (1984). A Theory of the Learnable. Communications of the ACM, 27, 1134–1142.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Parekh, R., Honavar, V. (1997). Learning DFA from simple examples. In: Li, M., Maruoka, A. (eds) Algorithmic Learning Theory. ALT 1997. Lecture Notes in Computer Science, vol 1316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63577-7_39
Download citation
DOI: https://doi.org/10.1007/3-540-63577-7_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63577-2
Online ISBN: 978-3-540-69602-5
eBook Packages: Springer Book Archive