The Teaching Complexity of Erasing Pattern Languages with Bounded Variable Frequency

Gao, Ziyuan

doi:10.1007/978-3-030-24886-4_11

Ziyuan Gao¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11647))

Included in the following conference series:

International Conference on Developments in Language Theory

320 Accesses

Abstract

Patterns provide a concise, syntactic way of describing a set of strings, but their expressive power comes at a price: a number of fundamental decision problems concerning (erasing) pattern languages, such as the membership problem and inclusion problem, are known to be NP-complete or even undecidable, while the decidability of the equivalence problem is still open; in learning theory, the class of pattern languages is unlearnable in models such as the distribution-free (PAC) framework (if \(\mathcal {P}/poly \ne \mathcal {NP}/poly\)). Much work on the algorithmic learning of pattern languages has thus focussed on interesting subclasses of patterns for which positive learnability results may be achieved. A natural restriction on a pattern is a bound on its variable frequency – the maximum number m such that some variable occurs exactly m times in the pattern. This paper examines the effect of limiting the variable frequency of all patterns belonging to a class \(\varPi \) on the worst-case minimum number of labelled examples needed to uniquely identify any pattern of \(\varPi \) in cooperative teaching-learning models. Two such models, the teaching dimension model as well as the preference-based teaching model, will be considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Roughly speaking, a class of languages is learnable in the limit if there is a learning algorithm such that, given any infinite sequence of all positive examples for any language L in the class, the algorithm outputs a corresponding sequence of guesses for the target language (based on a representation system for the languages in the class) that converges to a fixed representation for L; this model is due to Gold [14].
2.
This implies that for every pattern \(\pi \) belonging to any one of these classes, \(L(\pi )\) contains a finite set that distinguishes \(\pi \) from all \(\pi '\) in the class such that \(L(\pi ') \subset L(\pi )\) [4, Theorem 1].

References

Aho, A.V.: Algorithms for finding patterns in strings. In: van Leeuwen, J. (ed.) Handbook of Theoretical Computer Science. Algorithms and Complexity, vol. A, chap. 5, pp. 257–300. MIT Press, Oxford (1990)
Chapter Google Scholar
Amir, A., Nor, I.: Generalized function matching. J. Disc. Algorithms 5(3), 514–523 (2007)
Article MathSciNet Google Scholar
Angluin, D.: Finding patterns common to a set of strings. J. Comput. Syst. Sci. 21, 46–62 (1980)
Article MathSciNet Google Scholar
Angluin, D.: Inductive inference of formal languages from positive data. Inf. Control 45(2), 117–135 (1980)
Article MathSciNet Google Scholar
Angluin, D., Aspnes, J., Eisenstat, S., Kontorovich, A.: On the learnability of shuffle ideals. J. Mach. Learn. Res. 14, 1513–1531 (2013)
MathSciNet MATH Google Scholar
Baker, B.S.: Parameterized pattern matching: algorithms and applications. J. Comput. Syst. Sci. 52(1), 28–42 (1996)
Article MathSciNet Google Scholar
Bayeh, F., Gao, Z., Zilles, S.: Erasing pattern languages distinguishable by a finite number of strings. In: ALT, pp. 72–108 (2017)
Google Scholar
Campeanu, C., Salomaa, K., Yu, S.: A formal study of practical regular expressions. Int. J. Found. Comput. Sci. 14(6), 1007–1018 (2003)
Article MathSciNet Google Scholar
Day, J.D., Fleischmann, P., Manea, F., Nowotka, D.: Local patterns. In: FSTTCS, pp. 24:1–24:14 (2017)
Google Scholar
Day, J.D., Fleischmann, P., Manea, F., Nowotka, D., Schmid, M.L.: On matching generalised repetitive patterns. In: Hoshi, M., Seki, S. (eds.) DLT 2018. LNCS, vol. 11088, pp. 269–281. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98654-8_22
Chapter Google Scholar
Fernau, H., Manea, F., Mercas, R., Schmid, M.L.: Pattern matching with variables: fast algorithms and new hardness results. In: STACS, pp. 302–315 (2015)
Google Scholar
Fernau, H., Schmid, M.L.: Pattern matching with variables: a multivariate complexity analysis. Inf. Comput. 242, 287–305 (2015)
Article MathSciNet Google Scholar
Freydenberger, D.D., Schmid, M.L.: Deterministic regular expressions with back-references. In: STACS, pp. 33:1–33:14 (2017)
Google Scholar
Gold, E.M.: Language identification in the limit. Inf. Control 10, 447–474 (1967)
Article MathSciNet Google Scholar
Gao, Z., Ries, C., Simon, H.U., Zilles, S.: Preference-based teaching. J. Mach. Learn. Res. 18, 1–32 (2017)
MathSciNet MATH Google Scholar
Goldman, S.A., Kearns, M.J.: On the complexity of teaching. J. Comput. Syst. Sci 50, 20–31 (1995)
Article MathSciNet Google Scholar
Jain, S., Ong, Y.S., Stephan, F.: Regular patterns, regular languages and context-free languages. Inf. Proc. Lett. 110(24), 1114–1119 (2010)
Article MathSciNet Google Scholar
Jiang, T., Kinber, E., Salomaa, A., Salomaa, K., Yu, S.: Pattern languages with and without erasing. Int. J. Comput. Math. 50, 147–163 (1994)
Article Google Scholar
Lothaire, M.: Combinatorics on Words, Cambridge Mathematical Library. Cambridge University Press, Cambridge (1997). Corrected reprint of the 1983 original
Google Scholar
Lothaire, M.: Algebraic Combinatorics on Words. Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge (2002)
Book Google Scholar
Matsumoto, S., Shinohara, A.: Learning pattern languages using queries. In: Ben-David, S. (ed.) EuroCOLT 1997. LNCS, vol. 1208, pp. 185–197. Springer, Heidelberg (1997). https://doi.org/10.1007/3-540-62685-9_16
Chapter Google Scholar
Mitchell, A.R.: Learnability of a subclass of extended pattern languages. In: COLT, pp. 64–71 (1998)
Google Scholar
Ohlebusch, E., Ukkonen, E.: On the equivalence problem for e-pattern languages. Theor. Comput. Sci 186(1–2), 231–248 (1997)
Article MathSciNet Google Scholar
Reidenbach, D.: A non-learnable class of e-pattern languages. Theor. Comput. Sci 350(1), 91–102 (2006)
Article MathSciNet Google Scholar
Reidenbach, D.: Discontinuities in pattern inference. Theor. Comput. Sci 397, 166–193 (2008)
Article MathSciNet Google Scholar
Schmid, M.L.: Characterising REGEX languages by regular languages equipped with factor-referencing. Inf. Comput. 249, 1–17 (2016)
Article MathSciNet Google Scholar
Shinohara, A., Miyano, S.: Teachability in computational learning. New Gener. Comput. 8(4), 337–347 (1991)
Article Google Scholar
Shinohara, T.: Polynomial time inference of extended regular pattern languages. In: Goto, E., Furukawa, K., Nakajima, R., Nakata, I., Yonezawa, A. (eds.) RIMS Symposia on Software Science and Engineering. LNCS, vol. 147, pp. 115–127. Springer, Heidelberg (1983). https://doi.org/10.1007/3-540-11980-9_19
Chapter Google Scholar
Zhu, X., Singla, A., Zilles, S., Rafferty, A.N.: An overview of machine teaching (2018, manuscript). http://arxiv.org/abs/1801.05927

Download references

Acknowledgements

The author was supported (as RF) by the Singapore Ministry of Education Academic Research Fund grant MOE2016-T2-1-019/R146-000-234-112. I sincerely thank Fahimeh Bayeh, Sanjay Jain and Sandra Zilles for proofreading the manuscript; their numerous suggestions for corrections and improvements (such as studying the PBTD of m-quasi-regular patterns over unary alphabets) are gratefully acknowledged. Many thanks are also due to the anonymous referees of this paper for their very helpful comments and suggestions.

Author information

Authors and Affiliations

Department of Mathematics, National University of Singapore, 10 Lower Kent Ridge Road, Singapore, 119076, Republic of Singapore
Ziyuan Gao

Authors

Ziyuan Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ziyuan Gao .

Editor information

Editors and Affiliations

University of Warsaw, Warsaw, Poland
Piotrek Hofman
University of Warsaw, Warsaw, Poland
Michał Skrzypczak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gao, Z. (2019). The Teaching Complexity of Erasing Pattern Languages with Bounded Variable Frequency. In: Hofman, P., Skrzypczak, M. (eds) Developments in Language Theory. DLT 2019. Lecture Notes in Computer Science(), vol 11647. Springer, Cham. https://doi.org/10.1007/978-3-030-24886-4_11

Download citation

DOI: https://doi.org/10.1007/978-3-030-24886-4_11
Published: 10 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24885-7
Online ISBN: 978-3-030-24886-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics