Skip to main content

The Teaching Complexity of Erasing Pattern Languages with Bounded Variable Frequency

  • Conference paper
  • First Online:
Developments in Language Theory (DLT 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11647))

Included in the following conference series:

  • 320 Accesses

Abstract

Patterns provide a concise, syntactic way of describing a set of strings, but their expressive power comes at a price: a number of fundamental decision problems concerning (erasing) pattern languages, such as the membership problem and inclusion problem, are known to be NP-complete or even undecidable, while the decidability of the equivalence problem is still open; in learning theory, the class of pattern languages is unlearnable in models such as the distribution-free (PAC) framework (if \(\mathcal {P}/poly \ne \mathcal {NP}/poly\)). Much work on the algorithmic learning of pattern languages has thus focussed on interesting subclasses of patterns for which positive learnability results may be achieved. A natural restriction on a pattern is a bound on its variable frequency – the maximum number m such that some variable occurs exactly m times in the pattern. This paper examines the effect of limiting the variable frequency of all patterns belonging to a class \(\varPi \) on the worst-case minimum number of labelled examples needed to uniquely identify any pattern of \(\varPi \) in cooperative teaching-learning models. Two such models, the teaching dimension model as well as the preference-based teaching model, will be considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Roughly speaking, a class of languages is learnable in the limit if there is a learning algorithm such that, given any infinite sequence of all positive examples for any language L in the class, the algorithm outputs a corresponding sequence of guesses for the target language (based on a representation system for the languages in the class) that converges to a fixed representation for L; this model is due to Gold [14].

  2. 2.

    This implies that for every pattern \(\pi \) belonging to any one of these classes, \(L(\pi )\) contains a finite set that distinguishes \(\pi \) from all \(\pi '\) in the class such that \(L(\pi ') \subset L(\pi )\) [4, Theorem 1].

References

  1. Aho, A.V.: Algorithms for finding patterns in strings. In: van Leeuwen, J. (ed.) Handbook of Theoretical Computer Science. Algorithms and Complexity, vol. A, chap. 5, pp. 257–300. MIT Press, Oxford (1990)

    Chapter  Google Scholar 

  2. Amir, A., Nor, I.: Generalized function matching. J. Disc. Algorithms 5(3), 514–523 (2007)

    Article  MathSciNet  Google Scholar 

  3. Angluin, D.: Finding patterns common to a set of strings. J. Comput. Syst. Sci. 21, 46–62 (1980)

    Article  MathSciNet  Google Scholar 

  4. Angluin, D.: Inductive inference of formal languages from positive data. Inf. Control 45(2), 117–135 (1980)

    Article  MathSciNet  Google Scholar 

  5. Angluin, D., Aspnes, J., Eisenstat, S., Kontorovich, A.: On the learnability of shuffle ideals. J. Mach. Learn. Res. 14, 1513–1531 (2013)

    MathSciNet  MATH  Google Scholar 

  6. Baker, B.S.: Parameterized pattern matching: algorithms and applications. J. Comput. Syst. Sci. 52(1), 28–42 (1996)

    Article  MathSciNet  Google Scholar 

  7. Bayeh, F., Gao, Z., Zilles, S.: Erasing pattern languages distinguishable by a finite number of strings. In: ALT, pp. 72–108 (2017)

    Google Scholar 

  8. Campeanu, C., Salomaa, K., Yu, S.: A formal study of practical regular expressions. Int. J. Found. Comput. Sci. 14(6), 1007–1018 (2003)

    Article  MathSciNet  Google Scholar 

  9. Day, J.D., Fleischmann, P., Manea, F., Nowotka, D.: Local patterns. In: FSTTCS, pp. 24:1–24:14 (2017)

    Google Scholar 

  10. Day, J.D., Fleischmann, P., Manea, F., Nowotka, D., Schmid, M.L.: On matching generalised repetitive patterns. In: Hoshi, M., Seki, S. (eds.) DLT 2018. LNCS, vol. 11088, pp. 269–281. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98654-8_22

    Chapter  Google Scholar 

  11. Fernau, H., Manea, F., Mercas, R., Schmid, M.L.: Pattern matching with variables: fast algorithms and new hardness results. In: STACS, pp. 302–315 (2015)

    Google Scholar 

  12. Fernau, H., Schmid, M.L.: Pattern matching with variables: a multivariate complexity analysis. Inf. Comput. 242, 287–305 (2015)

    Article  MathSciNet  Google Scholar 

  13. Freydenberger, D.D., Schmid, M.L.: Deterministic regular expressions with back-references. In: STACS, pp. 33:1–33:14 (2017)

    Google Scholar 

  14. Gold, E.M.: Language identification in the limit. Inf. Control 10, 447–474 (1967)

    Article  MathSciNet  Google Scholar 

  15. Gao, Z., Ries, C., Simon, H.U., Zilles, S.: Preference-based teaching. J. Mach. Learn. Res. 18, 1–32 (2017)

    MathSciNet  MATH  Google Scholar 

  16. Goldman, S.A., Kearns, M.J.: On the complexity of teaching. J. Comput. Syst. Sci 50, 20–31 (1995)

    Article  MathSciNet  Google Scholar 

  17. Jain, S., Ong, Y.S., Stephan, F.: Regular patterns, regular languages and context-free languages. Inf. Proc. Lett. 110(24), 1114–1119 (2010)

    Article  MathSciNet  Google Scholar 

  18. Jiang, T., Kinber, E., Salomaa, A., Salomaa, K., Yu, S.: Pattern languages with and without erasing. Int. J. Comput. Math. 50, 147–163 (1994)

    Article  Google Scholar 

  19. Lothaire, M.: Combinatorics on Words, Cambridge Mathematical Library. Cambridge University Press, Cambridge (1997). Corrected reprint of the 1983 original

    Google Scholar 

  20. Lothaire, M.: Algebraic Combinatorics on Words. Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge (2002)

    Book  Google Scholar 

  21. Matsumoto, S., Shinohara, A.: Learning pattern languages using queries. In: Ben-David, S. (ed.) EuroCOLT 1997. LNCS, vol. 1208, pp. 185–197. Springer, Heidelberg (1997). https://doi.org/10.1007/3-540-62685-9_16

    Chapter  Google Scholar 

  22. Mitchell, A.R.: Learnability of a subclass of extended pattern languages. In: COLT, pp. 64–71 (1998)

    Google Scholar 

  23. Ohlebusch, E., Ukkonen, E.: On the equivalence problem for e-pattern languages. Theor. Comput. Sci 186(1–2), 231–248 (1997)

    Article  MathSciNet  Google Scholar 

  24. Reidenbach, D.: A non-learnable class of e-pattern languages. Theor. Comput. Sci 350(1), 91–102 (2006)

    Article  MathSciNet  Google Scholar 

  25. Reidenbach, D.: Discontinuities in pattern inference. Theor. Comput. Sci 397, 166–193 (2008)

    Article  MathSciNet  Google Scholar 

  26. Schmid, M.L.: Characterising REGEX languages by regular languages equipped with factor-referencing. Inf. Comput. 249, 1–17 (2016)

    Article  MathSciNet  Google Scholar 

  27. Shinohara, A., Miyano, S.: Teachability in computational learning. New Gener. Comput. 8(4), 337–347 (1991)

    Article  Google Scholar 

  28. Shinohara, T.: Polynomial time inference of extended regular pattern languages. In: Goto, E., Furukawa, K., Nakajima, R., Nakata, I., Yonezawa, A. (eds.) RIMS Symposia on Software Science and Engineering. LNCS, vol. 147, pp. 115–127. Springer, Heidelberg (1983). https://doi.org/10.1007/3-540-11980-9_19

    Chapter  Google Scholar 

  29. Zhu, X., Singla, A., Zilles, S., Rafferty, A.N.: An overview of machine teaching (2018, manuscript). http://arxiv.org/abs/1801.05927

Download references

Acknowledgements

The author was supported (as RF) by the Singapore Ministry of Education Academic Research Fund grant MOE2016-T2-1-019/R146-000-234-112. I sincerely thank Fahimeh Bayeh, Sanjay Jain and Sandra Zilles for proofreading the manuscript; their numerous suggestions for corrections and improvements (such as studying the PBTD of m-quasi-regular patterns over unary alphabets) are gratefully acknowledged. Many thanks are also due to the anonymous referees of this paper for their very helpful comments and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ziyuan Gao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gao, Z. (2019). The Teaching Complexity of Erasing Pattern Languages with Bounded Variable Frequency. In: Hofman, P., Skrzypczak, M. (eds) Developments in Language Theory. DLT 2019. Lecture Notes in Computer Science(), vol 11647. Springer, Cham. https://doi.org/10.1007/978-3-030-24886-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-24886-4_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24885-7

  • Online ISBN: 978-3-030-24886-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics