Skip to main content

How to Make SQL Stand for String Query Language

  • Conference paper
  • First Online:
Research Issues in Structured and Semistructured Database Programming (DBPL 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1949))

Included in the following conference series:

Abstract

A string database is simply a collection of tables, the columns of which contain strings over some given alphabet.We address in this paper the issue of designing a simple, user friendly query language for string databases. We focus on the language FO(●), which is classical first order logic extended with a concatenation operator, and where quantifiers range over the set of all strings.We wish to capture all string queries, i.e., well-typed and computable mappings involving a notion of string genericity. Unfortunately, unrestricted quantification may allow some queries to have infinite output. This leads us to study the “safety” problem for FO(●), that is, how to build syntactic and/or semantic restrictions so as to obtain a language expressing only queries with finite output, hopefully all string queries. We introduce a family of such restrictions and study their expressivness and complexity. We prove that none of these languages express all string queries. We prove that a family of these languages is equivalent to a simple, tractable language that we call SriQueL, standing for String Query Language, which thus emerges a robust and natural language suitable for string querying.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Abiteboul, R. Hull and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. 62, 64, 65, 67, 69, 74, 78

    Google Scholar 

  2. N. Balkir, E. Sukan, G. Ozsoyoglu and Z. Oszoyoglu. VISUAL: A graphical iconbased query language. In IEEE International Conferecne on Data Engineering, 1996. 75

    Google Scholar 

  3. M. Benedikt, L. Libkin. Languages for Relational Databases over Interpreted Structures. In Proc. ACM Symp. on Principles of Database Systems, 1997. 68, 74

    Google Scholar 

  4. M. Benedikt, L. Libkin. Safe Constraint Queries. In Proc. ACM Symp. on Principles of Database Systems, 1998.

    Google Scholar 

  5. E. F. Codd. A Relational Model of Data for Large Shared Data Banks. Communications of ACM, 13:6, 1970 61

    Article  Google Scholar 

  6. J. Collado-Vides. The search for a grammatical theory of gene regulation is formally justified by showing the inadequacy of context-free grammars. Computer applications in the Biosciences, 7(3):321–326, 1991. 61

    Google Scholar 

  7. C. Date. An introduction to database systems. Addison-Wesley, 1994. 75

    Google Scholar 

  8. V. Diekert, Makanin’s algorithm. Available at http://www.lri.fr/~rtaloop/92.html 66, 76

  9. T. Eiter, G. Gottlob, Y. Gurevich. Existential Second-Order Logic over Strings. In Proc. Symp. on Logic in Computer Science, 1998. 76

    Google Scholar 

  10. T. Etzold and P. Argos. SRS-an indexing and retrieval tool for flat file data libraries. Computer applications in the Biosciences, 9(1):49–57, 1993. 74

    Google Scholar 

  11. T. Etzold and P. Argos. Transforming a set of biological flat file libraries to a fast access network. Computer applications in the Biosciences, 9(1):59–64, 1993 74

    Google Scholar 

  12. European Molecular Biologiy Network (EMBNET). Strategies in Bioinformatics: a european perspective, 1994. 75

    Google Scholar 

  13. Z. Galil, J. I. Seiferas. Time-Space-Optimal String Matching. Journal of Computer and System Sciences 26(3): 280–294, 1983. 75

    Article  MathSciNet  Google Scholar 

  14. S. Ginsburg, X. S. Wang. Pattern Matching by Rs-Operations: Toward a Unified Approach to Querying Sequenced Data. In Proc. ACM Symp. on Principles of Database Systems, 1992. 62, 75, 76

    Google Scholar 

  15. S. Ginsburg and W. Wang. Regular Sequence Operations and Their Use in Database Queries. Journal of Computer and System Science 56(1): 1–26 (1998) 67, 75

    Article  MATH  MathSciNet  Google Scholar 

  16. G. Grahne, M. Nykänen, E. Ukkonen. Reasoning about Strings in Databases. In Proc. ACM Symp. on Principles of Database Systems, 1994. 62, 75, 76

    Google Scholar 

  17. G. Grahne, M. Nykänen. Safety, translation and evaluation of Alignment calculus. First East-European Symposium on Advances in Database Information Systems (ABDIS’97), St-Petersburg, Russia, 1997. 75

    Google Scholar 

  18. G. Grahne, R. Hakli, M. Nykanen and E. Ukkonen. AQL: An alignment based language for querying string databases. Proc. of the 9TH International Conference on Management of Data (Comad’ 98), Hyderabad, India, December 1998. 75

    Google Scholar 

  19. R. Hull, J. Su. Domain independence and the relational calculus. Acta informatica, 31:6, 1994. 77

    Article  MathSciNet  Google Scholar 

  20. J. W. Lloyd Foundations of logic programming. Springer, 1984. 75

    Google Scholar 

  21. J. Barwise (Ed). Handbool of mathematical logic. North-Holland, 1991.

    Google Scholar 

  22. G. S. Makanin. The problem of solvability of equations in a free semi-group. english translation in Math. USSR Sbornik, 32, 129–198, 1977. 66, 76

    Article  MATH  Google Scholar 

  23. G. Mecca, A. J. Bonner. Sequences, Datalog and Transducers. In Proc. ACM Symp. on Principles of Database Systems, 1995. 62, 67, 75, 76

    Google Scholar 

  24. C. H. Papadimitriou. Computational complexity. Addison-Wesley, Chap. 17, 1994.

    Google Scholar 

  25. W. V. Quine. Concatenation as a basis for arithmetic. Journal of Symbolic Logic, 11(4):105–114, 1946 62, 76

    Article  MATH  MathSciNet  Google Scholar 

  26. J. Richardson. Supporting lists in a data model (a timely approach). Proc. of Intl. Conf. on Very Large Data Bases, 1992. 75

    Google Scholar 

  27. G. Rozenberg, A. Salomaa (Eds). Handbook of formal languages. Springer, 1997. 76

    Google Scholar 

  28. A. Salomaa. Formal languages, Chap. 3, Academic Press, 1973. 76

    Google Scholar 

  29. D. Searls. String variable grammar: a logic grammar formalism for the biological language of DNA. Journal of Logic Programming, pp. 73–102, 1995. 61

    Google Scholar 

  30. P. Seshadri, M. Livny and R. Ramakrishnan. SEQ: a model for sequence databases. IEEE International Conference on Data Engineering, pp. 232–239, 1995. 75

    Google Scholar 

  31. P. Seshadri, M. Livny and R. Ramakrishnan. The design and implementation of a sequence database system. Proc. of Intl. Conf. on Very Large Data Bases, 1996. 75

    Google Scholar 

  32. L. J. Stockmeyer. The polynomial-time hierarchy. Theoretical Computer Science, 3:1–22, 1977 62, 76

    Article  MATH  MathSciNet  Google Scholar 

  33. B. Subramanian, T. Leung, S. Vandenberg and S. Zdonik. The AQUA appraoch to querying lists and trees in objsct-oriented databases. IEEE International Conference on Data Engineering, pp. 80–89, 1995. 62

    Google Scholar 

  34. R. Treinen. A new method for undecidability proofs of first order theories. J. Symbolic Computation, 14:437–457, 1992. 76

    Article  MATH  MathSciNet  Google Scholar 

  35. J. Ullman. Principles of database and knowledge-base systems, Volume 1. Computer Science Press, 1988. 62

    Google Scholar 

  36. P. Wolper. Temporal logic can be more expressive. Information and Control, 56:72–99, 1983. 75

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Grahne, G., Waller, E. (2000). How to Make SQL Stand for String Query Language. In: Connor, R., Mendelzon, A. (eds) Research Issues in Structured and Semistructured Database Programming. DBPL 1999. Lecture Notes in Computer Science, vol 1949. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44543-9_5

Download citation

  • DOI: https://doi.org/10.1007/3-540-44543-9_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41481-0

  • Online ISBN: 978-3-540-44543-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics