Query Languages and Evaluation Techniques for Biological Sequence Data

Tata, Sandeep; Patel, Jignesh M.

doi:10.1007/978-1-4899-7993-3_630-2

Sandeep Tata³ &
Jignesh M. Patel⁴

62 Accesses

Synonyms

Querying DNA sequences; Querying protein sequences

Definition

A common type of data that is used in life science applications is biological sequence data. Data such as DNA sequence and protein sequence data are growing at a very fast rate. For example, the data at GenBank[GB07] has been growing exponentially, doubling roughly every 18 months. These sequence datasets are often queried in complex ways and the methods required to query these sequences go far beyond the simple string matching methods that have been used in more traditional string applications. In order to enable users to easily pose sophisticated queries on these biological sequences, different languages have been designed to support a rich library of functions. In addition, some database systems have been extended to support a rich set of operators on the sequence data type. Compared to the stand-alone approach, the database method brings the power of algebraic query optimization and the use of indexes making it...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Author information

Authors and Affiliations

IBM Almaden Research Center, San Jose, CA, USA
Sandeep Tata
University of Wisconsin-Madison, Madison, WI, USA
Jignesh M. Patel

Authors

Sandeep Tata
View author publications
You can also search for this author in PubMed Google Scholar
Jignesh M. Patel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sandeep Tata .

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, Georgia, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, Ontario, Canada
M. Tamer Özsu

Section Editor information

Robert H. Smith School of Business, University of Maryland, Van Munching Hall (Mowatt Lane), 20742, College Park, MD, USA
Louiqa Raschid

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Tata, S., Patel, J.M. (2016). Query Languages and Evaluation Techniques for Biological Sequence Data. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_630-2

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7993-3_630-2
Received: 09 January 2015
Accepted: 21 October 2016
Published: 11 January 2017
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics

Query Languages and Evaluation Techniques for Biological Sequence Data

Synonyms

Definition

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Navigation

Query Languages and Evaluation Techniques for Biological Sequence Data

Synonyms

Definition

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Search

Navigation