Server-Side Query Language for Protein Structure Similarity Searching

  • B. Małysiak-Mrozek
  • S. Kozielski
  • D. Mrozek
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 99)


Protein structure similarity searching is a complex process, which is usually carried out through comparison of the given protein structure to a set of protein structures from a database. Since existing database management systems do not offer integrated exploration methods for querying protein structures, the structural similarity searching is usually performed by external tools. This often lengthens the processing time and requires additional processing steps, like adaptation of input and output data formats. In the paper, we present our extension to the SQL language, which allows to formulate queries against a database in order to find proteins having secondary structures similar to the structural pattern specified by a user. Presented query language is integrated with the relational database management system and it simplifies the manipulation of biological data.


Secondary Structure Secondary Structure Element Query Pattern Relational Database Management System Protein Structure Similarity 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [Allen 2008]
    Allen, J.P.: Biophysical chemistry. Wiley-Blackwell, London (2008)Google Scholar
  2. [Apweiler et al. 2004]
    Apweiler, R., Bairoch, A., Wu, C.H., et al.: Uniprot: the Universal Protein knowledgebase. Nucleic Acids Res. (Database issue) D115–119 (2004)Google Scholar
  3. [Berman et al. 2000]
    Berman, H.M., Westbrook, J., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000)CrossRefGoogle Scholar
  4. [Can and Wang 2003]
    Can, T., Wang, Y.F.: CTSS: a robust and efficient method for protein structure alignment based on local geometrical and biological features. In: Proc. 2003 IEEE Bioinf. Conf., Stanford, CA, pp. 169–179 (2003)Google Scholar
  5. [Frishman and Argos 1996]
    Frishman, D., Argos, P.: Incorporation of non-local interactions in protein secondary structure prediction from the amino acid sequence. Protein Eng. 9(2), 133–142 (1996)CrossRefGoogle Scholar
  6. [Murzin et al. 1995]
    Murzin, A.G., Brenner, S.E., Hubbard, T., et al.: SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995)Google Scholar
  7. [Orengo et al. 1997]
    Orengo, C.A., Michie, A.D., Jones, S., et al.: CATH – A hierarchic classification of protein domain structures. Structure 5(8), 1093–1108 (1997)CrossRefGoogle Scholar
  8. [Shapiro et al. 2004]
    Shapiro, J., Brutlag, D.: FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web. Nucleic Acids Res. 32, 536–541 (2004)CrossRefGoogle Scholar
  9. [Smith and Waterman 1981]
    Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197 (1981)CrossRefGoogle Scholar
  10. [Yang 2008]
    Yang, J.: Comprehensive description of protein structures using protein folding shape code. Proteins 71(3), 1497–1518 (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • B. Małysiak-Mrozek
    • 1
  • S. Kozielski
    • 1
  • D. Mrozek
    • 1
  1. 1.Institute of InformaticsSilesian University of TechnologyGliwicePoland

Personalised recommendations