Biological Databases

Selzer, Paul M.; Marhöfer, Richard J.; Koch, Oliver

doi:10.1007/978-3-319-68301-0_2

Paul M. Selzer⁴,
Richard J. Marhöfer⁵ &
Oliver Koch⁶

124k Accesses
1 Citations

Abstract

The most important basis for applied bioinformatics is the collection of sequence data and its associated biological information. For example, with genome sequencing projects such data are generated daily in very large quantities worldwide. In order to use these data appropriately, a structured filing system of the data is necessary, yet the data should also be accessible to those interested. Annually, the journal Nucleic Acids Research [nar] dedicates an entire issue (first issue in January) to all available biological databases that are recorded in tabular form with the respective URLs. Furthermore, for a number of databases, original articles describe their functions. This database issue, which is freely accessible also on the Web, is a good starting point for working with biological databases. Depending on the kind of data included, different categories of biological databases can be distinguished. Primary databases contain primary sequence information (nucleotide or protein) and accompanying annotation information regarding function, bibliographies, cross references to other databases, and so forth. Secondary biological databases, however, summarize the results from analyses of primary protein sequence databases. The aim of these analyses is to derive common features for sequence classes, which in turn can be used for the classification of unknown sequences (annotation). In addition, all other databases that save biological or medical information, for example, literature databases, are frequently classified as secondary databases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Andreeva A, Howorth D, Chothia C, Kulesha E, Murzin AG (2014) SCOP2 prototype: a new approach to protein structure mining. Nucleic Acids Res 42(Databaseissue):D310–D314
Article CAS Google Scholar
Attwood TK, Bradley P, Flower DR, Gaulton A et al (2003) PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Res 31:400–402
Article CAS Google Scholar
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242
Article CAS Google Scholar
Finn RD, Coggill P, Eberhardt RY, Eddy SR et al (2016) The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res 44:D279–D285
Article CAS Google Scholar
Greene LH, Lewis TE, Addou S, Cuff A et al (2007) The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res 35:D291–D297
Article CAS Google Scholar
Kahraman A, Avramov A, Nashev L, Popov D et al (2005) PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics. Bioinformatics 21:418–420
Article CAS Google Scholar
Kim KS, Lilburn TG, Renner MJ, Breznak JA (1998) arfI and arfII, two genes of encoding alpha-L-arabinofuranosidases in Cytophaga xylanolytica. Appl Environ Microbiol 64:1919–1923
CAS PubMed PubMed Central Google Scholar
Mulder NJ, Apweiler R, Attwood TK, Bairoch A et al (2007) New developments in the InterPro database. Nucleic Acids Res 35:D224–D228
Article CAS Google Scholar
Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247:536–540
CAS PubMed Google Scholar
NCBI Resource Coordinators (2016) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 45:D12–D17
Article Google Scholar
Sigrist CJA, de Castro E, Cerutti L, Cuche BA, Bougueleret L, Xenarios I (2012) New and continuing developments at PROSITE. Nucleic Acids Res 41:D344–D347
Article Google Scholar
The UniProt Consortium (2016) UniProt: the universal protein knowledgebase. Nucleic Acids Res 45:D158–D169
Article Google Scholar

Author information

Authors and Affiliations

Boehringer Ingelheim, Animal Health, Ingelheim am Rhein, Germany
Paul M. Selzer
MSD Animal Health, Innovation GmbH, Schwabenheim, Germany
Richard J. Marhöfer
TU Dortmund University, Faculty of Chemistry and Chemical Biology, Dortmund, Germany
Oliver Koch

Authors

Paul M. Selzer
View author publications
You can also search for this author in PubMed Google Scholar
Richard J. Marhöfer
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Koch
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Selzer, P.M., Marhöfer, R.J., Koch, O. (2018). Biological Databases. In: Applied Bioinformatics. Springer, Cham. https://doi.org/10.1007/978-3-319-68301-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-68301-0_2
Published: 03 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68299-0
Online ISBN: 978-3-319-68301-0
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

Biological Databases

Abstract

Access this chapter

References

Further Reading

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Biological Databases

Abstract

Access this chapter

References

Further Reading

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation