Abstract
KEGG is an integrated database resource for linking sequences to biological functions from molecular to higher levels. Knowledge on molecular functions is stored in the KO (KEGG Orthology) database, while cellular- and organism-level functions are represented in the PATHWAY and MODULE databases. Genes in the complete genomes, which are stored in the GENES database, are given KO identifiers by the internal annotation procedure, enabling reconstruction of KEGG pathways and modules for interpretation of higher-level functions. This is possible because all the KEGG pathways and modules are represented as networks of KO nodes. Here we present knowledge-based prediction methods for functional characterization of amino acid sequences using the KEGG resource. Specifically we show how the tools available at the KEGG website including BlastKOALA and KEGG Mapper can be utilized for enzyme annotation and metabolic reconstruction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M (2016) KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res 44:D457–D462
McDonald AG, Tipton KF (2014) Fifty-five years of enzyme classification: advances and difficulties. FEBS J 281:583–592
Ogata H, Goto S, Fujibuchi W, Kanehisa M (1998) Computation with the KEGG pathway database. Biosystems 47:119–128
Kanehisa M, Sato Y, Morishima K (2016) BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol 428:726–731
Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658–1659
Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, Tanabe M (2014) Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 42:D199–D205
Kanehisa M (2013) Chemical and genomic evolution of enzyme-catalyzed reaction networks. FEBS Lett 587:2731–2737
Pearson WR (1991) Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics 11:635–650
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Suzuki S, Kakuta M, Ishida T, Akiyama Y (2014) GHOSTX: an improved sequence homology search algorithm using a query suffix array and a database suffix array. PLoS One 9:e103833
Wang JL, Ma KD, Wang YW, Wang HM, Li YB, Zhou S, Chen XR, Kong DL, Guo X, He MX, Ruan ZY (2016) Lentibacillus amyloliquefaciens sp. nov., a halophilic bacterium isolated from saline sediment sample. Antonie Van Leeuwenhoek 109:171–178
Takami H, Takaki Y, Uchiyama I (2002) Genome sequence of Oceanobacillus iheyensis isolated from the Iheya Ridge and its unexpected adaptive capabilities to extreme environments. Nucleic Acids Res 30:3927–3935
Acknowledgment
This work was partially supported by the National Bioscience Database Center of the Japan Science and Technology Agency. The computational resource for developing and servicing KEGG is provided by the Bioinformatics Center, Institute for Chemical Research, Kyoto University.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media LLC
About this protocol
Cite this protocol
Kanehisa, M. (2017). Enzyme Annotation and Metabolic Reconstruction Using KEGG. In: Kihara, D. (eds) Protein Function Prediction. Methods in Molecular Biology, vol 1611. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7015-5_11
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7015-5_11
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7013-1
Online ISBN: 978-1-4939-7015-5
eBook Packages: Springer Protocols