Gene Annotation and Pathway Mapping in KEGG

  • Kiyoko F Aoki-Kinoshita
  • Minoru Kanehisa
Part of the Methods In Molecular Biology™ book series (MIMB, volume 396)


KEGG is a database resource ( that provides all knowledge about genomes and their relationships to biological systems such as cells and whole organisms as well as their interactions with the environment. KEGG is categorized in terms of building blocks in the genomic space, known as KEGG GENES, the chemical space, KEGG LIGAND, as well as wiring diagrams of interaction and reaction networks, known as KEGG PATHWAY. A fourth database called KEGG BRITE was also recently incorporated to provide computerized annotations and pathway reconstruction based on the current KEGG knowledgebase. KEGG BRITE contains KEGG Orthology (KO), a classification of ortholog and paralog groups based on highly confident sequence similarity scores, and the reaction classification system for biochemical reaction classification, along with other classifications for compounds and drugs. BRITE is also the basis for the KEGG Automatic Annotation Server (KAAS), which automatically annotates a given set of genes and correspondingly generates pathway maps. This chapter introduces KEGG and its various tools for genomic analyses, focusing on the usage of the KEGG GENES, PATHWAY, and BRITE resources and the KAAS tool (see Note 1)

Key Words

KEGG KO KEGG Orthology gene annotation pathway mapping KAAS 


  1. 1.
    Kanehisa, M., Goto, S., Hattori, M., et al. (2006) From genomics to chemical genomics: new developments in KEGG. Nucleic AcIDs Res. 34, D354–D357.CrossRefPubMedGoogle Scholar
  2. 2.
    Kanehisa, M., Goto, S., Kawashima, S., Okuno, Y., and Hattori, M. (2004) The KEGG resource for deciphering the genome. Nucleic AcIDs Res. 32, D277–D280.CrossRefPubMedGoogle Scholar
  3. 3.
    Pruitt, K. D., Tatusova, T., and Maglott, D. R. (2005) NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic AcIDs Res. 33, D501–D504.CrossRefPubMedGoogle Scholar
  4. 4.
    Benson, D. A., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., and Wheeler, D. L. (2006) GenBank. Nucleic AcIDs Res. 34, D16–D20.CrossRefPubMedGoogle Scholar
  5. 5.
    Smith, T. F. and Waterman, M. S. (1981) IDentification of common molecular subsequences. J. Mol. Biol. 147, 195–197.CrossRefPubMedGoogle Scholar
  6. 6.
    Kanehisa, M., Goto, S., Kawashima, S., and Nakaya, A. (2002) The KEGG databases at GenomeNet. Nucleic AcIDs Res. 30, 42–46.CrossRefPubMedGoogle Scholar
  7. 7.
    Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A., and Kanehisa, M. (2007) KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic AcIDs Res. in press.Google Scholar
  8. 8.
    Tatusov, R. L., Fedorova, N. D., Jackson, J. D., et al. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4, 41.CrossRefPubMedGoogle Scholar
  9. 9.
    (2006) The Gene Ontology (GO) project in 2006. Nucleic AcIDs Res. 34, D322–D326.Google Scholar
  10. 10.
    Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) Basic local alignment search tool. J. Mol. Biol. 215, 403–410.PubMedGoogle Scholar
  11. 11.
    Altschul, S. F., Madden, T. L., Schaffer, A. A., et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic AcIDs Res. 25, 3389–3402.CrossRefPubMedGoogle Scholar
  12. 12.
    Aoki, K. F. and Kanehisa, M. (2005) in Current Protocols in Bioinformatics, (Andreas D. and Baxevanis, L. D. S., ed.), John Wiley and Sons, Ltd.Google Scholar

Copyright information

© Humana Press Inc. 2007

Authors and Affiliations

  • Kiyoko F Aoki-Kinoshita
    • 1
  • Minoru Kanehisa
    • 2
  1. 1.Faculty of EngineeringDepartment of Bioinformatics Soka UniversityUSA
  2. 2.Kyoto University Bioinformatics Center, Human Genome CenterInstitute of Medical Science, University of TokyoJapan

Personalised recommendations