Abstract
Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.
Similar content being viewed by others
References
Liolios K, Tavernarakis N, Hugenholtz P, Kyrpides NC (2006) Nucleic Acids Res 34:D332–4
Kim SH, Shin DH, Choi IG, Schulze-Gahmen U, Chen S, Kim R (2003) J Struct Funct Genomics 4:129–135
Chandonia JM, Kim SH (2006) BMC Struct Biol 6:7
Huang L, Hung L, Odell M, Yokota H, Kim R, Kim SH (2002) J Struct Funct Genomics 2:121–127
Chen S, Yakunin AF, Kuznetsova E, Busso D, Pufan R, Proudfoot M, Kim R, Kim SH (2004) J Biol Chem 279:31854–31862
Liu J, Yokota H, Kim R, Kim SH (2004) Proteins 55:1082–1086
Oganesyan V, Pufan R, DeGiovanni A, Yokota H, Kim R, Kim SH (2004) Acta Crystallogr D Biol Crystallogr 60:1266–1271
Schulze-Gahmen U, Aono S, Chen S, Yokota H, Kim R, Kim SH (2005) Acta Crystallogr D Biol Crystallogr 61:1343–1347
Kim JS, Shin DH, Pufan R, Huang C, Yokota H, Kim R, Kim SH (2006) Proteins 62:322–328
Oganesyan V, Busso D, Brandsen J, Chen S, Jancarik J, Kim R, Kim SH (2003) Acta Crystallogr D Biol Crystallogr 59:1219–1223
Liu J, Lou Y, Yokota H, Adams PD, Kim R, Kim SH (2005) J Biol Chem 280:15960–15966
Shin DH, Oganesyan N, Jancarik J, Yokota H, Kim R, Kim SH (2005) J Biol Chem 280:18326–18335
Shin DH, Roberts A, Jancarik J, Yokota H, Kim R, Wemmer DE, Kim SH (2003) Protein Sci 12:1464–1472
Roberts A, Lee SY, McCullagh E, Silversmith RE, Wemmer DE (2005) Proteins 58:790–801
Zarembinski TI, Hung LW, Mueller-Dieckmann HJ, Kim KK, Yokota H, Kim R, Kim SH (1998) Proc Natl Acad Sci USA 95:15189–15193
Schulze-Gahmen U, Pelaschier J, Yokota H, Kim R, Kim SH (2003) Proteins 50:526–530
Shin DH, Lou Y, Jancarik J, Yokota H, Kim R, Kim SH (2004) Proc Natl Acad Sci USA 101:13198–13203
Liu J, Lou Y, Yokota H, Adams PD, Kim R, Kim SH (2005) J Mol Biol 354:289–303
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR (2004) Nucleic Acids Res 32:D138–141
Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, Fedorova ND, Koonin EV (2001) Nucleic Acids Res 29:22–28
Hwang KY, Chung JH, Kim SH, Han YS, Cho Y (1999) Nat Struct Biol 6:691–696
Numata T, Fukai S, Ikeuchi Y, Suzuki T, Nureki O (2006) Structure 14:357–366
Kim KK, Kim R, Kim SH (1998) Nature 394:595–599
Choi IG, Shin DH, Brandsen J, Jancarik J, Busso D, Yokota H, Kim R, Kim SH (2003) J Struct Funct Genomics 4:31–34
Kim JS, DeGiovanni A, Jancarik J, Adams PD, Yokota H, Kim R, Kim SH (2005) Proc Natl Acad Sci USA 102:3248–3253
Liu J, Huang C, Shin DH, Yokota H, Jancarik J, Kim JS, Adams PD, Kim R, Kim SH (2005) J Mol Biol 350:987–996
Hou J, Sims GE, Zhang C, Kim SH (2003) Proc Natl Acad Sci USA 100:2386–2390
Holm L, Sander C (1998) Nucleic Acids Res 26:316–319
Chandonia JM, Brenner SE (2006) Science 311:347–51
Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, Eddy SR, Sonnhammer EL, Bateman A (2006) Nucleic Acids Res 34:D247–51
Chandonia JM, Kim SH, Brenner SE (2005) Proteins 62:356–70
Baker D, Sali A (2001) Science 294:93–6
Chandonia JM, Brenner SE (2005) Proteins 58:166–79
Chandonia JM, Brenner SE (2005) Proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference, Shanghai, China, 751–55
Service R (2005) Science 307:1554–8
Hutchison CA, Peterson SN, Gill SR, Cline RT, White O, Fraser CM, Smith HO, Venter JC (1999) Science 286:2165–9
Acknowledgements
We thank all the component members of BSGC for their efforts towards accomplishing the BSGC objectives. We gratefully acknowledge the supports of the NIH grant GM62412 for most of the structures cited in this article, NIH (R01-GM073109) and the U.S. Department of Energy under contract DE-AC02-05CH11231.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shin, D.H., Hou, J., Chandonia, JM. et al. Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center. J Struct Funct Genomics 8, 99–105 (2007). https://doi.org/10.1007/s10969-007-9025-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10969-007-9025-4