Protein Function Prediction pp 45-57 | Cite as
MPFit: Computational Tool for Predicting Moonlighting Proteins
Abstract
An increasing number of proteins have been found which are capable of performing two or more distinct functions. These proteins, known as moonlighting proteins, have drawn much attention recently as they may play critical roles in disease pathways and development. However, because moonlighting proteins are often found serendipitously, our understanding of moonlighting proteins is still quite limited. In order to lay the foundation for systematic moonlighting proteins studies, we developed MPFit, a software package for predicting moonlighting proteins from their omics features including protein–protein and gene interaction networks. Here, we describe and demonstrate the algorithm of MPFit, the idea behind it, and provide instruction for using the software.
Keywords
Moonlighting proteins Protein function prediction Dual function Function annotation Protein association Feature imputation Omics-data GenomeNotes
Acknowledgments
This work was partly supported by the National Institute of General Medical Sciences of the National Institutes of Health (R01GM097528) and the National Science Foundation (IIS1319551, DBI1262189, IOS1127027, DMS1614777).
References
- 1.Campbell RM, Scanes CG (1995) Endocrine peptides ‘moonlighting’ as immune modulators: roles for somatostatin and GH-releasing factor. J Endocrinol 147(3):383–396CrossRefPubMedGoogle Scholar
- 2.Jeffery CJ (1999) Moonlighting proteins. Trends Biochem Sci 24(1):8–11CrossRefPubMedGoogle Scholar
- 3.Weaver DT (1998) Telomeres: moonlighting by DNA repair proteins. Curr Biol 8(14):R492–R494CrossRefPubMedGoogle Scholar
- 4.Jeffery CJ (2011) Proteins with neomorphic moonlighting functions in disease. IUBMB Life 63(7):489–494CrossRefPubMedGoogle Scholar
- 5.Jeffery CJ (2009) Moonlighting proteins—an update. Mol Biosyst 5(4):345–350CrossRefPubMedGoogle Scholar
- 6.Jeffery CJ (2004) Molecular mechanisms for multitasking: recent crystal structures of moonlighting proteins. Curr Opin Struct Biol 14(6):663–668CrossRefPubMedGoogle Scholar
- 7.Jeffery CJ (2003) Moonlighting proteins: old proteins learning new tricks. Trends Genet 19(8):415–417CrossRefPubMedGoogle Scholar
- 8.Ozimek P, Kotter P, Veenhuis M, van der Klei IJ (2006) Hansenula polymorpha and Saccharomyces cerevisiae Pex5p's recognize different, independent peroxisomal targeting signals in alcohol oxidase. FEBS Lett 580(1):46–50CrossRefPubMedGoogle Scholar
- 9.Chen XJ, Wang X, Kaufman BA, Butow RA (2005) Aconitase couples metabolic regulation to mitochondrial DNA maintenance. Science 307(5710):714–717CrossRefPubMedGoogle Scholar
- 10.Banerjee S, Nandyala AK, Raviprasad P, Ahmed N, Hasnain SE (2007) Iron-dependent RNA-binding activity of Mycobacterium tuberculosis aconitase. J Bacteriol 189(11):4046–4052CrossRefPubMedPubMedCentralGoogle Scholar
- 11.Gomez A, Domedel N, Cedano J, Pinol J, Querol E (2003) Do current sequence analysis algorithms disclose multifunctional (moonlighting) proteins? Bioinformatics 19 (7):895–896CrossRefPubMedGoogle Scholar
- 12.Hawkins T, Luban S, Kihara D (2006) Enhanced automated function prediction using distantly related sequences and contextual association by PFP. Protein Sci 15(6):1550–1556CrossRefPubMedPubMedCentralGoogle Scholar
- 13.Khan IK, Wei Q, Chitale M, Kihara D (2015) PFP/ESG: automated protein function prediction servers enhanced with Gene Ontology visualization tool. Bioinformatics 31(2):271–272CrossRefPubMedGoogle Scholar
- 14.Hawkins T, Chitale M, Luban S, Kihara D (2009) PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data. Proteins 74(3):566–582CrossRefPubMedGoogle Scholar
- 15.Chitale M, Hawkins T, Park C, Kihara D (2009) ESG: extended similarity group method for automated protein function prediction. Bioinformatics 25(14):1739–1745CrossRefPubMedPubMedCentralGoogle Scholar
- 16.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25(17):3389–3402CrossRefPubMedPubMedCentralGoogle Scholar
- 17.Khan I, Chitale M, Rayon C, Kihara D (2012) Evaluation of function predictions by PFP, ESG,and PSI-BLAST for moonlighting proteins. BMC Proc 6(Suppl 7):S5CrossRefPubMedPubMedCentralGoogle Scholar
- 18.Khan I, Chen Y, Dong T, Hong X, Takeuchi R, Mori H, Kihara D (2014) Genome-scale identification and characterization of moonlighting proteins. Biol Direct 9:30CrossRefPubMedPubMedCentralGoogle Scholar
- 19.Chapple CE, Robisson B, Spinelli L, Guien C, Becker E, Brun C (2015) Extreme multifunctional proteins identified from a human protein interaction network. Nat Commun 6:7412CrossRefPubMedPubMedCentralGoogle Scholar
- 20.Pritykin Y, Ghersi D, Singh M (2015) Genome-wide detection and analysis of multifunctional genes. PLoS Comput Biol 11(10):e1004467CrossRefPubMedPubMedCentralGoogle Scholar
- 21.Hernandez S, Franco L, Calvo A, Ferragut G, Hermoso A, Amela I, Gomez A, Querol E, Cedano J (2015) Bioinformatics and moonlighting proteins. Front Bioeng Biotechnol 3:90CrossRefPubMedPubMedCentralGoogle Scholar
- 22.Hernandez S, Amela I, Cedano J, Pinol J, Perez-Pons J, Mozo-Villarias A, Querol E (2012) Do moonlighting proteins belong to the intrinsically disordered protein class? J Proteomics Bioinform 5:262–264CrossRefGoogle Scholar
- 23.Gomez A, Hernandez S, Amela I, Pinol J, Cedano J, Querol E (2011) Do protein–protein interaction databases identify moonlighting proteins? Mol Biosyst 7(8):2379–2382CrossRefPubMedGoogle Scholar
- 24.Khan IK, Kihara D (2016) Genome-scale prediction of moonlighting proteins using diverse protein association information. Bioinformatics 32(15):2281–2288CrossRefPubMedPubMedCentralGoogle Scholar
- 25.Consortium GO (2015) Gene Ontology Consortium: going forward. Nucleic Acids Res 43(Database issue):D1049–D1056CrossRefGoogle Scholar
- 26.Mani M, Chen C, Amblee V, Liu H, Mathur T, Zwicke G, Zabad S, Patel B, Thakkar J, Jeffery CJ (2015) MoonProt: a database for proteins that are known to moonlight. Nucleic Acids Res 43(Database issue):D277–D282CrossRefPubMedGoogle Scholar
- 27.Breiman L (2001) Random forests. Mach Learn 45(1):5–32CrossRefGoogle Scholar
- 28.UniProt Consortium (2015) UniProt: a hub for protein information. Nucleic Acids Res 43(Database issue):D204–D212CrossRefGoogle Scholar
- 29.Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C (2015) STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 43(Database issue):D447–D452CrossRefPubMedGoogle Scholar
- 30.Okamura Y, Aoki Y, Obayashi T, Tadaka S, Ito S, Narise T, Kinoshita K (2015) COXPRESdb in 2015: coexpression database for animal species by DNA-microarray and RNAseq-based expression data with multiple quality assessment systems. Nucleic Acids Res 43(Database issue):D82–D86CrossRefPubMedGoogle Scholar
- 31.Oughtred R, Chatr-aryamontri A, Breitkreutz BJ, Chang CS, Rust JM, Theesfeld CL, Heinicke S, Breitkreutz A, Chen D, Hirschman J, Kolas N, Livstone MS, Nixon J, O'Donnell L, Ramage L, Winter A, Reguly T, Sellam A, Stark C, Boucher L, Dolinski K, Tyers M (2016) BioGRID: a resource for studying biological interactions in yeast. Cold Spring Harb Protoc (1):pdb top080754Google Scholar
- 32.Oates ME, Romero P, Ishida T, Ghalwash M, Mizianty MJ, Xue B, Dosztanyi Z, Uversky VN, Obradovic Z, Kurgan L, Dunker AK, Gough J (2013) D(2)P(2): database of disordered protein predictions. Nucleic Acids Res 41(Database issue):D508–D516PubMedGoogle Scholar