Abstract
Molecular mimicry of host proteins by pathogens constitutes a strategy to hijack the host pathways. At present, there is no dedicated resource for mimicked domains and motifs in the host–pathogen interactome. In this work, the experimental host–pathogen (HP) and host–host (HH) protein–protein interactions (PPIs) were collated. The domains and motifs of these proteins were annotated using CD Search and ScanProsite, respectively. Host and pathogen proteins with a shared host interactor and similar domain/motif constitute a mimicry pair exhibiting global structural similarity (domain mimicry pair; DMP) or local sequence motif similarity (motif mimicry pair; MMP). Mimicry pairs are likely to be co-expressed and co-localized. 1,97,607 DMPs and 32,67,568 MMPs were identified in 49,265 experimental HP-PPIs and organized in a web-based resource, ImitateDB (http://imitatedb.sblab-nsit.net) that can be easily queried. The results are externally integrated using hyperlinked domain PSSM ID, motif ID, protein ID and PubMed ID. Kinase, UL36, Smc and DEXDc were frequent DMP domains whereas protein kinase C phosphorylation, casein kinase 2 phosphorylation, glycosylation and myristoylation sites were frequent MMP motifs. Novel DMP domains SANT, Tudor, PhoX and MMP motif microbody C-terminal targeting signal, cornichon signature and lipocalin signature were proposed. ImitateDB is a novel resource for identifying mimicry in interacting host and pathogen proteins.




Similar content being viewed by others
Data availability
The authors confirm that the data supporting the findings of this study are available at http://imitatedb.sblab-nsit.net. The python script for the identification of DMP/MMP is available at the following link: https://github.com/sblab/ImitateDB.git. At present, Imitate DB contains information from HP-PPIs up to April 2021. We plan to update this database regularly as new releases of the contributing databases become available. Apart from collating all the HP-PPIs from different databases, we are working towards mining the HP-PPIs from literature for computation of additional DMPs and MMPs. We plan to compute DMPs and MMPs for predicted HP-PPIs. Work is also ongoing for providing information on DMPs/MMPs for Sars-CoV-2.
Abbreviations
- HP-PPI:
-
Host–pathogen protein–protein interaction
- HHI:
-
Host–host interaction
- DMP:
-
Domain mimicry pair
- MMP:
-
Motif mimicry pair
- SANT:
-
Swi3, Ada2, N-Cor, and TFIIIB
- CD search:
-
Conserved domain search
- PSSM:
-
Position-specific scoring matrices
- COG:
-
Cluster of orthologs
- PKC:
-
Protein kinase C
- CK2:
-
Casein kinase II
References
Ahmed H et al (2018) Network biology discovers pathogen contact points in host protein–protein interactomes. Nat Commun 9(1):2312
Ammari MG et al (2016) HPIDB 2.0: a curated database for host-pathogen interactions. Database (Oxford) 2016
Beattie E, T J, Paoletti E (1991) Vaccinia virus-encoded eIF-2 alpha homolog abrogates the antiviral effect of interferon. Virology 183(1):419–22
Bennett MK et al (1993) The syntaxin family of vesicular transport receptors. Cell 74(5):863–873
Beyer AR et al (2015) The Anaplasma phagocytophilum effector AmpA hijacks host cell SUMOylation. Cell Microbiol 17(4):504–519
Bhat MI et al (2019) Escherichia coli K12: An evolving opportunistic commensal gut microbe distorts barrier integrity in human intestinal cells. Microb Pathog 133:103545
Brown KR, Jurisica I (2007) Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biol 8(5):R95
Burg JS et al (2015) Structural biology. Structural basis for chemokine recognition and activation of a viral G protein-coupled receptor. Science 347(6226):1113–7
Cazzanelli G et al (2018) The Yeast Saccharomyces cerevisiae as a model for understanding RAS proteins and their role in human tumorigenesis. Cells 7(2):4
Chatr-aryamontri A et al (2007) MINT: the molecular INTeraction database. Nucleic Acids Res 35:D572-4
Chen YF, Xia Y (2021) Structural profiling of bacterial effectors reveals enrichment of host-interacting domains and motifs. Front Mol Biosci 8:626600
Chill JH et al (2003) The human type I interferon receptor: NMR structure reveals the molecular basis of ligand binding. Structure 11(7):791–802
Crispin M, Doores KJ (2015) Targeting host-derived glycans on enveloped viruses for antibody-based vaccine design. Curr Opin Virol 11:63–69
Crispin M, Ward AB, Wilson IA (2018) Structure and Immune recognition of the HIV glycan shield. Annu Rev Biophys 47:499–523
Cromie GA, C JC, Leach DR (2001) Recombination at double-strand breaks and DNA ends: conserved mechanisms from phage to humans. Mol Cell 8:1163–74
Crua Asensio N et al (2017) Centrality in the host-pathogen interactome is associated with pathogen fitness during infection. Nat Commun 8:14092
Cusick MF, Libbey JE, Fujinami RS (2012) Molecular mimicry as a mechanism of autoimmune disease. Clin Rev Allergy Immunol 42(1):102–111
Damian RT (1964) Molecular mimicry: antigen sharing by parasite and host and its consequences. Am Nat 98(200):129–149
Dar AC, Sicheri F (2002) X-ray crystal structure and functional analysis of vaccinia virus K3L reveals molecular determinants for PKR subversion and substrate recognition. Mol Cell 10(2):295–305
Davey NE, Trave G, Gibson TJ (2011) How viruses hijack cell regulation. Trends Biochem Sci 36(3):159–169
de Castro E et al (2006) ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res 34:W362-5
Dean Southwood SR (2019) Host-pathogen interactions. Encycl Bioinform Comput Biol 3:103–112
Diaz A, Ferreira A, Sim RB (1997) Complement evasion by Echinococcus granulosus: sequestration of host factor H in the hydatid cyst wall. J Immunol 158(8):3779–3786
Doxey AC, McConkey BJ (2013) Prediction of molecular mimicry candidates in human pathogenic bacteria. Virulence 4(6):453–466
Durmus Tekir S, Cakir T, Ulgen KO (2012) Infection strategies of bacterial and viral pathogens through pathogen-human protein–protein interactions. Front Microbiol 3:46
Durmus Tekir S et al (2013) PHISTO: pathogen-host interaction search tool. Bioinformatics 29(10):1357–1358
Duro N, Miskei M, Fuxreiter M (2015) Fuzziness endows viral motif-mimicry. Mol Biosyst 11(10):2821–2829
Ferreira PG, Azevedo PJ (2007) Evaluating deterministic motif significance measures in protein databases. Algorithms Mol Biol 2:16
Flower DR (1996) The lipocalin protein family: structure and function. Biochem J 318:1–14
Franzosa EA, Xia Y (2011) Structural principles within the human-virus protein–protein interaction network. Proc Natl Acad Sci U S A 108(26):10538–10543
Gagnaire A et al (2017) Collateral damage: insights into bacterial mechanisms that predispose host cells to cancer. Nat Rev Microbiol 15(2):109–128
Gandhi TK et al (2006) Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets. Nat Genet 38(3):285–293
Garamszegi S, Franzosa EA, Xia Y (2013) Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein–protein interaction networks. PLoS Pathog 9(12):e1003778
Garg A et al (2017) miPepBase: a database of experimentally verified peptides involved in molecular mimicry. Front Microbiol 8:2053
Goll J et al (2008) MPIDB: the microbial protein interaction database. Bioinformatics 24(15):1743–1744
Gould SJ et al (1989) A conserved tripeptide sorts proteins to peroxisomes. J Cell Biol 108(5):1657–1664
Guarino E, Salguero I, Kearsey SE (2014) Cellular regulation of ribonucleotide reductase in eukaryotes. Semin Cell Dev Biol 30:97–103
Guven-Maiorov E, Tsai CJ, Nussinov R (2016) Pathogen mimicry of host protein–protein interfaces modulates immunity. Semin Cell Dev Biol 58:136–145
Guven-Maiorov E et al (2020) HMI-PRED: a web server for structural prediction of host-microbe interactions based on interface mimicry. J Mol Biol 432(11):3395–3403
Haigis MC, Guarente LP (2006) Mammalian sirtuins–emerging roles in physiology, aging, and calorie restriction. Genes Dev 20(21):2913–2921
Hermjakob H et al (2004) IntAct: an open source molecular interaction database. Nucleic Acids Res 32:D452-5
Huang Z et al (2009) Structural insights into host GTPase isoform selection by a family of bacterial GEF mimics. Nat Struct Mol Biol 16(8):853–860
Huerta-Cepas J et al (2019) eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47(D1):D309–D314
Ivanov SS et al (2010) Lipidation by the host prenyltransferase machinery facilitates membrane localization of Legionella pneumophila effector proteins. J Biol Chem 285(45):34686–34698
Kang H et al (2017) Sirt1 carboxyl-domain is an ATP-repressible domain that is transferrable to other proteins. Nat Commun 8:15560
Keating JA, Striker R (2012) Phosphorylation events during viral infections provide potential therapeutic targets. Rev Med Virol 22(3):166–181
Kumar R et al (2020) Role of host-mediated post-translational modifications (PTMs) in RNA virus pathogenesis. Int J Mol Sci 22(1):323
Kvansakul M, Hinds MG (2013) Structural biology of the Bcl-2 family and its mimicry by viral proteins. Cell Death Dis 4:e909
Launay G et al (2015) MatrixDB, the extracellular matrix interaction database: updated content, a new navigator and expanded functionalities. Nucleic Acids Res 43:D321-7
Lee PC et al (2020) The Legionella kinase LegK7 exploits the Hippo pathway scaffold protein MOB1A for allostery and substrate phosphorylation. Proc Natl Acad Sci U S A 117(25):14433–14443
Li T et al (2013) SET-domain bacterial effectors target heterochromatin protein 1 to activate host rDNA transcription. EMBO Rep 14(8):733–740
Lilley CE, Schwartz RA, Weitzman MD (2007) Using or abusing: viruses and the cellular DNA damage response. Trends Microbiol 15(3):119–126
Ludin P, Nilsson D, Maser P (2011) Genome-wide identification of molecular mimicry candidates in parasites. PLoS ONE 6(3):e17546
Lynn DJ et al (2008) InnateDB: facilitating systems-level analyses of the mammalian innate immune response. Mol Syst Biol 4:218
Marchler-Bauer A et al (2011) CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res 39:D225-9
Mathews MB, Hershey JW (2015) The translation factor eIF5A and human cancer. Biochim Biophys Acta 1849(7):836–844
Maurer-Stroh S, Eisenhaber F (2004) Myristoylation of viral and bacterial proteins. Trends Microbiol 12(4):178–185
Mayer KA et al (2019) Hijacking the supplies: metabolism as a novel facet of virus-host interaction. Front Immunol 10:1533
McClain MT et al (2005) Early events in lupus humoral autoimmunity suggest initiation through molecular mimicry. Nat Med 11(1):85–89
Meier-Stephenson V et al (2018) DEAD-box helicases: the Yin and Yang roles in viral infections. Biotechnol Genet Eng Rev 34(1):3–32
Mi H et al (2021) PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucleic Acids Res 49(D1):D394–D403
Mudunuri U et al (2009) bioDBnet: the biological database network. Bioinformatics 25(4):555–556
Navratil V et al (2009) VirHostNet: a knowledge base for the management and the analysis of proteome-wide virus-host interaction networks. Nucleic Acids Res 37:D661-8
Newcomb WW, Brown JC (2010) Structure and capsid association of the herpesvirus large tegument protein UL36. J Virol 84(18):9408–9414
Nicod C, Banaei-Esfahani A, Collins BC (2017) Elucidation of host-pathogen protein–protein interactions to uncover mechanisms of host cell rewiring. Curr Opin Microbiol 39:7–15
Pek JW, Anand A, Kai T (2012) Tudor domain proteins in development. Development 139(13):2255–2266
Perez-Torrado R, Querol A (2015) Opportunistic strains of saccharomyces cerevisiae: a potential risk sold in food products. Front Microbiol 6:1522
Rosas-Santiago P et al (1864) (2017) Plant and yeast cornichon possess a conserved acidic motif required for correct targeting of plasma membrane cargos. Biochimica et biophysica acta. Mol Cell Res 10:1809–1818
Samano-Sanchez H, Gibson TJ (2020) Mimicry of short linear motifs by bacterial pathogens: a drugging opportunity. Trends Biochem Sci 45(6):526–544
Schipke J et al (2012) The C terminus of the large tegument protein pUL36 contains multiple capsid binding sites that function differently during assembly and cell entry of herpes simplex virus. J Virol 86(7):3682–3700
Schwikowski B, Uetz P, Fields S (2000) A network of protein–protein interactions in yeast. Nat Biotechnol 18(12):1257–1261
Seabright GE et al (2019) Protein and glycan mimicry in HIV vaccine design. J Mol Biol 431(12):2223–2247
Sharp TV, Witzel JE, Jagus R (1997) Homologous regions of the alpha subunit of eukaryotic translational initiation factor 2 (eIF2alpha) and the vaccinia virus K3L gene product interact with the same domain within the dsRNA-activated protein kinase (PKR). Eur J Biochem 250(1):85–91
Spiess C et al (2004) Mechanism of the eukaryotic chaperonin: protein folding in the chamber of secrets. Trends Cell Biol 14(11):598–604
Standfuss J (2015) Structural biology. Viral Chemokine Mimicry Sci 347(6226):1071–1072
Stark C et al (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res 34:D535-9
Stebbins CE, Galan JE (2000) Modulation of host signaling by a bacterial mimic: structure of the Salmonella effector SptP bound to Rac1. Mol Cell 6(6):1449–1460
The UniProt C (2017) UniProt: the universal protein knowledgebase. Nucleic Acids Res 45(D1):D158–D169
Venigalla SSK, Premakumar S, Janakiraman V (2020) A possible role for autoimmunity through molecular mimicry in alphavirus mediated arthritis. Sci Rep 10(1):938
Via A et al (2015) How pathogens use linear motifs to perturb host cell networks. Trends Biochem Sci 40(1):36–48
Wagh K et al (2018) Completeness of HIV-1 envelope glycan shield at transmission determines neutralization breadth. Cell Rep 25(4):893-908 e7
Weaver TM et al (2019) The EZH2 SANT1 domain is a histone reader providing sensitivity to the modification state of the H4 tail. Sci Rep 9(1):987
Weitao T, Dasgupta S, Nordström K (2000) Role of the mukB gene in chromosome and plasmid partition in Escherichia coli. Mol Microbiol 38(2):392–400
Wolfe CJ, Kohane IS, Butte AJ (2005) Systematic survey reveals general applicability of “guilt-by-association” within gene coexpression networks. BMC Bioinformatics 6:227
Wu S et al (2005) LMP1 protein from the Epstein-Barr virus is a structural CD40 decoy in B lymphocytes for binding to TRAF3. J Biol Chem 280(39):33620–33626
Xenarios I et al (2000) DIP: the database of interacting proteins. Nucleic Acids Res 28(1):289–291
Xu Y et al (2001) The Phox homology (PX) domain, a new player in phosphoinositide signalling. Biochem J 360(Pt 3):513–530
Yapici-Eser H et al (2021) Neuropsychiatric symptoms of COVID-19 Explained by SARS-CoV-2 proteins’ mimicry of human protein interactions. Front Hum Neurosci 15:656313
Yoshida T, Claverie JM, Ogata H (2011) Mimivirus reveals Mre11/Rad50 fusion proteins with a sporadic distribution in eukaryotes, bacteria, viruses and plasmids. Virol J 8:427
Yoshimura SH, Hirano T (2016) HEAT repeats - versatile arrays of amphiphilic helices working in crowded environments? J Cell Sci 129(21):3963–3970
Yuan S et al (2020) Viruses harness YxxO motif to interact with host AP2M1 for replication: a vulnerable broad-spectrum antiviral target. Sci Adv 6(35):eaba7910
Acknowledgements
ST acknowledges Netaji Subhas University of Technology for grant of Teaching cum Research Fellowship. SB acknowledges Indian Council of Medical Research for project Grant ID 2021-6412.
Author information
Authors and Affiliations
Contributions
ST: carried out data cleaning, enrichment and organisation, development of the database, analysis, and manuscript preparation. VB: developed the backend and front end of the web interface. TM: carried out data acquisition and cleaning. SB: was involved in conception, design, analysis, and supervision of the study. The manuscript was reviewed by all the authors.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
Research involving human participants and/or animals. It is declared that no human participants and/or animals are used in this work.
Informed consent
Not applicable.
Additional information
Handling editor: D. Frishman.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Tayal, S., Bhatia, V., Mehrotra, T. et al. ImitateDB: A database for domain and motif mimicry incorporating host and pathogen protein interactions. Amino Acids 54, 923–934 (2022). https://doi.org/10.1007/s00726-022-03163-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00726-022-03163-3

