Introduction

The ancient innate immune system provides the first line of defense against infection. This is dependent on various interacting cell types such as dendritic cells (DCs) and macrophages that are capable of detecting invading pathogens, responding to the infections through secretion of proinflammatory cytokines, chemokines, and interferons, upregulating expression of co-stimulatory molecules, and processing and presenting antigens to naїve T cells to activate the adaptive immune system. The power of the innate immune system has been harnessed to great public health benefits since the first published case of vaccination in the late eighteenth century [1]. Nonetheless, the underlying molecular mechanisms of innate immune recognition only started to emerge in the last decade, when major families of innate immune receptors, or the pattern recognition receptors (PRRs), were identified and their functions characterized [25]. PRRs were originally proposed to recognize pathogen-associated molecular patterns (PAMPs), which implied inherent distinction of chemical “patterns” between hosts and pathogens, or “self” versus “non-self” recognition. It is now realized that PAMPs are present in both pathogenic and symbiotic microbes [6], and many of the PRRs are capable of responding to host molecules [7, 8]. This is particularly significant for those PRRs that recognize universal components of life such as nucleic acids, the activation of which poses potential threat of autoimmune diseases [9]. Here I will focus on these nucleic acid-recognizing PRRs, namely, the endosomal Toll-like receptors (TLRs), the cytoplasmic RLRs, and a member of the NLR family, as well as a recently identified cytoplasmic DNA sensor containing Z-DNA binding domains (Fig. 1).

Fig. 1
figure 1

Recognition of nucleic acids by receptors/sensors of the innate immune system. Several families of receptors/sensors respond to nucleic acids in different subcellular compartments. This recognition triggers adapter protein-mediated signaling pathways that eventually lead to activation of transcription factors or caspases. TLR7 and 8 can also be activated by small nucleoside analogs not illustrated here. NLRP3 responds to a variety of other endogeneous, microbial, and environmental stimuli besides nucleic acids. Unknown receptors/sensors/adapters are marked with question marks. The abbreviations are: LRR, leucine-rich repeats; TIR, Toll/interleukin-1 receptor; DD, death domain; Zα/Zβ, Z-DNA binding domain α/β; D3, domain 3 (for DNA binding); CARD, caspase activation and recruitment domain; Helicase, RNA helicase domain; CTD, C-terminal domain; PYD, pyrin domain; NOD, nucleotide-binding and oligomerization domain; MSU, mono-sodium urate

The endosomal TLRs

The TLRs are a major family of membrane-bound PRRs [3, 4]. TLRs 3, 7, 8, and 9 are mainly implicated in antiviral defense because of their roles in the recognition of viral nucleic acids, delivered inside the cells through either endocytosis or autophagy [10, 11]. In accordance, they are highly expressed in DCs, particularly plasmacytoid DCs (pDCs), the primary source of type I interferons in antiviral immunity [12]. TLR7 is activated by ssRNA viruses such as influenza virus and vesicular stomatitis virus (VSV) [10]. Both TLR7 and TLR8 are activated by HIV-1-derived uridine-rich RNAs [13]. TLR9 is essential for type I interferon production in response to DNA viruses such as herpes simplex virus, murine cytomegalovirus (MCMV), and adenovirus. TLR3 recognizes influenza virus, vaccinia virus, flavivirus, and MCMV [10, 12]. It contributes to immune protection against MCMV infection, nonetheless promotes the pathogenesis of infections by influenza virus and vaccinia virus through initiation of excessive inflammation [14]. The protective role of TLR3 during flavivirus infection is still controversial [15, 16].

The ability of these TLRs to respond to a particular type of virus is reflected by their ligand-binding specificities. For example, TL3 and 7 recognize double-stranded (ds) and single-stranded (ss) RNAs, respectively [17, 18]. TLR8 is a close paralog of TLR7 and there appears to be functional redundancy between these two receptors. Both TLR7 and 8 also recognize small synthetic ligands such as guanine nucleoside analog R-848 [19]. TLR9 is activated by ssDNAs with natural phosphodiester (PD) backbones independent of specific sequence and base modifications, whereas in the context of artificial phosphorothionate (PS) backbones, unmethylated cytosine–phosphate–guanine motifs are indispensable for activation [20]. In light of the significant impact of DNA backbones on the sequence requirement of TLR9 stimulation, the previous reports of various classes of oligodeoxyribonucleotides (ODNs) using PS and/or PS/PD hybrid backbones [21] may have to be re-evaluated to differentiate the contributions of backbones and/or sequences to TLR9 stimulation. Moreover, TLR9 stimulation by ODNs is also dependent on cell types and aggregation states of ODNs. A recent study using conditional knock-out of MyD88 in DCs shows that MyD88-deficient DCs and other wild-type innate cells fail to induce inflammatory response to “naked” TLR9 ligands. In contrast, stimulation with lipid-packaged or aggregated forms of ODNs activates non-DC innate cells such as macrophages and natural killer cells and thus compensate for the MyD88-deficient DCs in initiation of adaptive immune response [22]. This illustrates distinct recognition of different forms of the same ligands by TLR9, as well as variable activation mechanisms of the same receptors in diverse cell types.

The antiviral functions of the nucleic acid-binding TLRs are correlated with their unique subcellular environments. TLR3, 7, 8, and 9 are present in the endoplasmic reticulum (ER) and are delivered to the endosomes through their association with ER membrane protein UNC93B1 [23, 24]. This strategic compartmentalization and trafficking of TLRs appears to have evolved to prevent inadvertent reaction to host nucleic acids. However, exceptions have been reported for TLR9 and TLR3 in their cellular localization. Using flow cytometry techniques and TLR9 peptide competition studies, Eaton-Bassiri et al. [25] reported expression of TLR9 in human peripheral blood mononuclear cells and tonsil B cells, suggesting a role of TLR9 ligands in B-cell activation and differentiation into plasma cells. More recently, cell surface TLR3 expression and response to siRNAs was linked to its anti-angiogenesis activities in immune privileged retina-choroid interface [26]. Furthermore, innate tolerance facilitated by compartmentalization of TLRs breaks down in autoimmune diseases. In patients with systemic lupus erythematosus, a nuclear DNA-binding protein high-mobility group box 1 facilitates self-DNA uptake in immune complexes containing auto-antibodies, thus inducing TLR9-dependent production of proinflammatory cytokines and interferons [7]. In psoriasis patients, an antimicrobial peptide LL37/CAMP packages self-DNA into a condensed structure that is delivered to early endosomal compartments and activates TLR9 [27]. In both cases, the presence of self-DNAs in extracellular milieu alerts the innate immune system to defective clearance of cell death, further exacerbate tissue damage by invoking proinflammatory responses.

Signal transduction by the TLRs

Ligand binding by TLRs at the extracellular or endosomal milieu initiates a signal transduction cascade in the cytoplasm that leads to activation of transcription factors nuclear factor kB (NF-kB), interferon regulatory factors (IRFs) and activating protein-1 (AP-1) [28]. Specific domain architecture of TLRs provides the basis for distinct functions within different cellular environment. TLRs consist of amino-terminal ligand-binding leucine-rich repeats (LRRs), a central transmembrane domain, and a carboxyl-terminal Toll/interleukin-1 receptor (TIR) domain. The TIR domains are responsible for signaling to cytoplasmic TIR domain-containing adapter proteins. X-ray crystal structures for the extracellular domains of TLR 1, 2, 3, and 4 show similar horseshoe-shaped solenoid assemblies and characterized a number of important receptor-ligand interactions for the TLR1–TLR2 heterodimer, TLR3 homodimer, and TLR4-MD2 complexes [2933]. The common theme that emerged from these structures is that ligand binding by TLRs mediates receptor dimerization and potential conformational change, which results in close association of the transmembrane and cytoplasmic domains of the receptors. The juxtaposition of the cytoplasmic TIR domains and formation of dimers or oligomers provides a platform for the recruitment of TIR domain-containing adapters. Such a TIR domain signaling complex triggers downstream signaling events that lead to activation of gene expressions for proinflammatory cytokines and interferons [34].

The formation of homo- and hetero-typic TIR–TIR domain complexes among the TLRs and their adapters is a crucial event in TLR signaling pathways [35]. Severe immuno-deficiency has been attributed to mutations in the TIR domains, presumably due to defective signal transduction [3]. There are five TIR domain containing adapters in the TLR signaling pathways: MyD88 (myeloid differentiation primary response gene 88), TIRAP/MAL (TIR domain-containing adapter protein/MyD88-adapter like), TRIF/TICAM1 (TIR-domain-containing adapter-inducing interferon-β/TLR adapter molecule 1), TRAM/TICAM2, and SARM (sterile α- and armadillo-motif containing protein) [35]. SARM is the only negative regulator that functions to suppress TRIF signaling pathways. These adapters are differentially recruited to the TLRs to generate signaling specificity in response to various TLR ligands, with MyD88 as the central adapter molecule that is utilized by all TLRs except for TLR3. In fact, a number of microbial virulence factors that contain inhibitory TIR domains have evolved to specifically target MyD88 signaling pathways [36]. MyD88-deficient mice are highly susceptible to infections by various viruses, bacteria, fungi, and parasites, whereas human patients with recessive MyD88 deficiency suffer from pyrogenic bacterial infections, particularly early in life [37]. These patients are nonetheless resistant to infections by most other pathogens, highlighting species differences in redundant immune functions by various families of PRRs (see the following section).

The TIR domains are 150–200 residue motifs composed of five β strands surrounded by α helices and loops, as reported for the TIR domains of TLR1, TLR2, TLR10, and IL1RAPL [3841]. In particular, the “BB,” “DD” and “EE” loops have been proposed to reside at the TIR–TIR dimer interface. Mutation of a highly conserved proline residue on the BB loop of the TLR4 TIR domain renders the C3H/HeJ mice unresponsive to bacterial endotoxin [3, 42]. Peptidomimetic compounds based on the BB loop of MyD88 have been reported to interfere with MyD88-dependent signaling [43, 44]. A variant of the TIRAP TIR domain, Ser180Leu, in humans confers resistance to malaria, tuberculosis, and pneumococcal pneumonia, a likely consequence of altered TIRAP-mediated signaling from TLR2 and/or TLR4 [45]. Even though a number of docking models for the TIR–TIR domain complexes have been proposed, all of which focused on constructing interfaces centered on the BB and DD loops, a consistent mode of the TIR–TIR domain interaction has yet to emerge [35, 38]. Recently NMR structures of the TIR domain of MyD88 were deposited in the protein data bank. The BB loop of MyD88 TIR domain adopts a drastically different conformation compared with other TIR domain structures. It is not clear whether this represents a unique feature of the MyD88 TIR domain, nor is it apparent how the different BB loop conformations facilitate formation of specific TIR–TIR signaling complexes involving MyD88 homodimer or MyD88 in association with TLRs.

RLRs as cytoplasmic RNA sensors

In addition to the membrane-bound receptors described above, a number of cytosolic receptors for RNAs have evolved by the innate immune system to initiate antiviral responses. These include a new family of RNA helicase receptors referred to as RLRs: RIG-I, melanoma differentiation factor-5 (MDA5), and laboratory of genetics and physiology-2 (LGP2) (Fig. 1) [10, 46]. Sensing of viral RNAs by RLRs activates IRFs and induces expression of interferons, which in turn upregulates expression of the above receptors, creating a powerful positive feedback. Mice lacking RIG-I are highly susceptible to infections by Japanese encephalitis virus, influenza virus, and VSV, whereas MDA5-deficient mice succumb to encephalomyocarditis virus (EMCV), Theiler’s encephalomyelitis virus, and mengovirus [47]. Enhanced susceptibility to EMCV was observed for LGP2-deficient mice [48].

Specific ligand recognition mechanisms have evolved for these RLRs to prevent or decrease basal activation by abundant host messenger, ribosomal, and transfer RNAs in the cytoplasm. 5′ phosphate in ssRNAs has been shown to be the critical element for RIG-I stimulation but is largely absent from host mRNA and tRNA, and 5′ phosphate observed in host 5S rRNAs may be obscured by the bound ribosomal proteins [49, 50]. In addition, RIG-I binds short (~1 kb) dsRNA to a lesser extent compared with ssRNA. In comparison, MDA5 appears to prefer long (>2 kb) dsRNA and is implicated in recognizing long viral genomes or stable viral RNA duplexes formed as replication intermediates [51]. The molecular basis of these distinct ligand specificities for RIG-I and MDA5 is unclear, as is the ligand recognition mechanisms of the homologous receptor LGP2. LGP2 was initially identified as a suppressor of RIG-I signaling, but has been implicated in positive regulation of MDA5 signaling through a potential heterodimerization mechanism [48].

All three RLRs contain a DExD/H box helicase domain that binds ATP. The ATPase activity of their helicase domains is central to receptor activation and signaling [5]. Both RIG-I and LGP2, but not MDA5, contain a C-terminal repressor domain (RD) that maintains the receptors in a self-inhibited state in the absence of ligand binding [52]. The RD is also the primary binding site for RNA, which releases the receptors from self-inhibition through ATP-dependent conformational changes [53]. A well-defined subdomain called the C-terminal domain (CTD) within the RD was recently identified for all three RLRs and the crystal and NMR structures of RIG-I CTD were determined [54]. The structures illustrate a basic concave surface that is required for RNA binding and subsequent release of self-inhibition. Dimerization of CTD mediated by RNA binding was also demonstrated, suggesting an analogous mode of activation to that of TLR3 binding to dsRNA [54]. The signaling components of RIG-I and MDA5 reside at their N-terminal caspase activation and recruitment domains (CARDs), which interact with the CARD of their adapter molecule interferon β promoter stimulator-1 (IPS-1) on the mitochondria outer membrane [5559]. The mechanism of CARD–CARD interactions during RLR-mediated signaling remains unclear, though biophysical studies of CARD domains have suggested heterodimer formation of partner domains through complementary charge–charge interactions [60, 61]. The presence of tandem CARD domains in RIG-I and MDA5 and the complete absence of CARD domain in LGP2 hint at novel mechanisms of signaling regulation by RLRs compared with other known single CARD domain proteins.

New nucleic acid sensors in the cytoplasm

Recently, TLR-independent recognition of cytoplasmic DNAs [62, 63] and response to DNA vaccines [64, 65] have been reported. Interestingly, both microbial and host DNAs induce type I interferon expression, with right-handed B-form DNAs being more potent than left-handed Z-form DNAs [62]. Even though the exact identities of cytoplasmic DNA sensors remain controversial, several intriguing candidates have been proposed.

NLRP3 (CIAS1/cryopyrin/NALP3) was first identified in patients with familial cold autoinflammatory syndrome and Muckle–Wells syndrome [66]. Importantly, treatment of these patients with IL-1 receptor antagonists dramatically alleviates their symptoms [67], highlighting a prominent role of NLRP3 in pro-IL1β processing. NLRP3 belongs to a large family of cytoplasmic PRRs: the NLRs [68, 69]. Among these are NALPs (NACHT-, LRR-, and pyrin-domain-containing proteins), NLRC4/IPAF (ICE-protease-activating factor or CARD12 and CLAN), NOD proteins, and class II trans-activator (CIITA). The majority of NLRs utilize a central adapter protein ASC (apoptosis-associated speck-like protein containing a CARD) in their signaling pathways [70]. Activation of caspase-1 and subsequent pro-IL1β processing is mediated by the formation of a large protein complex composed of NLRs (such as NLRP1, NLRP3, or NLRC4), the adapter ASC and pro-caspase-1, collectively termed the “inflammasome” [71].

NLRP3 was recently implicated in sensing bacterial and viral RNA, although the evidence is debatable since the presence of ATP, another potent activator of procaspase-1, is required [72]. More convincing evidence was reported for NLRP3-dependent maturation of IL-1β in response to infection by adenovirus, a non-enveloped DNA virus [73]. In comparison, infection with empty viral capsids without DNA failed to induce IL-1β processing. Surprisingly, liposome-mediated transfection of viral, bacteria, fish, or human DNAs resulted in IL-1β maturation in the absence of NLRP3. Instead, the crucial adapter protein ASC is required, suggesting that other members of the NLR family may be involved in DNA sensing as well [73].

The exact mechanism of NLRP3 recognition of DNA is unclear, neither is its ability to respond to various other stimuli from endogeneous, microbial, and environmental sources, such as β-amyloid [74], uric acid crystals [75], asbestos [76], silica crystals, or aluminum salts, an adjuvant frequently used in human vaccines [77]. It should be emphasized that none of the reports have definitively characterized direct binding of NLRP3 to any of the above “ligands.” Instead, a more likely scenario is indirect activation of NLRP3 through a common cellular stimulus, such as potassium efflux or elevated reactive oxygen species (ROS) level [68, 78, 79], which may enhance the formation of NLRP3 inflammasome and activation of procaspase-1. Members of the NLR family are therefore often referred to as “sensors” instead of “receptors” because of the proposed indirect activation process.

Z-DNA binding protein 1 (ZBP1) was first identified as a tumor response gene that is highly expressed in lymphatic tissues [80, 81]. It contains N-terminal Zα and Zβ domains known to stabilize and induce Z-forms of DNA and RNA. Such Z-DNA binding activity is also present in virulence factor E3L from vaccinia virus. Abrogation of the Z-DNA binding function of E3L significantly decreased viral pathogenicity, whereas replacing the mutant Z-DNA binding domain of E3L with the Zα domain of ZBP1 restored viral lethality [82]. The role of ZBP1 in antiviral innate defense was underscored by recent reports that established its ability to induce type I interferon secretion in response to cytoplasmic DNAs [83]. However, gene knock-out studies suggested that innate immune response to DNA vaccines is independent of both ZBP1 and TLR9 [84]. This apparent inconsistency suggests that additional DNA sensors such as NLRP3 may play predominant roles under certain experimental settings, and different DNA recognition mechanisms may have redundant roles in innate immune responses.

Future perspectives

As discussed above, a prominent feature of the immune system is the built-in redundancy and complementarity from partially overlapping functions of different cell types and receptors/sensors that respond to various forms of stimuli. The power of redundancy perhaps evolves to ensure fail-safe response to infections; nonetheless it renders hosts vulnerable to autoimmune diseases precisely because the same stimuli such as nucleic acids are present in both pathogens and hosts.

Complementary and orchestrated innate immune response to nucleic acids is reflected by concurrent activation of different PRRs to the same stimuli, sometimes in various cell types. For example, pDCs respond to influenza virus infection through the recognition of ssRNA viral genome by TLR7, whereas RIG-I/IPS-1 pathway is activated by the same virus in fibroblasts and myeloid DCs. Abrogation of either pathway retained interferon response to viral infection, whereas deficiency in both leads to failed antiviral response and much higher viral titers [85]. It is also interesting to note that induction of influenza-specific B and T cell responses do appear to be more dependent on the TLR7/MyD88 pathway than the RIG-I/IPS-1 pathway [85]. In contrast to the positive contribution by both TLR7 and RIG-I, activation of TLR3 by influenza virus promotes excessive inflammation and lethality in mice [14]. Therefore, the “net” outcome of immune response against a particular pathogen originates from elaborate interplay among various signaling pathways or networks.

A major motivation for studying innate immunity is the potential for better vaccine adjuvants through thorough understanding of the pattern recognition mechanisms. This requires detailed biophysical characterization of receptor:ligand interactions and structural studies of receptors in association with diverse ligands. Targeting nucleic acid-recognizing PRRs has become a particularly fertile field for vaccine adjuvant development because highly purified ligand analogs are readily available through chemical synthesis. Many of the nucleic acid analogs have been demonstrated to enhance both humoral and cellular immune responses. A well-studied example is TLR9 ligand ODNs that have been shown to be effective adjuvants not only in traditional prophylactic vaccination, but also in therapeutic vaccination against cancer and allergy because of their strong induction of Th1 polarization [86]. Future structural characterization of various ODNs in complex with TLR9 will reveal mechanisms of ligand-binding specificity, as well as molecular basis of agonistic or antagonistic effects of various classes of ODNs. Structures of signaling complexes containing adapter proteins MyD88, IPS-1, or ASC will furnish invaluable insights into the common determinants of respective signaling pathways. Structures of full-length receptors such as RIG-I in the presence and absence of ligands are crucial for our understanding of the mechanisms of self-inhibition and conformational changes involved in receptor activation. Biochemical and biophysical studies of NLRP3 and other NLR members will unravel the mystery of ligand sensing and indirect activation by potassium efflux or ROS. Ultimately, structural and molecular characterization of PRRs, ligands, signaling adapters, and their complexes will be instrumental in guiding the design of more efficient vaccine adjuvants for clinical applications.