Introduction

Nuclear speckles (NS), previously referred to as interchromatin granule clusters (IGCs), are small, membrane-less organelles that reside within the nucleus (Fig. 1). A large number of proteins are located within NS, and classes of NS proteins include gene transcription regulatory molecules, chromatin remodelers, precursor mRNA (pre-mRNA) splicing, RNA-associated proteins, cleavage and polyadenylation factors, and mRNA export. Based on the known functions of proteins located within the NS, they are considered a hub of complex RNA metabolism steps. In this review article, the current understanding of NS function and genetic disorders associated with NS dysfunction are discussed.

Fig. 1
figure 1

Intranuclear distribution of nuclear speckles and its contents. a Immunofluorescence staining of SRRM2, a core structural protein of nuclear speckles in the HCT 116 colon cancer cell line. Anti-SRRM2 antibody produced in rabbit (Sigma Aldrich: HPA041411-100UL.1:500 dilution) was used for the immunofluorescence staining. Merged image: SRRM2 (red), and DAPI (blue). Bar: 5 µm. b Schema showing the classes of RNA metabolism-regulating proteins located within nuclear speckles

Historical background

NS were originally described by Spanish physician Santiago Ramón y Cajal in (1910). By using acid aniline staining, Dr. Cajal discovered irregular and transparent clumps in the nucleus of a neuron. This structure was later referred to as speckles by Dr. J. Swanson Beck in 1961, in his article describing the nuclear staining patterns of patients with autoimmune disorders (Beck 1961). NS was also called as IGCs that were visualized by electron microscopy (Swift 1959). The first connection between NS/IGCs and RNA processing was established in the early 1980s based on the discovery of small nuclear ribonucleoprotein particles (snRNPs), which catalyze pre-mRNA splicing and colocalize within NS/IGCs (Lerner et al. 1981; Spector et al. 1983). Subsequent studies have revealed the diverse roles of NS beyond mRNA splicing, extending to gene transcriptional regulation, various RNA modifications, and nuclear export.

Mechanism of NS organization

Immunofluorescence staining of NS proteins revealed that NS are irregular, mottled structures found inside the nucleus (Fig. 1). In interphase, the nucleus usually contains 20–50 speckles whose sizes are up to several micrometers (Lamond and Spector 2003). The number of NS and their size possibly vary among different cell types, reflecting the levels of gene expression required for each cell type, because transcriptional suppression or splicing inhibition causes a reduced number of NS accompanied by an enlargement of each NS’s size (Fei et al. 2017; Kim et al. 2019; O'Keefe et al. 1994; Spector et al. 1991).

NS demonstrate a layered structure. SON and SRRM2 locate at the core of NS, and these core proteins are surrounded by MALAT1 (noncoding RNA), U1 and U2 small nuclear RNAs (Fei et al. 2017). SC35 was also considered as a core structural NS protein, but a recent paper raised a possibility that SC35 detected by immunofluorescence was SRRM2 (Ilik et al. 2020). In contrast to organelles, whose borders are demarcated by membranes, NS forms their structures by self-assembly of its protein content. This self-assembly is facilitated by the unique physical properties of NS proteins that demonstrate liquid–liquid phase separation. Phase separation enables the formation of droplet-like structures. These physical properties facilitate the formation of membrane-less NS, whose core proteins (SRRM2 and SON) facilitate NS formation (Ilik et al. 2020; Xu et al. 2022). NS proteins tend to contain arginine repeats, and arginine-enriched mixed-charge domains form condensates that drive speckle assembly (Greig et al. 2020).

Contents of NS

The NS is composed of RNA and various RNA metabolism-regulating proteins. The previous studies have revealed the presence of RNAs, particularly poly (A)-attached RNAs and noncoding RNAs, in NS (Carter et al. 1991; Shopland et al. 2002). On the contrary, NS contain little or no DNA (Carter et al. 1991; Spector and Lamond 2011). More than 50% of the proteins residing within the NS are involved in transcription or splicing regulation (Galganski et al. 2017) (Fig. 1b). Examples of NS proteins include speckle core proteins (SRRM2 and SON), transcriptional regulatory molecules (Pol II subunits, CDK12, and CDK13), RNA-binding proteins (RBM10 and RBM8A), and snRNP (SF3B4 and SF3B2).

For the comprehensive identification of NS proteins, we obtained a list of 380 NS proteins residing within the NS from a public database, the Human Protein Atlas database [https://www.proteinatlas.org/] (Supplemental Table 1) (Uhlen et al. 2015). Using the obtained list, we performed gene ontology (GO) analysis using DAVID functional annotation tool (Dennis et al. 2003). As expected, “nuclear speckles” were the top-ranked GO terms of the cellular component (Supplemental Table 2). Other cellular component terms enriched in NS proteins were “nuclear body,” “nucleoplasm,” “spliceosomal complex,” “chromosome,” “ribonucleoprotein complex,” and “chromatin.” Biological process GO term ranking indicated that “mRNA processing” was the top-ranked term, followed by “RNA splicing,” “gene expression,” and “regulation of transcription.” (Supplemental table 3). Molecular function GO terms enriched were “nucleic acid binding,” “DNA binding,” “RNA binding,” and “RNA polymerase II regulatory region DNA binding.” (Supplemental table 4). We acknowledge the limitations of the use of the Human Protein Atlas database and those include antibody specificities and cell fixation conditions, because this database relied on immunofluorescence staining experiments using fixed cells. Of note, cell fixation conditions are known to affect localization of certain NS proteins (Dopie et al. 2020). As a result, it is possible that the obtained NS protein list may include false positives or negatives. Despite these limitations, similar sets of proteins were identified as NS proteins in studies using mass spectrometry (Dopie et al. 2020; Saitoh et al. 2004). Collectively, these findings support the notion that gene expression or mRNA processing/splicing regulation are the main functions of NS proteins.

Functions of NS

Consistent with the findings of GO analyses of NS proteins, NS is currently regarded as a site of gene expression regulation from transcription to RNA processing. Pol II catalyzes DNA transcription to synthesize pre-mRNA. This active transcription occurs at the periphery of the NS, and gene expression is increased by their association with NS (Kim et al. 2020). During active gene transcription, Pol II undergoes serial protein modifications, including phosphorylation of the C-terminal domain of Pol II. Various cyclin-dependent kinases (CDKs) are involved in the phosphorylation of Pol II, and CDK12/CDK13, NS proteins, are the CDKs involved in Pol II phosphorylation that determines the activity of Pol II (Fan et al. 2020).The repertoire of phosphorylated Pol II-interacting proteins includes a large number of spliceosomal proteins found in NS, suggesting that Pol II is recruited to the NS during active transcriptional elongation (Nojima et al. 2018). Of note, phosphorylation of Pol II facilitates the incorporation of Pol II into splicing condensates (Guo et al. 2019; Lu et al. 2018), while it remains to be determined whether these splicing condensates are NS or not. It is possible that condensate formation of Pol II may be involved in the mechanism linking gene transcription to subsequent RNA-processing steps within NS.

Pre-mRNA is converted to mRNA after several processing steps, including splicing. As many NS proteins are known to regulate these pre-mRNA-processing steps (Dias et al. 2010), NS is thought to be involved in the regulation of these steps. However, it remains to be determined whether these steps occur within NS or not. During pre-mRNA synthesis, the spliceosome, composed of five snRNPs (U1, U2, U4/U6, and U5) and non-snRNP splicing factors, is recruited to the pre-mRNA, where it catalyzes splicing reactions. Splicing is a stepwise process that includes catalytic steps (splice-site recognition, intron lariat formation, and intron excision) and post-catalytic steps (exonic ligation and mRNA release from the spliceosome). Throughout this process, the spliceosome undergoes dynamic component changes wherein many proteins join and leave the complex. For example, U2 snRNP, containing SF3B4 and SF3B2, is recruited during splicing steps (Sun 2020), RBM8A is recruited to the exon junction complex (EJC) (Chuang et al. 2015), and RBM10 is recruited to the pre-mRNA during splicing (Inoue et al. 2014; Wang et al. 2013). These splicing processes are intricately integrated into the Pol II transcription process and Pol II regulation, and the pre-mRNA splicing steps are interdependent. For example, alterations in Pol II activity can affect pre-mRNA splicing (Fong et al. 2014). Conversely, pre-mRNA splicing factors, such as SC35, can influence the Pol II transcription rate (Ji et al. 2013; Lin et al. 2008). A detailed review of pre-mRNA splicing can be found in Lee and Rio (Lee and Rio 2015).

NS also contains proteins involved in the regulation of other mRNA maturation processes including 5′ end capping, 3′ end polyadenylation (poly(A)), N6-methyladenosine (m6A) modification, and mRNA nuclear export. 5′ capping involves the addition of 7-methylguanosine triphosphate to the 5′ end of mRNA transcripts (Reviewed in (Ramanathan et al. 2016)). 3′ tail polyadenylation involves the addition of a series of adenine repeats to the end of the transcripts (Reviewed in (Liu et al. 2022)). Interestingly, NS contain Star-PAP, which differentially regulates mRNA 3′ end processing of certain transcripts depending on oxidative/DNA damage pathways (Li et al. 2013; Mellman et al. 2008). Monomethylated adenosine at the N6 position (m6a) is the most well-characterized epitranscriptomic modification. m6A provides a binding site for m6A-binding proteins such as YTH family proteins (Reviewed in (Boulias and Greer 2022)). These mRNA modifications serve as a method to stabilize mRNA and facilitate the transfer of mature mRNA to the ribosome for translation. After the completion of RNA processing, processed transcripts are transported to the cytosol by nuclear export factors such as the TREX complex (Reviewed in (Khan et al. 2022)). Depletion of TREX complex components enhances the association of mRNA with NS, supporting the role of NS in mRNA nuclear export (Dias et al. 2010).

In addition, a role for NS in genome organization has been proposed. Knockdown of SRRM2, a core structural NS protein, resulted in the reduction of intra-chromosomal interactions within active chromatin compartments (Hu et al. 2019). These findings were supported by genomic and cytological studies, as NS resides near the highly transcribed genes of active compartment A, which are mutually exclusive to lamina-associated inactive compartment B (Chen et al. 2018; Su et al. 2020). These observations suggest a role for NS in maintaining chromatin interactions in active compartments.

Although chromatin remodeler proteins, including SETD1A, SETD2, and HDAC4, are located within NS, the role of these proteins in NS function remains unclear. Interestingly, only subsets of chromatin remodelers reside within NS. Based on the known functions of these proteins (Wang et al. 2021b, 2014; Xu et al. 2019), chromatin remodelers located within the NS are expected to influence gene transcription or splicing efficacy.

Although the above-mentioned proteins mainly localize within NS, they also present outside the NS. Therefore, the interpretation of their role in NS requires caution, as their canonical function can be executed outside of NS.

NS target genes and transcripts

To support the notion that NS is involved in gene expression regulation, gene expression levels correlates inversely with distance to nuclear speckles (Zhang et al. 2020). Although a sparse amount of DNA resides within the NS, genomic loci located near the NS have been evaluated using tyramide signal amplification sequencing (TSA-seq) (Chen et al. 2018) and split-pool recognition of interactions by tag extension (SPRITE) (Quinodoz et al. 2018). Considering that NS proteins function as universally required transcriptional and splicing regulatory molecules, it is expected that the majority of Pol II-transcribed genes require RNA processing steps orchestrated by NS. However, NS tend to localize close proximity to genomic regions of certain features, such as genes residing within the active A chromatin compartment, highly expressed genes, housekeeping genes, genes with low transcriptional pausing, and super-enhancers (Chen et al. 2018; Quinodoz et al. 2018). HSPA1 genes (HSPA1A, HSPA1B, and HSPA1L), that encode heat shock 70 kDa proteins, represents examples of well-characterized NS target genes (Kim et al. 2019). There is a significant overlap between the NS target and p53 target genes, suggesting the role of p53 in determining NS target genomic regions (Alexander et al. 2021). Considering that many NS proteins lack DNA-binding domains, transcription factors, including p53, may determine the genomic loci targeted by NS.

Unsurprisingly, NS-associated transcripts demonstrated a high degree of overlap with NS-associated genomic loci. NS-associated coding transcripts are characterized by a high GC content, and their transcript lengths tend to be shorter than those associated with other nuclear components (Barutcu et al. 2022). Speckle-associated transcripts were also enriched for transcripts with retained short introns, suggesting the role of NS in the splicing of shorter introns (Barutcu et al. 2022). To support this notion, SRRM2 deficiency causes exon skipping of short introns surrounded by weak splice sites (Xu et al. 2022). Notably, exon skipping with weak splice sites is influenced by the transcriptional rate (Dujardin et al. 2014). Therefore, it is possible that the observed splicing defects may be the consequence of transcriptional rate changes.

To explain the high gene expression levels of NS target genes, recent cytological investigations have revealed that NS is involved in the gene expression bursting of NS-associated genes (Alexander et al. 2021; Kim et al. 2019). By maintaining the close proximity between the genomic locus and NS, ample amounts of transcriptional and splicing regulatory molecules are furnished to these genomic loci, resulting in gene expression amplification. However, it remains to be determined whether this gene expression bursting is at the transcriptional versus post-transcriptional level.

NS and human genetic disorders

Reflecting the importance of proper nuclear speckle function in regulating normal human development, an increasing number of genetic disorders have been found to result from mutations in the genes encoding NS proteins. Several notable examples include ZTTK syndrome due to SON mutations, TARP syndrome due to RBM10 mutations, and TAR syndrome due to RBM8A mutations.

We investigated the involvement of genes encoding NS proteins in the pathogenesis of human genetic diagnoses using public databases (OMIM: Online Mendelian Inheritance in Man, OMIM®. McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University (Baltimore, MD) https://omim.org/ and HPO browsers: (Kohler et al. 2021)). A systematic search of NS genes within the “genemap2.txt” file obtained from the OMIM database was conducted on 11/3/2022. A total of 291 out of 380 genes, whose encoded proteins localize within NS based on Protein Atlas database, were found to be associated with human disease phenotypes. Strikingly, among the top 100 annotated NS proteins reported by Dopie et al. (2020), 95 out of 100 genes were found to be associated with human disease phenotype. Therefore, most NS proteins have been implicated in the pathogenesis of human diseases.

Next, we investigated the clinical features commonly observed in disorders due to mutations in genes encoding NS proteins based on the Protein Atlas data analysis. A table listing the associations between genes and Human Phenotype Ontology (HPO) terms was obtained from the HPO browser (genes_to_phenotype.txt; available at: http://purl.obolibrary.org/obo/hp/hpoa/genes_to_phenotype.txt; data queried on 11/3/22). The HPO terms associated with NS genes were ranked by frequency, and the results are presented in Table 1. The most common HPO term associated with NS genes was “global developmental delay.” Similarly, HPO terms associated with brain developmental defects, such as “intellectual disability” and “seizures” were ranked highly in the table. Similar result was obtained by using NS proteins identified by Dopie et al. (Supplemental table 5). These findings are of particular interest as most of the NS proteins are ubiquitously expressed. Therefore, the enrichment of neurological symptoms indicates the importance of NS function in human brain development. Alternative splicing has been proposed as a method to diversify protein repertoires, particularly in the brain (Zhang et al. 2016); therefore, it is possible that brain tissues may be more susceptible to NS dysfunction, which could trigger splicing alterations, compared to other organ systems.

Table 1 Top 20 frequent HPO terms associated with NS genes identified by Protein Atlas data analysis

Nuclear speckleopathies due to mutations in genes encoding NS proteins

To denote genetic disorders caused by mutations in genes encoding NS proteins, we propose a term “nuclear speckleopathies” (Table 2). The current knowledge of various nuclear speckleopathies and the molecular mechanisms underlying these disorders are discussed herein.

Table 2 List of nuclear speckleopathies

ZTTK syndrome due to SON mutations

ZTTK (Zhu-Tokita-Takenouchi-Kim) syndrome (MIM# 617,140) is characterized by developmental delay/intellectual disability, hypotonia, skeletal abnormalities, and facial dysmorphisms (Kim et al. 2016b; Takenouchi et al. 2016; Tokita et al. 2016; Zhu et al. 2015). Recent large cohort studies have revealed that developmental delay/intellectual disability are almost universally observed in individuals with ZTTK syndrome, and the range of intellectual disability ranges from mild to severe (Dingemans et al. 2022; Kushary et al. 2021). Brain structural abnormalities, such as hypoplasia/agenesis of the corpus callosum and ventricular enlargement, are common in individuals with ZTTK syndrome (83–90%). Seizures (50–60%) and hypotonia (60%) are also common in individuals with ZTTK syndrome. This syndrome is also associated with structural birth defects, including congenital heart defects (18–33%, particularly atrial septal defect or ventricular septal defect) and genitourinary malformations (31–47%) (Dingemans et al. 2022; Kushary et al. 2021). Visual and hearing impairments and immunological abnormalities have also been observed in individuals with ZTTK syndrome (Dingemans et al. 2022). Facial dysmorphic features were commonly observed; however, no unifying, recognizable features were identified (Dingemans et al. 2022).

Germline heterozygous loss of function (LoF) mutations in SON cause ZTTK syndrome. Most of the identified mutations are predicted LoF variants, such as frameshift, nonsense, or splice-site variants (Dingemans et al. 2022; Kushary et al. 2021). A small number of missense or in-frame deletion variants have been reported as candidate variants; however, the clinical significance of these variants remains unclear (Dingemans et al. 2022; Kushary et al. 2021).

Molecular mechanism

SON was originally identified as a NS protein by comprehensive proteomic analysis (Saitoh et al. 2004). SON functions as a scaffolding protein for RNA-processing factors in NS (Sharma et al. 2010). SON plays various transcriptional regulatory roles, including splicing and epigenetic regulation (Kim et al. 2016a; Lu et al. 2013). A recent study has demonstrated that SON is a core structural component of NS (Ilik et al. 2020).

Genome-wide transcriptome analysis has not been performed to date using samples from patients with ZTTK syndrome; however, misexpression and alternative splicing events of a selected set of 11 genes have been reported in the context of ZTTK syndrome using blood samples obtained from patients (Kim et al. 2016b). A recent report by Ueda et al. investigating the neuronal effects of Son in a mouse model has demonstrated that Son knockdown results in neural migration defects and dendritic spine abnormalities, which are reversed via induction of the human wild-type SON gene, suggesting the importance of SON in regulating brain development (Ueda et al. 2020).

Neurodevelopmental disorder due to SRRM2 mutation

A novel neurodevelopmental disorder due to LoF variants of SRRM2 has been recently reported in a cohort of twenty-two patients by Cuinat et al. (Cuinat et al. 2022). Clinical features include mild developmental delay, variable intellectual disability, predominant speech delay, autism or attention-deficit/hyperactivity disorder, overfriendliness, generalized hypotonia, increased weight, and dysmorphic facial features. Variant analysis of the patient cohort identified twelve frameshift variants, eight nonsense variants, and two microdeletions of 66 and 270 kb (Cuinat et al. 2022).

Molecular mechanism

SRRM2 is a highly conserved core scaffolding protein component of NS (Miyagawa et al. 2012). SRRM2 forms a complex that facilitates pre-mRNA maturation as one of the main catalytic components of the spliceosome and promotes the interaction between pre-mRNA and splicing factors (Blencowe et al. 2000; Sawada et al. 2000). In a study by Hu et al., wherein Hi-C experiments using Srrm2 knockdown models in mouse hepatocytes were performed to characterize the effects of SRMM2 on 3D genome organization and gene expression, it was demonstrated that the loss of SRRM2 resulted in a global decrease in intra-topologically associating domain (TAD) chromosomal interactions in active (type A) compartments and an increase in repressive (type B) compartments (Hu et al. 2019). They also observed that Srrm2 knockdown resulted in changes in the expression of over 1000 genes. Even though these results illustrate the mechanisms by which SRRM2 maintains specific patterns of genomic interactions in NS, these effects were not associated with corresponding changes in gene expression at the TAD level. Together, these results suggest that while loss of SRRM2 causes a reduction in TAD interactions, particularly in A1 sub-compartments, TAD-mediated transcription is preserved, and the observed differences in gene expression may be a result of alternative or downstream components of genome organization.

NKAP-related syndrome due to NKAP mutations

NKAP-related syndrome (MIM# 301039) is characterized by developmental delay, behavioral abnormalities, cardiac abnormalities, tall stature, scoliosis, camptodactyly, and obesity (Fiordaliso et al. 2019). NKAP-related syndrome is caused by hemizygous missense mutations in NKAP. In the human genome, NKAP resides on the X-chromosome. NKAP mutations primarily cause symptoms in males. Females with heterozygous NKAP mutations are unaffected or significantly less affected than their male counterparts, although the full clinical spectrum of females with heterozygous NKAP mutations remains unknown. Notably, the mutations identified in individuals with NKAP-related syndrome showed clear clustering within the C-terminus of the NKAP protein.

Molecular mechanism

NKAP was originally identified as a nuclear protein that promotes NF-κB activation (Chen et al. 2003). Pajerowski et al. demonstrated that NKAP interacts with HDAC3 and functions as a transcriptional regulator of the Notch signaling pathway (Pajerowski et al. 2009), although the role of NKAP-HDAC3 in global transcriptional regulation remains unknown. Burgute et al. demonstrated that NKAP localizes within NS and binds to RNA to regulate mRNA splicing (Burgute et al. 2014).

Previously, a conditional Nkap knockout model was reported. Similar to human NKAP, mouse Nkap also resides on the X-chromosome. Hematopoietic lineage-specific Nkap hemizygous null male mice exhibit embryonic lethality (Pajerowski et al. 2010). However, the role of NKAP in the development and function of other organ systems was not studied. A recent study has documented strong Nkap expression in the mouse brain, indicating a potential role for NKAP in neuronal development and function (Worlitzer and Schwamborn 2014).

Considering that the mutations identified in NKAP-related syndrome are clustered in the C-terminal region of NKAP, it is possible that a particular aspect of NKAP function is disrupted in NKAP-related syndrome. Notably, the C-terminus of NKAP harbors an HDAC3-interacting domain and a post-catalytic spliceosomal P-complex interacting domain (Fica et al. 2019; Pajerowski et al. 2009). Therefore, in a previous study, we hypothesized that NKAP mutations alter transcription. To test this hypothesis, transcriptome analysis was performed using RNA sequencing in lymphoblastoid cell lines obtained from individuals with NKAP mutations and controls. Consistent with the role of NKAP as a post-catalytic P-complex subunit, NKAP mutations had a minimal impact on splicing but did demonstrate a large number of differentially expressed genes in NKAP-related syndrome (Fiordaliso et al. 2019). Thus, we concluded that NKAP mutations cause only quantitative transcriptome changes without qualitative alternative splicing in NKAP-related syndrome. However, the mechanism underlying this transcriptome profile remains unknown.

Congenital heart defects, dysmorphic facial features, and intellectual developmental disorder (CHDFIDD) due to CDK13 mutations

CHDFIDD (MIM #617360) was first reported by Sirfrim et al. in a cohort of seven children harboring heterozygous missense mutations within the CDK13 kinase domain (Sifrim et al. 2016). The patients exhibited the following clinical features to a variable extent: atrial and/or ventricular septal heart defects, pulmonary valve abnormalities, dysmorphic features, global developmental delay, intellectual disability, seizures, microcephaly, structural brain malformations, feeding difficulties, and digital anomalies (Sifrim et al. 2016). In a subsequent report, Hamilton et al. described nine additional pediatric patients presenting with developmental delays, variable intellectual disability with autistic features, common facial gestalt, digital anomalies, and poor feeding. Notably, only two patients in this cohort had structural heart defects, and one had seizures (Hamilton et al. 2018).

Molecular mechanism

CDK13, a cyclin–dependent kinase (CDK), belongs to a class of ATP-dependent serine-threonine protein kinases that mediate extracellular and intracellular signals to regulate cell cycle progression and gene expression. Researchers have sought to define the role of CDK13 in NS. A recent study has supported the role of CDK13 in transcriptional regulation by activating RNA Pol II (Fan et al. 2020). The N-terminus of CDK13 contains arginine and serine-rich (RS) motifs, which generally regulate splicing (Marques et al. 2000). The localization of CDK12 and CDK13 to NS is dependent upon their respective RS motifs (Even et al. 2006; Ko et al. 2001). Even et al. previously demonstrated that CDK13 is highly concentrated in NS and regulates splicing by controlling the phosphorylation status and activity of splicing factors (Even et al. 2006). Liang et al. systematically identified protein-binding partners for CDK12 and CDK13 and found that RNA-processing factors, spliceosomes, and NS components were significantly enriched in purifications for both CDK12 and CDK13 (Liang et al. 2015).

The role of CKD13 in NS has previously been reported in cancer and HIV; however, its role in the pathophysiology of CHDFIDD remains to be established (Berro et al. 2008; Greenleaf 2019). To date, the reported variants associated with CHDFIDD are heterozygous missense mutations isolated from the CDK13 protein kinase domain (Hamilton et al. 2018; Sifrim et al. 2016). Molecular modeling analysis of the variants predicted structural changes resulting in a significant loss of catalytic activity of the enzyme, including deleterious effects on ATP binding, magnesium binding, and cyclin K interactions (Hamilton et al. 2018; Sifrim et al. 2016). However, functional studies have not yet established the predicted effects of these observed variants. Numerous LoF variants in CDK13 have been reported in relatively healthy individuals within the gnomAD and Decipher databases, suggesting that LoF mutations alone may not account for the severe clinical phenotypes observed in patients with CHDFIDD and that dominant negative effects may contribute, wherein the resultant inactive enzymatic complexes could compete with active complexes for binding substrates (Hamilton et al. 2018). Considering that the reported CHDFIDD variants are predicted to retain the ability to bind cyclin K, it is plausible that cyclin K could be sequestered into inactive complexes (Hamilton et al. 2018). Pathogenic variants in CDK13 associated with CHDFIDD have not been observed in the RS domains; thus, it is possible that the localization of CDK13 to NS is preserved in these patients, although the predicted reduction in catalytic activity may result in the loss of phosphorylation of splicing factors within the NS.

TARP syndrome due to RBM10 mutations

TARP syndrome, characterized by talipes equinovarus, atrial septal defect, Robin sequence, and persistent left superior vena cava, is an X-linked disorder with frequently observed male lethality that was originally discovered by Gorlin et al. (Gorlin et al. 1970), and is caused by inactivating mutations in RBM10 (RNA-binding motif 10; MIM#300080) (Johnston et al. 2010; Kaeppler et al. 2018; Niceta et al. 2019). Notably, the four cardinal features are not consistently present, and additional nonclassical findings have been observed (Kaeppler et al. 2018). Other clinical features include feeding difficulties, developmental delay, facial dysmorphisms, other distal-limb and digital anomalies, various renal anomalies, deafness, oral masses, chorio-retinal atrophy, bronchopulmonary pathology, skeletal anomalies, cerebral and cerebellar anomalies, and multiple other cardiac lesions, including hypertrophic cardiomyopathy (Kaeppler et al. 2018; Niceta et al. 2019). Pathogenic mutations reported to date include frameshift and nonsense variants predicted to lead to destabilization of the protein structure as well as induction of nonsense-mediated mRNA decay (Johnston et al. 2014, 2010; Kaeppler et al. 2018; Niceta et al. 2019). Recent studies with expanded clinical follow-up have also identified male patients with nontruncating and truncating variants with milder phenotypes that have survived into adolescence (Hojland et al. 2018; Niceta et al. 2019).

Molecular mechanism

RBM10, also known as S1-1, is a ubiquitously expressed protein possessing two RNA recognition motifs, a nuclear localization signal, two zinc finger motifs and a G patch domain that is frequently observed in splicing regulatory RNA-binding proteins (Niceta et al. 2019). RBM10 has been proposed to coordinate diverse post-translational processes and interact with more than 200 spliceosome proteins (Inoue et al. 2014; Wang et al. 2013). Interestingly, RBM10 exists in a dynamic state between the nucleoplasm and membrane-less nuclear bodies called S1-1 NBs (Inoue 2021; Wang et al. 2021a). S1-1 NBs are often spatially overlap with NS and show a similar change in size in response to transcriptional states of cells, indicating a close functional relationship between NS and S1-1 NBs (Inoue et al. 2008). Wang et al. showed that RBM10 is sequestered in S1-1 NBs when it is not engaged in alternative splicing or when transcription decreases, and can then dissociate from S1-1 NBs into the nucleoplasm when transcription increases; therefore, it may participate in alternative splicing (Wang et al. 2021a). Given the severe, pleiotropic effects of RBM10 variants in patients with TARP syndrome, it is possible that the role of RBM10 within S1-1 NBs has broad effects on gene expression and developmental processes; however, the exact molecular mechanisms by which effects on RNA binding and alternative splicing contribute to disease remain to be elucidated.

TAR syndrome due to RBM8A mutations

Thrombocytopenia-absent radius (TAR) syndrome [MIM #274000] is a well-recognized clinical entity characterized by hypomegakaryocytic thrombocytopenia and bilateral radial aplasia with the presence of both thumbs. Additional associations include congenital heart disease, genitourinary tract abnormalities, structural brain differences, hip dislocation, knee subluxation, cow milk intolerance, and resultant eosinophilia (Klopocki et al. 2007). Klopocki et al. demonstrated that affected individuals commonly harbor de novo or inherited deletions on chromosome 1q21.1; however, the authors hypothesized that an additional genetic modifier must be present to cause the disease, given its complex mode of inheritance and sporadic nature (Klopocki et al. 2007). Albers et al. discovered compound inheritance mechanisms that cause TAR syndrome involving a rare null allele (deletion, frameshift mutation, or premature stop codon) and one of the two low-frequency noncoding single nucleotide polymorphisms (SNPs) in RBM8A (Albers et al. 2012).

Molecular mechanism

RBM8A (also known as Y14) is a component of the EJC, which is required for several mRNA regulatory processes, including splicing, mRNA export, and nonsense-mediated decay (Chuang et al. 2015). Albers et al. showed that two regulatory SNPs observed in patients with TAR syndrome resulted in diminished RBM8A transcription in vitro and reduced expression in patient platelets. The authors concluded that TAR syndrome was a result of EJC dysfunction due to Y14 insufficiency (Albers et al. 2012). Notably, EJC proteins accumulate in NSs alongside most other splicing-related factors, and such interactions with NS occur in both transient and long-term states (Schmidt et al. 2009). Future functional studies are needed to fully characterize the downstream effects of the complex genetic etiology underlying TAR syndrome on EJC functions that are coordinated with NS to regulate mRNA metabolism.

Nager syndrome due to SF3B4 mutations

Nager syndrome is a representative disorder of acrofacial dysostosis (AFDs) [MIM #155440]. Clinical features include midface retrusion, severe micrognathia, down-slanted palpebral fissures, hypoplastic or triphalangeal thumbs, radial hypoplasia or aplasia, radioulnar synostosis, upper-limb phocomelia, and lower-limb defects. Occasionally, patients exhibit short stature, cardiac malformations, costovertebral anomalies, and renal malformations. Notably, unlike most nuclear speckleopathies, patients with Nager syndrome typically have normal intelligence (Bernier et al. 2012; Petit et al. 2014). Most cases of Nager syndrome (~ 60%) are associated with autosomal dominant LoF mutations in SF3B4, including nonsense, frameshift, and splice-site variants (Bernier et al. 2012; Czeschik et al. 2013; Petit et al. 2014). Highly variable phenotypic severity with intra-family variability was noted as well (Petit et al. 2014).

Molecular mechanism

SF3B4 encodes the splicing promoter protein SAP49, an integral component of the SF3B complex that tethers the U2 snRNP branch site (Champion-Arnaud and Reed 1994). The full spectrum of phenotypic effects of SF3B4 haploinsufficiency may also be accounted for via diverse mechanisms in addition to its impact on spliceosome function. SAP49 interaction with the BMP signaling pathway underlying osteochondral cell differentiation has also been proposed as a molecular mechanism leading to the skeletal features of Nager syndrome (Petit et al. 2014). Considering that approximately one-third of patients with Nager syndrome do not possess detectable LoF SF3B4 variants, future work will be needed to clarify potential genetic heterogeneity inside and outside the spliceosome.

Craniofacial microsomia (CFM) due to SF3B2 mutations

CFM is an asymmetrical congenital malformation disorder (historically referred to as hemifacial microsomia, oculo-auricular-vertebral spectrum (OAVS), or Goldenhar syndrome) that can be inherited in an autosomal dominant manner [MIM #164210]. CFM presents with a spectrum of phenotypes and core clinical features, including hemifacial mandibular hypoplasia, microtia, facial and preauricular skin tags, epibulbar dermoids, and lateral oral clefts. At the mild end of the spectrum, patients present with microtia in the absence of other anomalies. Patients on the more severe end of the spectrum can exhibit more extensive multisystem disease, including anomalies of the cardiac, nervous, vertebral, and renal systems (Timberlake et al. 2021). Genetic and environmental etiologies have been proposed to explain this disorder (Barisic et al. 2014; Beleza-Meireles et al. 2014). LoF variants leading to haploinsufficiency of SF3B2 represents the most common genetic cause of CFM (Timberlake et al. 2021). Consistent observations of external ear malformations involving the tragus and mandibular hypoplasia suggest that SF3B2 variants predominantly influence pharyngeal arch I development. Functional studies have demonstrated that Sf3b2 knockdown disrupts cranial neural crest cell formation in Xenopus embryos (Timberlake et al. 2021).

Molecular mechanism

SF3B is a component of the U2 snRNP complex that interacts directly with SF3B4 to anchor U2 snRNP to pre-mRNAs. Interestingly, recent studies have implicated spliceosomal dysfunction in retaining highly conserved “poison exons” that harbor premature termination codons and are typically spliced out from pre-mRNAs in genes, which have substantial roles in developmental processes (Thomas et al. 2020). It is not yet understood how dysfunctional splicing that leads to the retention of such exons may contribute to the pathogenesis of CFM. Future studies focused on understanding the fundamental functions of SF3B4 and its potential interactions with other regulatory components of NS may identify additional “second-hit” mechanisms accounting for the phenotypic variability observed in CFM.

Nager syndrome due to SF3B4 mutations and CFM due to SF3B2 mutations are also referred to as “spliceosomopathies” (Beauchamp et al. 2020). Considering that these conditions are associated with a unique craniofacial phenotype without neurocognitive impairment, the underlying pathogenic mechanisms of these syndromes may differ from those of other NS component dysfunctions.

Summary

Here, we review the current knowledge of NS and the genetic diagnoses associated with NS dysfunction. Recent studies have revealed a critical role of NS in diverse aspects of RNA metabolism regulation, including transcriptional and splicing reactions, RNA modifications, and mRNA export. As expected from the pleiotropic functions of NS, an increasing number of genetic disorders are linked to NS dysfunction. However, as NS regulates multiple aspects of RNA metabolism, it has been challenging to identify the molecular step(s) whose perturbation is directly related to the clinical phenotype of nuclear speckleopathies. Since NS orchestrate complex gene expression regulation, mutations within genes encoding NS protein could affect a part of NS function. These mutations could also affect the encoded proteins’ function not only within NS, but also outside of NS. With the development of cutting-edge next-generation sequencing applications enabling transcriptome-wide RNA metabolism investigations, such as nascent RNA sequencing, and cross-linking immunoprecipitation sequencing (CLIP-seq), it has become technically feasible to carefully characterize the type of RNA metabolism defects associated with nuclear speckleopathies. In addition to proteins regulating RNA metabolism, NS contain chromatin remodelers such as SETD1A, SETD2, and HDAC4. Mutations in genes encoding chromatin proteins, collectively called chromatinopathies, cause developmental disabilities similar to nuclear specleopathies. As recent findings indicate the role of NS in chromatin organizational regulation (Hu et al. 2019), there may exist a common molecular perturbation between chromatinopathies and nuclear specleopathies. Elucidating the mechanisms underlying nuclear speckleopathies will contribute to the basic knowledge of NS and how its functional defects result in human developmental disorders.