Abstract
Emanuel syndrome is a rare autosomal disorder characterized by microcephaly, heart defects, cleft palate and developmental delay. However, there is a lack of specific prenatal screening for Emanuel syndrome. To screen for early diagnostic marker genes in fetuses with karyotype+der[22]t(11;22)(q23;q11) of Emanuel syndrome. Transcriptome sequencing and clinical trait data of t(11;22)(q23;q11) translocation samples were screened from the GEO database. The differentially expressed genes (DEGs) were screened by principal component analysis of gene expression by R package, and intersections were taken with balanced and unbalanced DEGs. Then, the correlation with clinical traits was determined by WGCNA analysis, GO and KEGG enrichment analysis, and then univariate Cox analysis and Lasso analysis were performed to obtain the key genes. The core regulatory genes were obtained after protein–protein interaction (PPI) network analysis. A total of 50 DEGs were obtained after differential analysis. WGCNA analysis showed that DEG was associated with the chromosomal imbalance and age module. GO and KEGG enrichment analyses showed candidate genes were associated with exocytic vesicle membrane, synaptic vesicle membranes, glycoprotein complex, dystrophin-associated glycoprotein complex and malaria. COX and Lasso analyses yielded 5 hub genes, including ZBED9, RGS20, SGCB, ETV5, and ZAP70. The results of PPI identified the key regulatory gene associated with chromosomal imbalance as the ZAP70 gene. ZAP70 may be a key gene for early diagnosis of Emanuel syndrome in fetuses with+der[22]t(11;22)(q23;q11) karyotype.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Emanuel syndrome (ES), also known as supernumerary der[22]t(11;22) syndrome with partial der[22] trisomy(Manju et al. 2022), is a rare inherited chromosomal disorder(Piwowarczyk et al. 2022), usually resulting from a 3:1 meiosis in carriers with the t(11;22)(q23;q11) translocation(Zackai and Emanuel 1980). Patients with this trisomy are characterized by severe developmental delay, pre and postnatal growth deficits, microcephaly, ear abnormalities, preauricular tag or invagination, and genital abnormalities (Li and Zhu 2022; Luo et al. 2017). To date, the majority of ES cases have been diagnosed in the postnatal period and only a few prenatal cases have been reported (Luo et al. 2020). Specific diagnostic methods of ES are still limited to prenatal invasive procedures such as chorionic villus sampling, amniocentesis, or cordocentesis. However, most pregnant women are reluctant to undergo such procedures to detect fetal abnormalities because of concerns about the risks associated with the invasive procedure. Although nonspecific ultrasound indicators such as intrauterine growth retardation, anomalies of the posterior cranial fossa, cardiac malformations, and other abnormalities can be applied for auxiliary diagnosis, amniotic fluid karyotyping is still required to diagnose a fetus with ES (Hao et al. 2022). In addition, only 16% of ES patients reported ultrasound abnormalities (Carter et al. 2009). Some phenotypes are difficult to diagnose by prenatal ultrasound, such as facial malformations, ear and genital anomalies, heart defects, and hypotonia. Moreover, these ultrasound findings do not distinguish fetuses with ES from other fetuses with genetic syndromes. The clinical data from five hospitals in Guangdong Province showed that between January 2015 and July 2021, only six ES fetuses with structural malformations were diagnosed by ultrasound (Luo et al. 2020). As far as methods and techniques for early diagnosis of ES are not yet available, there is an urgent need to improve the efficiency of ES diagnostic methods and reduce the probability of ES neonates. This study aims to investigate the hub gene to help targeted diagnosis for ES.
Materials and Methods
Data Sources
The dataset GSE13122 was obtained from GEO (http://www.ncbi.nlm.nih.gov/geo) for analysis of Emanuel syndrome data analysis, including transcriptome data and sample information of chromosome translocation of 9 t(11;22)(q23;q11) translocation (balanced translocation carriers), 4 ES individuals and 13 normal controls.
Differential Gene Screening
The study used the facto extra package of R software to analyze the expression of the genes by principal component analysis (PCA). Subsequently, the limma package was used to analyze the differences between the translocation samples and the control samples, and the balanced translocation samples and the unbalanced translocation samples, respectively. Both comparisons were screened for differentially expressed genes (DEGs) with P < 0.05 and |log2(Foldchange)|> 0.58, and the intersection genes of DEGs in the two groups were used as the candidate differential gene.
WGCNA Analysis of DEGs
The candidate genes obtained above were analyzed by gene clustering using the R package WGCNA. The power with the squared value of the correlation coefficient that reached 0.8 or more for the first time was selected. Secondly, based on the clustering and dynamic tree cutting method, the number of genes in the module was set to 10, and the gene modules were identified by the dynamic tree cutting method, while modules with the degree of similarity of more than 0.5 were clustered and merged. Finally, by calculating the correlation between modules and clinical phenotypes, the gene set modules closely related to clinical status were identified. Here, we selected gene modules that were significantly related to unbalanced translocation for subsequent analysis. Meanwhile, we analyzed the gender-related gene modules and further performed KEGG and GO enrichment to explore the underlying mechanism.
Model Construction and Model Reliability Verification
Based on the set of candidate differential genes, the preliminary candidate genes were firstly obtained by single-factor analysis according to whether the translocation were balanced or unbalanced. Then, the preliminary candidate genes were analyzed by Lassoof Lars package to screen out the covariant genes. After that, the bio-marker genes were obtained. Subsequently, the training set was randomly selected at a ratio of 0.7, the glm and RandomForest models were constructed, and the AUC was calculated and the ROC curve was plotted in the remaining samples, so as to verify the reliability of the marker genes.
PPI Analysis and Functional Enrichment of Key Marker Gene
In order to further clarify the functions involved in marker genes, the STRING database was used to select proteins with experimental validation for the interactive analysis, and 5 and 10 were used as the number of interaction nodes to mine the core regulatory genes. Moreover, the PPI network of key factor-acting proteins was constructed through the STRING database (https://string-db.org/) and performed functional enrichment analysis. Subsequently, we selected the genes with experimentally verified relationships among the interactions and performed GO and KEGG enrichment analyses.
Results
Differentially Expressed Gene Analysis
The results of PCA showed difference in overall expression between the translocation group and the normal group in the GSE13122 dataset. The expression difference analysis was filtered by P < 0.05 and |log2FC|> 0.58 (Fig. 1A). As a result, a total of 205 DEGs were obtained, of which 108 were significantly highly expressed and 97 were significantly low expressed in the translocation samples (Fig. 1B, C). PCA analysis of balanced translocation chromosomes and unbalanced translocation chromosomes in the samples. The results showed dispersion of samples between groups and clustering of samples within groups. (Fig. 1D). Balance-unbalance differential gene screening was performed with P < 0.05 and |log2FC|> 0.58, resulting in a total of 377 DEGs, of which 218 were significantly highly expressed and 159 genes were significantly low expressed (unbalance) (Fig. 1E, F). To obtain t(11; 22)(q23; q11) biomarker genes, DEGs in the translocation-control group were intersected with the balance-unbalance translocation group, and a total of 50 intersected genes were obtained, which were used as the subsequent screening of t(11; 22)(q23; q11) biomarker marker candidate genes (Fig. 1G).
Key Modules Screening by WGCNA
Fifty candidate biomarker genes were further screened based on WGCNA analysis, and the relationship between the 50 genes and clinical indicators was analyzed simultaneously. The clinical information of the integrated translocation samples included clinical indicators such as gender, age, and disequilibrium, and the expression of the 50 candidate genes in the translocation samples was used for the modular correlation analysis (Fig. 2A). A soft threshold of 12 was determined to enable the adjacent function to better satisfy the scale-free condition and R2 > 0.8 (Fig. 2B). Significant correlations with unbalance were obtained based on the clustering and dynamic tree cutting method for the MEturquoise module (Cor = 0.87, P = 1e-04), MEblue (Cor = − 0.85, P = 2e-04), and MEbrown (Cor = − 0.69, P = 0.009) (Fig. 2C–D). The genes in MEturquoise module (Cor = − 0.79, P = 0.001) and MEblue (Cor = 0.68, P = 0.01) were correlated with age, suggesting that translocations affecting the expression of certain genes that is correlated with age (Fig. 2D). The results of GO revealed that genes in key modules were enriched on exocytic vesicle membrane, synaptic vesicle membrane, glycoprotein complex, dystrophin-associated glycoprotein complex (GO, Cellular Component, P < 0.05) and Malaria (KEGG, P < 0.05), suggesting that the key genes related to ES were mainly associated with exocytic vesicle membrane, synaptic vesicle membrane, glycoprotein complex, and dystrophin-associated glycoprotein complex, as well as with malaria (Fig. 2E).
Prediction Model Construction
The 50 candidate genes were analyzed by univariate Cox regression analysis to determine the balanced translocation samples and unbalanced translocation samples, and the results are shown in Table 1 (only 20 genes are shown). 11 candidate genes were obtained with P < 0.04 (Lasso regression failed under P < 0.05 criterion, probably because of more covariate genes, so the more significant genes were chosen to screen out some covariate genes), and 11 candidate genes were further screened by using the Lasso regression (Fig. 3A). Five marker genes, including ZBED9, RGS20, SGCB, ETV5, and ZAP70, were finally obtained (Fig. 3B). In order to verify the reliability of the five marker genes in the samples, the GML model and random Forest model were constructed with five marker genes, respectively. In addition, the samples with a proportion of 0.7 were used as the training set, and the samples with a proportion of 0.3 were used for validation, and the AUC were obtained to be 0.810, and 0.857, respectively (Figs. 3C, D), suggesting that the reliability of marker genes was significant.
PPI Analysis
The results of PPI network and functional enrichment for 5 marker genes showed that ZAP70 was the key regulatory gene, and those directly interacting with ZAP70 were VAV1, CBL, LCP2, CD247, and CD3E (Fig. 4A). Figure 4B showed that these genes were enriched to pathways such as signal transduction, and immune responses activate cell surface receptor signaling pathway. Cellular components were significantly enriched for α-β T-cell receptor complexes, membrane microdomains, membrane rafts, etc., and molecular functions were significantly enriched for phosphotyrosine residue binding, protein phosphorylation amino acid binding, and phosphoprotein binding, etc. KEGG analysis showed that PD-L1 expression. The PD-1 checkpoint pathway and the Th1 and Th2 cell differentiation signaling pathways were significantly enriched in ES.
Discussions
The t(11;22)(q23;q11) chromosomal balanced translocation is the only recurrent non-Robertsonian translocation in humans (Fu et al. 2018). Carriers generally have a normal phenotype, but male translocation carriers may have infertility, azoospermia, or decreased sperm count and motility (Kara et al. 2014). Female carriers present with a history of recurrent spontaneous abortions and births of malformed children. The main reason for the occurrence of these symptoms is all due to abnormal chromosome segregation, resulting in abnormal chromosome copy number of the embryo (Chen et al. 2022). In translocation carriers, abnormal 3:1 segregation of quadriradial chromosome during meiosis results in the birth of offspring with ES.
ES is a rare chromosomal disorder with a variety of distinctive congenital anomalous phenotypes, such as severe intellectual disability, heart defects, and high arched palate (Ohye et al. 2014). When ES fetuses have no obvious developmental defects, the fetal abnormalities cannot be accurately detected by ultrasound examination and other techniques (Hao et al. 2022). ES patients die in the neonatal period and rarely survive into childhood. The detection of chromosome deletions and duplications remains challenging and places a financial burden on the families of the patients (Hardisty and Vora 2014). To date, because there are no practical, specific, and sensitive diagnostic methods for prenatal screening for ES (Carter et al. 2009; Chen et al. 1996), it is very difficult to know whether a fetus has ES before birth. Although nonspecific ultrasound indicators such as intrauterine growth retardation, anomalies of the posterior cranial fossa, cardiac malformations, and gastrointestinal abnormalities can suggest the diagnosis, some of these children have no obvious anomalies, and therefore cannot be accurately screened for ES (Walfisch et al. 2012).
Although noninvasive prenatal genetic testing (NIPS) detects common autosomal aneuploidies and genetic disorders early in pregnancy by maternal plasma testing (Minear et al. 2015), and 2 ES fetuses were screened by this way in 2020. However, the efficacy of NIPS is uncertain due to low coverage, fetal fractions, and microduplications, and NIPS also cannot be used as a diagnostic method for the specific detection of ES fetuses because of the small chromosomal region of abnormality as well as the double segmental repeats with 11q23 and 22q11 in ES (Luo et al. 2020). Defects of the posterior fossa (62% of fetuses; 13/21) and left diaphragmatic hernia (29% of fetuses; 6/21) are the most frequently reported prenatal findings in ES syndrome. No pattern of specific prenatal ultrasound markers of ES exists (Piwowarczyk et al. 2022). Abnormalities of the posterior fossa are frequent and may be diagnosed as early as in the first trimester of pregnancy. Specific diagnosis can be made only after invasive genetic testing (Walfisch et al. 2012). So it is critically important to screen for the presence of diagnostic markers of ES in early life, which would avoid the negative impacts of having high-frequency miscarriages or delivering malformed fetus who are carriers of the balance translocation.
In this study, 50 DEGs were obtained by transcriptome analysis in the public GEO database and all were associated with imbalance translocation. Five key genes were obtained by univariate Cox regression and Lasso regression analyses, and the PPI network identified ZAP70 as a key regulatory gene for supernumerary der[22]t(11;22). ZAP70 is a protein kinase that regulates spindle assembly and chromosome arrangement in oocytes(Kim et al. 2017). In the present study, we found that ZAP70 was significantly Highly expressed in ES fetuses, so the use of NIPS to detect maternal peripheral blood while specifically detecting the expression of ZAP70. Therefore, the use of NIPS to detect the expression of ZAP70 in maternal peripheral blood specifically at the same time provides a theoretical basis for the noninvasive prenatal detection of ES fetuses. It can also reverse the speculation that the parents who conceived ES fetuses may be t(11;22)(q23;q11) carriers, so as to scientifically guide the carriers to give birth to healthy fetuses and reduce the physical and mental trauma of the carriers. However, this study is based on a small sample size and database analysis, which has not been validated by experiments. In the future, we will further validate the results of this study by using peripheral blood from mothers with confirmed ES fetuses in the clinic, which will provide new perspectives for non-invasive prenatal trisomy detection.
Data Availability
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
References
Carter MT, St Pierre SA, Zackai EH, Emanuel BS, Boycott KM (2009) Phenotypic delineation of Emanuel syndrome (supernumerary derivative 22 syndrome): clinical features of 63 individuals. Am J Med Genet Part A 149a(8):1712–1721. https://doi.org/10.1002/ajmg.a.32957
Chen CP, Liu FF, Jan SW, Yang YC, Lan CC (1996) Prenatal diagnosis of supernumerary der(22)t(11;22) associated with the Dandy-Walker malformation in a fetus. Prenat Diagn 16(12):1137–1140. https://doi.org/10.1002/(sici)1097-0223(199612)16:12%3c1137::Aid-pd979%3e3.0.Co;2-k
Chen X, Yu Z, Chen S, Zhou Y (2022) The most common recurrent reciprocal translocation: T(11; 22)(q23; q11). Andrologia 54(11):e14618. https://doi.org/10.1111/and.14618
Fu W, Zhao W, Jiang H, Li Z, Zhou B, Gu H, Zhu Z, Liu S (2018) Clinical and genetic analysis of 11 patients with balanced translocation of t(11;22)(q23;q11) chromosome. Chongqing Med. https://doi.org/10.3390/cancers15010035
Hao XL, Wu JZ, Fu WT, Zhang R, Zhong SL, Deng YQ, Zhu YX, Ye YC, Fang Q (2022) Prenatal diagnosis of fetuses with Emanuel syndrome: results of ultrasound examination and invasive genetic testing. Prenat Diagn 42(4):469–477. https://doi.org/10.1002/pd.6098
Hardisty EE, Vora NL (2014) Advances in genetic prenatal diagnosis and screening. Curr Opin Pediatr 26(6):634–638. https://doi.org/10.1097/mop.0000000000000145
Kara M, Sen A, Cetin ES, Kargun K (2014) Chromosomal translocation t (10;19) (q11.2;q13.4) in an infertile male. Eurasian J Med 46(3):220–223. https://doi.org/10.5152/eajm.2014.33
Kim HJ, Lee SY, Lee HS, Kim EY, Ko JJ, Lee KA (2017) Zap70 and downstream RanBP2 are required for the exact timing of the meiotic cell cycle in oocytes. Cell Cycle 16(16):1534–1546. https://doi.org/10.1080/15384101.2017.1339847
Li R, Zhu J (2022) Effects of aneuploidy on cell behaviour and function. Nat Rev Mol Cell Biol 23(4):250–265. https://doi.org/10.1038/s41580-021-00436-9
Luo JW, Yang H, Tan ZP, Tu M, Luo H, Yang YF, Xie L (2017) A clinical and molecular analysis of a patient with Emanuel syndrome. Mol Med Rep 15(3):1348–1352. https://doi.org/10.3892/mmr.2017.6107
Luo Y, Lin J, Sun Y, Qian Y, Wang L, Chen M, Dong M, Jin F (2020) Non-invasive prenatal screening for Emanuel syndrome. Mol Cytogenet 13:9. https://doi.org/10.1186/s13039-020-0476-7
Manju HC, Bevinakoppamath S, Bhat D, Prashant A, Kadandale JS, Sairam P (2022) Supernumerary derivative 22 chromosome resulting from novel constitutional non-robertsonian translocation: t(20;22)-case report. Mol Cytogenet 15(1):14. https://doi.org/10.1186/s13039-022-00591-4
Minear MA, Lewis C, Pradhan S, Chandrasekharan S (2015) Global perspectives on clinical adoption of NIPT. Prenat Diagn 35(10):959–967. https://doi.org/10.1002/pd.4637
Ohye T, Inagaki H, Kato T, Tsutsumi M, Kurahashi H (2014) Prevalence of Emanuel syndrome: theoretical frequency and surveillance result. Pediatr Int 56(4):462–466. https://doi.org/10.1111/ped.12437
Piwowarczyk P, Massalska D, Obodzińska I, Gawlik Zawiślak S, Bijok J, Kucińska-Chahwan A, Roszkowski T (2022) Prenatal diagnosis of Emanuel syndrome - case series and review of the literature. J Obstet Gynaecol 42(7):2615–2620. https://doi.org/10.1080/01443615.2022.2114331
Walfisch A, Mills KE, Chodirker BN, Berger H (2012) Prenatal screening characteristics in Emanuel syndrome: a case series and review of the literature. Arch Gynecol Obstet 286(2):299–302. https://doi.org/10.1007/s00404-012-2288-4
Zackai EH, Emanuel BS (1980) Site-specific reciprocal translocation, t(11;22) (q23;q11), in several unrelated families with 3:1 meiotic disjunction. Am J Med Genet 7(4):507–521. https://doi.org/10.1002/ajmg.1320070412
Acknowledgements
Not Applicable.
Funding
This work was supported by Yunnan Provincial Science and Technology Department-Kunming Medical University Joint General Program on Applied Basic Research 2019FE001 (-259).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. The first draft of the manuscript were performed and the manuscript was revised by Jing Hu. Data collection and analysis were performed by Mengyue Wang. The language of the paper was revised by Ruiyao Xiang. All authors approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declared that they have no competing interests.
Consent for Publication
All the authors have declared their consent for publication.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hu, J., Wang, M. & Xiang, R. ZAP70: A Key Gene Identified by Differential Expression Analysis for Early Diagnosis of Fetuses with Emanuel Syndrome. Biochem Genet (2024). https://doi.org/10.1007/s10528-024-10808-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10528-024-10808-3