Three nervous system-specific expressed genes are potential biomarkers for the diagnosis of sporadic amyotrophic lateral sclerosis through a bioinformatic analysis

Liao, Yifu; Cai, Haiping; Luo, Feifei; Li, Dongcheng; Li, Hao; Liao, Geng; Duan, Jinhai; Xu, Renshi; Zhang, Xiong

doi:10.1186/s12920-023-01441-x

Three nervous system-specific expressed genes are potential biomarkers for the diagnosis of sporadic amyotrophic lateral sclerosis through a bioinformatic analysis

Research
Open access
Published: 27 January 2023

Volume 16, article number 15, (2023)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Genomics Aims and scope Submit manuscript

Three nervous system-specific expressed genes are potential biomarkers for the diagnosis of sporadic amyotrophic lateral sclerosis through a bioinformatic analysis

Download PDF

Yifu Liao¹^na1,
Haiping Cai²^na1,
Feifei Luo³^na1,
Dongcheng Li⁴,
Hao Li⁷,
Geng Liao⁷,
Jinhai Duan⁵,
Renshi Xu⁶ &
…
Xiong Zhang⁷

2332 Accesses
4 Citations
10 Altmetric
1 Mention
Explore all metrics

Abstract

Background

Amyotrophic lateral sclerosis (ALS) is the most common neurodegenerative disease in adults. However, ALS, especially sporadic ALS (sALS), is difficult to diagnose due to the lack of biomarkers.

Results

We used the bioinformatics technology to find the potential biomarker and we found that two hundred seventy-four DEGs were identified and enrichment analysis showed DEGs were involved in nervous system activity, like axon_guidance and the neurotrophin_signaling_pathway. Five nervous system-specific expressed hub genes were further validated by three GEO datasets. APP, LRRK2, and PSEN1 might be potential diagnostic and prognostic biomarkers of sALS, and NEAT1-miR-373-3p/miR-302c-3p/miR-372-3p-APP, circ_0000002-miR-302d-3p/miR-373-3p-APP and XIST-miR-9-5p/miR-30e-5p/miR-671-5p might be potential ceRNA regulatory pathways. APP SNP analysis showed subjects harboring the minor G allele of rs463946, minor G allele of rs466433 and minor C allele of rs364048 had an increased risk of sALS development.

Conclusions

Our results identified three nervous system-specific expressed hub genes that might be diagnostic and prognostic markers of sALS and APP might be a genetic susceptibility factor contributing to sALS development.

View this article's peer review reports

Identification of potentially functional modules and diagnostic genes related to amyotrophic lateral sclerosis based on the WGCNA and LASSO algorithms

Article Open access 22 November 2022

Identifying Candidate Genes Associated with Sporadic Amyotrophic Lateral Sclerosis via Integrative Analysis of Transcriptome-Wide Association Study and Messenger RNA Expression Profile

Article 17 January 2022

Polymorphism of rs3737597 in DISC1 Gene on Chromosome 1q42.2 in sALS Patients: a Chinese Han Population Case-Control Study

Article 07 April 2016

Background

Amyotrophic lateral sclerosis (ALS) is a progressive and aggravated neurodegenerative disease and the most common motoneuron disease in adults. ALS is characterized by the progressive degeneration of upper and lower motor neurons in the cerebral cortex, brain stem and anterior horn of the spinal cord, which leads to amyotrophy of the limbs and trunk, and eventually, patients die because of respiratory failure caused by predominant diaphragm dysfunction [1]. The clinical characteristics of ALS patients are the coexistence of symptoms and signs of upper and lower motor neuron damage, featuring as different combinations of muscle weakness, amyotrophy and pyramid signs [2]. ALS has a prevalence of 1–2 per 100,000 people worldwide, and the average age of onset is approximately 55 years [3]. The clinical heterogeneity among ALS patients leads to difficulty in diagnosis, and currently, definitive diagnostic tests are lacking [4]. In addition, effective treatments for ALS are not available since riluzole and edaravone, two FDA-approved drugs for ALS treatment, only delay the progression of ALS in some patients, and therapeutic interventions are still based on symptom management and respiratory support [5, 6]. The average survival time of ALS patients is 3–5 years [7]. Therefore, a detailed understanding of ALS development could help with more effective intervention in the early stages of the disease. Based on the family history, patients are divided into familial ALS (fALS) and sporadic ALS (sALS) [8]. Genetic disorders are considered an important cause of ALS, and several known genes and/or loci have been reported to affect the development of fALS, such as SOD1, FUS and C9orf72 [9,10,11]. Because several genes/loci involved in sALS development have been identified, sALS is considered to have a genetic basis and more complex pathogenesis [12]. The biomarkers investigation of sALS has been ongoing for many years, however, effective biomarkers and exact genetic mechanism of sALS still need further studied [13].

Transcriptomics and microarray analyses are important techniques in disease research and have been widely used to identify novel biomarkers and improve the diagnosis and treatment of various diseases, such as tumors and neurodegenerative diseases [14, 15]. Competing endogenous RNAs (ceRNAs) can competitively bind miRNAs through microRNA response elements (MREs) to regulate the expression level of each other, forming a large-scale gene expression regulatory network in the transcriptome [16, 17]. The ceRNA regulatory network plays an important role in the occurrence and development of neurodegenerative diseases [18]. Therefore, it is possible to explore potentially key genes and pathway networks closely related to disease development through a combination of microarray and bioinformatical technologies.

In the present study, we performed a bioinformatics analysis using publicly available gene expression datasets to search for sALS susceptibility genes. In addition, we conducted a case–control study including 30 sALS patients and 30 nonneurological controls in the Chinese Han population. We detected single nucleotide polymorphisms (SNPs) of the susceptibility gene risk fragments to verify the relationship between these SNPs of susceptibility genes and sALS. We aimed to identify some diagnostic biomarkers in the early stage of sALS development.

Methods

Gene expression data acquisition

The microarray data, which were samples (nervous tissues, muscular tissues or whole blood) of sALS patients, were obtained from the Gene Expression Omnibus (GEO) database. Six GEO datasets and eight GPL platforms were included in our study, including GSE833, GSE26276, GSE4595, GSE26927, GSE39644, GSE112681, GPL80 (Affymetrix), GPL6244 (Affymetrix), GPL1708 (Agilent-012391), GPL6255 (Illumina humanRef-8), GPL10558 (Illumina HumanHT-12), GPL15846/GPL15847 (NanoString Technologies) and GPL6947 (Illumina HumanHT-12). We divided these datasets into the test set and the validation set. The test set included GSE833 and GSE26276, including 11 samples (sALS, nonneurological control, and fALS) and 9 samples (sALS, multifocal motor neuropathy [MND], and nonneurological control), respectively. The mRNA expression data were acquired from tissue specimens of the spinal cord and skeletal muscle. The validation set included GSE4595, GSE26927, GSE39644 and GSE112681, which included 20 samples (sALS and nonneurological control), 118 samples (Alzheimer’s disease, sALS, fALS, Huntington’s disease, multiple sclerosis, Parkinson’s disease, nonneurological control and schizophrenia), 48 samples (sALS, fALS, nonneurological control, and multiple sclerosis), and 1117 samples (sALS, fALS and nonneurological control), respectively. The mRNA expression data were acquired from tissue specimens of the motor cortex, spinal cord and whole blood. An expression matrix of nonneurological controls and sALS patients was acquired online. In total, the data of 681 nonneurological controls and 436 ALS patients were analyzed in our study (Table 1). The accessed date of the database was March. 3, 2022.

Table 1 Information of selected GEO datasets

Full size table

Data processing and identification of differentially expressed genes (DEGs)

The raw data downloaded from the GEO database were normalized by the Robust Multiarray Average (RMA) method using the R software (Version 4.1.0) affy package and further transformed into fragments per kilobase of sequence per million mapped reads (FPKM) values for the analysis. The gene expression analysis and analysis of intersample differences were conducted using the limma package. The significance of False Discovery Rate (FDR) q < 0.05. The screening criteria were as follows: Log2 (fold change) > 1.5 or < − 1.5 and an adjusted p value ≤ 0.05.

DEG visualization analysis

Heatmaps and volcano plots were used for the DEG visualization analysis. Briefly, a heatmap was generated with the pheatmap package, and a volcano plot was generated with the ggpubr package by R software.

Tissue/organ-specific gene expression determination

We determined the tissue/organ-specific expressed genes of the DEGs using the online tool BioGPS (http://biogps.org/). Briefly, the tissue distribution of the DEGs was analyzed, and the screening criteria for tissues/organ-specific genes were as follows: (1) the gene expression location matched a single organ system, and its expression value was greater than 10 times the median value, and (2) the second most abundantly expressed tissues were no more than one-third of the most expressed tissues.

Performance of the enrichment analysis

A gene set enrichment analysis (GSEA) is a computational method that can be used to determine the distribution trend of genes of interest or concordant differences between two biological states. For the GSEA, we downloaded the GSEA software (version 3.0) and c2: GO gene sets (c2.cp.kegg.v7.4symbols.gmt) to evaluate the pathways and molecular mechanisms of the DEGs. For the GO (Gene Ontology) enrichment analysis, we used the GO annotation of genes in the R software org.Hs.eg.db (version 3.1.0) package and mapped the DEGs onto the background set. The gene enrichment results were analyzed by R software by setting the minimum gene set as 5 and the maximum gene set as 5000. A Q value < 0.05 and an FDR < 0.25 were considered statistically significant. The online database KOBAS 3.0 (http://kobas.cbi.pku.edu.cn/kobas3) was used for the KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis of the DEGs. The use of KEGG data were approved by the Kanehisa laboratory [19].

PPI network construction

All DEGs were analyzed using the online tool STRING (https://string-db.org/), and a PPI network was constructed under a filter condition of a combined score > 0.4. The interaction information of all DEGs was downloaded and further modified by Cytoscaple software (v3.9.1) for better visualization. The significant gene clusters and related cluster scores were identified by Minimal Common Oncology Data Elements (MCODE) with the following filter criteria: code score cutoff = 0.2; degree cutoff = 2; k core = 2; and maxdepth = 100. CytoHubba is commonly used for significant gene identification (hub genes), and the top 14 hub genes in the DEG network were determined by five algorithms, namely, maximal clique centrality (MCC); degree; density of maximum neighborhood component (DMNC); maximum neighborhood component (MNC); and clustering coefficient. The final hub genes were determined by intersecting all results.

Prediction of the target miRNAs of the DEGs

The target miRNAs of the hub genes were predicted by the following five online miRNA databases: miRWalk, miRDB, TargetScan, DIANA-micro and miRcode. MiRNAs that were found in at least four databases were selected as the target miRNAs, and a visual messenger RNA (mRNA)–miRNA coexpression network was constructed according to the targeting relationship between the mRNAs and miRNAs using Cytoscape software.

CeRNA network construction

LncRNAs and circRNAs that interacted with the selected miRNAs were predicted by the online database StarBase (version 3.0) (http://starbase.sysu.edu.cn/index.php). The mRNA–miRNA–lncRNA/circRNA (ncRNAs) interaction was further used for the ceRNA network construction by Cytoscape software.

Genomic DNA extraction and polymerase chain reaction (PCR)

Samples were collected from sALS patients and nonneurological controls, and the relevant genomic DNA was extracted using a QIAamp DNA Micro Kit (56304, Qiagen, Germany) according to the manufacturer’s instructions. Genomic DNA was further used to perform PCR according to the manufacturer’s instructions. The amplification conditions were as follows: 95 °C predenaturation for 3 min, 94 °C denaturation for 20 s, 58 °C annealing for 20 s, and 72 °C extension for 40 s for 35 cycles. The sequences of the primers are shown in Additional file 1: Table S1.

Statistical analysis

The statistical analysis was performed using the R software (Version 4.1.0). The continuous variables were compared between the groups using Student’s t-test and presented as the mean ± standard deviation (S.D.) or standard error (S.E.M.). The Kaplan–Meier method was used for the survival analysis, and the difference between the groups was analyzed by the log-rank test. ROC curves were generated using the R software pROC package (version 1.17.0.1), and the AUCs were determined accordingly. A p value < 0.05 was considered statistically significant. *p < 0.05; **p < 0.01; ***p < 0.001.

Results

Identification of DEGs

The workflow of this study was shown in Fig. 1. Then, two GEO datasets, GSE833 and GSE26276, in our test set were selected to identify the DEGs. GSE833 included 4 nonneurological control samples and 5 sALS samples, while GSE26276 included 3 nonneurological control samples and 3 sALS samples. A heatmap and volcano plot analysis of two datasets were used to visualize the DEGs (Fig. 2A, B). In the GSE833 dataset, 274 DEGs were identified in the sALS group compared with the nonneurological control samples, including 117 upregulated genes and 157 downregulated genes. In the GSE26276 dataset, 203 DEGs were identified in the sALS group compared with the nonneurological control samples, including 142 upregulated genes and 61 downregulated genes. The Venn plot showed that 8 DEGs were found in both datasets, and we merged the DEGs in the two datasets into a new expression matrix for further analysis (Fig. 2C).

Analysis of DEG expression in tissues/organs

According to the localization in the gene expression analysis, 46 tissue/organ-specific expressed genes were identified by BioGPS (Table 2). The results showed that most of these DEGs were specifically expressed in the nervous system (24/46, 52.17%). The second tissue/organ-specific expressed system was hematologic/immune cells, including 9 DEGs (9/46, 19.57%), followed by the digestive system (4/46, 9.70%), circulatory system (2/46, 4.35%), endocrine system (2/46, 4.35%), genital system (2/46, 4.35%), respiratory system (1/46, 2.17%), placental system (1/46, 2.17%) and other systems (1/46, 2.17%).

Table 2 Tissues/organ-specific DEGs expression

Full size table

DEG enrichment analysis

Functional and pathway enrichment analyses of the DEGs were performed by GSEA software, the R software org.Hs.eg.db package and the Online database KOBAS 3.0. First, we performed a GSEA by uploading the expression profiles of GSE833 and GSE26276, and we used the c2: GO gene set to investigate the GO enrichment of gene expression at the overall level. The screening criteria for the significantly enriched gene set were a Q value < 0.05 and an FDR < 0.25. We found that the most enriched gene sets in GSE833 and GSE26276 were related to axon_guidance, neurotrophin_signaling_pathway, neuroactive_ligand_receptor_interaction, adherens_junction, huntingtons_disease, amyotrophic_lateral_sclerosis_als and parkinsons_disease (Fig. 3A–D).

Second, we performed GO and KEGG pathway analyses of the DEGs using the R software org.Hs.eg.db package and KOBAS 3.0, respectively. The results of the GO enrichment analysis of the DEGs showed that processes of nervous system activity, such as chemical synaptic transmission, neural crest cell development and neurogenesis, were significantly related to sALS, and other biological processes, such as response to endogenous stimulus, anterograde transsynaptic signaling and excitatory postsynaptic potential, were also involved. The top 10 biological processes were selected based on a Q value < 0.05 and an FDR < 0.25 and are shown in Fig. 3E. The KEGG pathway analysis revealed that the DEGs were mainly enriched in neuroactive ligand receptor interaction. In addition, the PI3K-Akt signaling pathway, cellular senescence, the apelin signaling pathway, fluid shear stress and atherosclerosis were enriched in the sALS samples (Fig. 3F).

PPI network analysis and hub gene identification

The interaction network of proteins coded by the DEGs between the nonneurological controls and sALS patients comprising 161 nodes and 617 edges was evaluated by STRING and visualized by Cytoscape (Fig. 4A). We further used the MCODE plugin to identify the gene cluster modules according to the filter criteria, and the results showed that four modules were identified (Fig. 4B–E). Cluster 1 had 14 nodes and 60 edges with a score of 4.615. Cluster 2 had the second highest cluster score (score: 3.571, 15 nodes and 50 edges), followed by Cluster 3 (score: 3.5, 9 nodes and 28 edges) and Cluster 4 (score: 3.5, 5 nodes and 14 edges). To identify the hub genes in the interaction network, the CytoHubba plugin was used, and the results showed that 14 hub genes were identified by five algorithms of cytoHubba, including Clustering Coefficient, Degree, MNC, MCC and DMNC (Table 3). These DEGs are the core genes in the PPI network, implying that they play an important role in the pathogenesis of sALS. Since the GO, KEGG and GSEA enrichment analyses revealed an important function of the DEGs in the biological process of nervous system activity, we further intersected 14 hub genes and 24 nervous system-specific expressed genes and identified five nervous system-specific expressed hub genes, including APP, AKT1, LRRK2, PSEN1, and SLCO1A2 (Table 3, in bold).

Table 3 Hub genes determined using cytoscape plug-in cytoHubba by five algorithms

Full size table

Construction of mRNA–miRNA coexpression networks

MiRNAs have been reported to regulate gene levels by binding the 5′ or 3′ UTR of the target mRNA and play an important role in neurological disorder development. Therefore, we predicted the target miRNAs of 5 nervous system-specific expressed hub genes using five online miRNA databases and identified 87 target miRNAs and 94 mRNA–miRNA pairs. Finally, we constructed a coexpression network of mRNAs and miRNAs by Cytoscape, which comprised 92 nodes and 94 edges (Fig. 5).

Verification of five nervous system-specific expressed hub genes in three GEO datasets

Three GEO datasets, namely, GSE4595, GSE26927 and GSE39644, including 29 nonneurological control samples and 31 sALS patient samples, were used to verify the expression levels of 5 nervous system-specific expressed hub genes. The R software ggplot2 package was used to generate a split violin plot, and the differences were analyzed by Student’s t-test. As expected, we found that the mRNA expression levels of the 5 nervous system-specific expressed hub genes in the sALS group were significantly higher than those in the nonneurological control groups (Fig. 6A–C, p < 0.01), implying that the 5 hub genes play an important role in sALS development. In addition, we performed GO, KEGG and GSEA enrichment analyses based on five nervous system-specific expressed hub genes. As expected, the enriched gene sets of the DEGs were related to the neurotrophin signaling pathway, glial cell activation, neuron death, axon guidance, regulation of the actin cytoskeleton, etc., which are correlated with sALS progression (Additional file 1: Fig. S1A–G).

ROC curve and prognosis prediction ability of 5 nervous system-specific expressed hub genes in sALS samples

Since sALS patients are difficult to diagnose in the early stage because of clinical heterogeneity, it is essential to find diagnostic biomarkers to distinguish individuals with sALS from nonneurological people. We used ROC curves to evaluate the diagnostic ability of the 5 hub genes in predicting sALS. Specifically, the expression profiles of 5 hub genes in the GSE112681 dataset of nonneurological control samples and sALS samples were analyzed by the R software pROC package. The ROC curves of these hub genes were drawn, and the area under the curve (AUC) was calculated. The results showed that all hub genes have strong diagnostic value in sALS samples. Our results showed that these 5 hub genes could be biomarkers for sALS diagnosis. LRRK2 had the highest diagnostic ability (AUC: 0.927), while the AUC of the other genes was 0.832 for AKT1, 0.811 for APP, 0.832 for PSEN1, and 0.810 for SLCO1A2 (Fig. 7A–E). Additionally, we evaluated the ability of these genes to predict the prognosis of sALS patients by a survival analysis. A survival curve of the sALS patients based on the expression of the 5 hub genes was drawn by the Kaplan–Meier method. The results showed that sALS patients with high expression levels of AKT1, APP, LRRK2, PSEN1, or SLCO1A2 had a poorer prognosis than those with a low expression of these hub genes (Fig. 7F–J). Our results imply that AKT1, APP, LRRK2, PSEN1 and SLCO1A2 may be biomarkers for the diagnosis and prognosis prediction of sALS patients based on our present samples.

Construction of mRNA–miRNA–ncRNA coexpression networks

MiRNAs can cause posttranscriptional gene silencing by binding mRNAs, while other ncRNAs, such as lncRNAs and circRNAs, can regulate gene expression by competitively binding miRNAs, which is called the ceRNA mechanism. CeRNAs can disable microRNAs by binding microRNA response elements (MRS), which reveal mRNA–miRNA–ncRNA interaction coexpression networks. The existence of microRNA regulatory pathways is of great biological significance. We predicted the circRNAs and lncRNAs that interacted with selected miRNAs by the online database StarBase 3.0. The screening criteria were as follows: (1) mammalian; (2) human h19 genome; (3) and strict stringency (≥ 5) of CLIP-Data with degradome data. NcRNAs present in most selected miRNA predictions were chosen as the predicted circRNAs and lncRNAs. In the predicted results of the StarBase database, a transcript has multiple circRNA shearing sites; thus, the circRNA with the largest number of samples and the highest score in the circBase database was selected as the target circRNA. Finally, we obtained 7 lncRNAs and 5 circRNAs from APP-targeted miRNAs, 1 lncRNA and 16 circRNAs from LRRK2-targeted miRNAs, and 1 lncRNA and 14 circRNAs from PSEN1-targeted miRNAs. Three ceRNA networks were constructed based on the prediction results and visualized by Cytoscape (Fig. 8A–C). Subsequently, we conducted a literature search and selected seven miRNAs and three ncRNAs that were reported to affect neurodegeneration disorder development for further study. We propose that NEAT1-miR-373-3p/miR-302c-3p/miR-372-3p-APP, circ_0000002-miR-302d-3p/miR-373-3p-APP and XIST-miR-9-5p/miR-30e-5p/miR-671-5p might be potential ceRNA regulatory pathways that regulate sALS progression (Fig. 8D–F).

Identification of the genetic association between 3 SNPs and sALS

To date, several studies have confirmed that AD (Alzheimer’s disease), PD (Parkinson’s disease), FTD (frontotemporal dementia), PSP (progressive supranuclear palsy) and ALS have similar genetic bases. SNPs of APP (rs463946, rs466433, and rs364048) have been found to be closely related to the incidence of AD in the Chinese Han population. Therefore, we hypothesized that these three SNPs of APP are involved in sALS development, and the locations of the three SNPs of APP are shown in Fig. 9 A-C. All 3 SNPs were intronic polymorphisms, and the minor alleles of rs463946 (OR = 3.000, 95% CI 1.164–7.732, p = 0.014), rs466433 (OR = 3.000, 95% CI 1.164–7.732, p = 0.014), and rs364048 (OR = 3.000, 95% CI 1.164–7.732, p = 0.014) significantly increased the risk of sALS development, suggesting that these three polymorphisms might represent genetic susceptibility factors. Subjects harboring the minor G allele (GG + CG) of rs463946 (p = 0.026), minor G allele (GG + AG) of rs466433 (p = 0.026) and minor C allele (CC + CT) of rs364048 (p = 0.026) showed an increased risk of sALS development compared with those with the other genotypes (Table 4).

Table 4 Three SNPs shown the significant at p < 0.05 in the study

Full size table

Discussion

Amyotrophic lateral sclerosis is a highly fatal neurodegenerative disease, and currently, effective treatment is lacking. The cause of ALS currently remains unknown, although some people have familial disease, which is associated with mutations in genes that perform a wide range of functions [20]. Research investigating the pathogenic factors of sALS is still in a relatively underdeveloped state [21]. Based on the current predicament that sALS is difficult to diagnose due to the lack of biomarkers, new diagnostic biomarkers and prognostic biomarkers of sALS are urgently needed.

In this study, we identified 3 nervous system-specific expressed hub genes, which play an important role in sALS progression. According to the current understanding of the pathogenesis of sALS, axonopathy, aberrant RNA metabolism, nucleocytoplasmic and endosomal transport, oligodendrocyte degeneration, neuroinflammation and mitochondrial dysfunction were reported to be involved in sALS development [22]. The results of our GO, KEGG pathway and GSEA analyses of hub genes functions are consistent with the pathogenesis of sALS, indicating that elucidating the functions of these DEGs in sALS might help us gain a deeper understanding of the pathophysiology of sALS.

The expression levels of three nervous system-specific expressed genes (APP, LRRK2, PSEN1) were validated in three GEO datasets. We further evaluated whether these genes could be used as diagnostic biomarkers of sALS. The ROC curve analysis showed that all these genes have a high diagnostic ability for sALS. Survival analysis confirmed that a high expression of these genes predicted a poor prognosis of sALS, indicating that these genes could also be prognostic biomarkers of sALS.

Thus far, studies have shown that AD, PD, FTD, PSP and ALS have similar genetic bases [23]. With the development of research concerning neurodegenerative diseases, many neurodegenerative diseases are believed to exhibit varying degrees of genetic and pathological overlap, indicating that genes or proteins associated with the pathogenesis of one neurodegeneration disease may also be associated with other diseases. For example, the deletion of the SOD1 gene promotes APP protein oligomerization and memory loss in an AD mouse model, suggesting that SOD1 may be involved in the regulation of APP metabolism in AD patients [24]. The amyloidosis process of APP in the central nervous system may lead to the degeneration of motor neurons. Bryson J B et al. reported that after the knockdown of the APP gene in a SOD1 mouse model, innerve, motor function, viable motor neurons and other disease parameters of the mice were significantly recovered, indicating that APP was involved in the pathogenesis of SOD1-mediated ALS [25]. There are more and more studies on early diagnosis of ALS using plasma/serum or cerebrospinal fluid (CSF). Reports showed that soluble APP fragments and Aβ peptides levels are altered in plasma/serum or CSF form ALS patients, which could serve as the biomarkers of ALS Pathophysiology [26]. Another report also showed that CSF Aβ₄₂ increased remarkably in the ALS groups [27]. Our results showed that APP was highly expressed in the sALS samples, which is consistent with reports showing that APP was found in the spinal cord, skin and muscle of ALS patients [28, 29].

Since neurodegenerative diseases, such as AD, PD, FTD, PSP and ALS, share a similar genetic basis and SNPs of APP were reported to be involved in AD progression [30, 31], we investigated the effect of SNPs of APP in sALS patients. The results confirmed that three minor alleles, rs463946, rs466433 and rs364048, increased the risk of sALS development. A possible reason is that the metabolic process of the APP protein is affected by SNPs of APP, which reduces the normal decomposition of the APP protein and leads to the occurrence of sALS. However, more investigations are needed to confirm this hypothesis.

CeRNA networks play an important role in neurodegenerative disease progression. LncRNAs, miRNAs and circRNAs are important component factors in ceRNA networks. In our study, the target miRNAs, target lncRNAs and circRNAs of these miRNAs were predicted for APP, LRRK2 and PSEN1. We identified three important potential RNA regulatory pathways in the pathogenesis of sALS, including NEAT1-miR-373-3p/miR-302c-3p/miR-372-3p-APP, XIST-miR-9-5p/miR-30e-5p/miR-671-5p-LRRK2 and circ_0000002-miR-302d-3p/miR-373-3p-APP.

Our study has some shortcomings. The sample size for testing and validation was relatively small, and the expression of the three hub genes in sALS samples should be further examined using fresh tissue samples. In addition, the diagnostic ability and prognostic ability of the three hub genes of sALS should be further confirmed in prospective cohort studies.

Conclusions

Our study identified 3 nervous system-specific expressed hub genes, APP, LRRK2 and PSEN1, as potential diagnostic and prognostic biomarkers of sALS, and our results provide new insight into the pathogenesis of sALS at the transcriptome level. We also identified three potential RNA regulatory pathways that affect sALS progression by ceRNA network construction.

Availability of data and materials

The GEO dataset data used in this study are available in the GEO database (https://www.ncbi.nlm.nih.gov/geo/) with the following data accession identifiers: GSE833, GSE26276, GSE4595, GSE26927, and GSE39644 and GSE112681.

Abbreviations

ALS:: Amyotrophic lateral sclerosis
sALS:: Sporadic amyotrophic lateral sclerosis
GEO:: Gene expression omnibus
DEGs:: Differentially expressed genes
GO:: Gene ontology
GSEA:: Gene set enrichment analysis
KEGG:: Kyoto encyclopedia of genes and genomes
PPI:: Protein–protein interaction
MNC:: Maximum neighborhood component
MCC:: Maximal clique centrality
DMNC:: Density of maximum neighborhood component
ceRNA:: Competitive endogenous RNA
ROC:: Receiver operating characteristic
AUC:: Area under the curve
ncRNAs:: Noncoding RNAs

References

Le Bras A. New insights into C9ORF72-ALS/FTD using C. elegans. Lab Anim. 2022;51(1):8.
Article Google Scholar
Ustyantseva EI, Medvedev SP, Zakian SM. Studying ALS: current approaches, effect on potential treatment strategy. Adv Exp Med Biol. 2020;1241:195–217.
Article CAS Google Scholar
Hardiman O, Al-Chalabi A, Chio A, Corr EM, Logroscino G, Robberecht W, et al. Amyotrophic lateral sclerosis. Nat Rev Dis Primers. 2017;3:17071.
Article Google Scholar
Rusina R, Vandenberghe R, Bruffaerts R. Cognitive and behavioral manifestations in ALS: beyond motor system involvement. Diagnostics. 2021;11(4):624.
Article Google Scholar
Dharmadasa T, Kiernan MC. Riluzole, disease stage and survival in ALS. Lancet Neurol. 2018;17(5):385–6.
Article Google Scholar
Park JM, Kim SY, Park D, Park JS. Effect of edaravone therapy in Korean amyotrophic lateral sclerosis (ALS) patients. Neurol Sci Off J Ital Neurol Soc Ital Soc Clin Neurophysiol. 2020;41(1):119–23.
Google Scholar
Chua JP, De Calbiac H, Kabashi E, Barmada SJ. Autophagy and ALS: mechanistic insights and therapeutic implications. Autophagy. 2022;18(2):254–82.
Article CAS Google Scholar
Zou ZY, Zhou ZR, Che CH, Liu CY, He RL, Huang HP. Genetic epidemiology of amyotrophic lateral sclerosis: a systematic review and meta-analysis. J Neurol Neurosurg Psychiatry. 2017;88(7):540–9.
Article Google Scholar
Miller T, Cudkowicz M, Shaw PJ, Andersen PM, Atassi N, Bucelli RC, et al. Phase 1–2 trial of antisense oligonucleotide tofersen for SOD1 ALS. N Engl J Med. 2020;383(2):109–19.
Article CAS Google Scholar
Lin YC, Kumar MS, Ramesh N, Anderson EN, Nguyen AT, Kim B, et al. Interactions between ALS-linked FUS and nucleoporins are associated with defects in the nucleocytoplasmic transport pathway. Nat Neurosci. 2021;24(8):1077–88.
Article CAS Google Scholar
Balendra R, Isaacs AM. C9orf72-mediated ALS and FTD: multiple pathways to disease. Nat Rev Neurol. 2018;14(9):544–58.
Article CAS Google Scholar
Zhang J, Qiu W, Hu F, Zhang X, Deng Y, Nie H, et al. The rs2619566, rs10260404, and rs79609816 polymorphisms are associated with sporadic amyotrophic lateral sclerosis in individuals of Han Ancestry from Mainland China. Front Genet. 2021;12:679204.
Article CAS Google Scholar
Wilson KM, Katona E, Glaria I, Carcole M, Swift IJ, Sogorb-Esteve A, et al. Development of a sensitive trial-ready poly(GP) CSF biomarker assay for C9orf72-associated frontotemporal dementia and amyotrophic lateral sclerosis. J Neurol Neurosurg Psychiatry. 2022;93:761–71.
Article Google Scholar
Moreno-Garcia L, Lopez-Royo T, Calvo AC, Toivonen JM, de la Torre M, Moreno-Martinez L, et al. Competing endogenous RNA networks as biomarkers in neurodegenerative diseases. Int J Mol Sci. 2020;21(24):9582.
Article CAS Google Scholar
Nazarov PV, Kreis S. Integrative approaches for analysis of mRNA and microRNA high-throughput data. Comput Struct Biotechnol J. 2021;19:1154–62.
Article CAS Google Scholar
Yang N, Liu K, Yang M, Gao X. ceRNAs in cancer: mechanism and functions in a comprehensive regulatory network. J Oncol. 2021;2021:4279039.
Article Google Scholar
Tang X, Ren H, Guo M, Qian J, Yang Y, Gu C. Review on circular RNAs and new insights into their roles in cancer. Comput Struct Biotechnol J. 2021;19:910–28.
Article CAS Google Scholar
Tehrani SS, Ebrahimi R, Al EAA, Panahi G, Meshkani R, Younesi S, et al. Competing endogenous RNAs (CeRNAs): novel network in neurological disorders. Curr Med Chem. 2021;28(29):5983–6010.
Article CAS Google Scholar
Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2022;51:D587–92.
Article Google Scholar
Masrori P, Van Damme P. Amyotrophic lateral sclerosis: a clinical review. Eur J Neurol. 2020;27(10):1918–29.
Article CAS Google Scholar
Debska-Vielhaber G, Miller I, Peeva V, Zuschratter W, Walczak J, Schreiber S, et al. Impairment of mitochondrial oxidative phosphorylation in skin fibroblasts of SALS and FALS patients is rescued by in vitro treatment with ROS scavengers. Exp Neurol. 2021;339:113620.
Article CAS Google Scholar
Swash M. Chitinases, neuroinflammation and biomarkers in ALS. J Neurol Neurosurg Psychiatry. 2020;91(4):338.
Article Google Scholar
Berson A, Nativio R, Berger SL, Bonini NM. Epigenetic regulation in neurodegenerative diseases. Trends Neurosci. 2018;41(9):587–98.
Article CAS Google Scholar
Dent P, Booth L, Roberts JL, Poklepovic A, Cridebring D, Reiman EM. Inhibition of heat shock proteins increases autophagosome formation, and reduces the expression of APP, Tau, SOD1 G93A and TDP-43. Aging. 2021;13(13):17097–117.
Article CAS Google Scholar
Bryson JB, Hobbs C, Parsons MJ, Bosch KD, Pandraud A, Walsh FS, et al. Amyloid precursor protein (APP) contributes to pathology in the SOD1(G93A) mouse model of amyotrophic lateral sclerosis. Hum Mol Genet. 2012;21(17):3871–82.
Article CAS Google Scholar
Stanga S, Brambilla L, Tasiaux B, Dang AH, Ivanoiu A, Octave JN, et al. A role for GDNF and soluble APP as biomarkers of amyotrophic lateral sclerosis pathophysiology. Front Neurol. 2018;9:384.
Article Google Scholar
Ye LQ, Li XY, Zhang YB, Cheng HR, Ma Y, Chen DF, et al. The discriminative capacity of CSF beta-amyloid 42 and Tau in neurodegenerative diseases in the Chinese population. J Neurol Sci. 2020;412:116756.
Article CAS Google Scholar
Islam MT. Oxidative stress and mitochondrial dysfunction-linked neurodegenerative disorders. Neurol Res. 2017;39(1):73–82.
Article CAS Google Scholar
Spadoni O, Crestini A, Piscopo P, Malvezzi-Campeggi L, Carunchio I, Pieri M, et al. Gene expression profiles of APP and BACE1 in Tg SOD1G93A cortical cells. Cell Mol Neurobiol. 2009;29(5):635–41.
Article CAS Google Scholar
Yang Y, Tapias V, Acosta D, Xu H, Chen H, Bhawal R, et al. Altered succinylation of mitochondrial proteins, APP and tau in Alzheimer’s disease. Nat Commun. 2022;13(1):159.
Article CAS Google Scholar
Wang R, Chopra N, Nho K, Maloney B, Obukhov AG, Nelson PT, et al. Human microRNA (miR-20b-5p) modulates Alzheimer’s disease pathways and neuronal function, and a specific polymorphism close to the MIR20B gene influences Alzheimer’s biomarkers. Mol Psychiatry. 2022;27(2):1256–73.
Article CAS Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was supported by the National Natural Science Foundation of China (No. 81960244 and No. 81360198 to RSX), Natural Science Foundation of Guangdong Province of China (2019A1515011341 to XZ), The Science and Technology Plan Foundation of Guangzhou (No. 201904010066 to JHD), The Science and Technology Plan Foundation of Guangzhou (No. 202201011663 to YFL).

Author information

Yifu Liao, Haiping Cai and Feifei Luo contributed equally to this work.

Authors and Affiliations

Department of Neurology, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
Yifu Liao
Department of Neurosurgery, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
Haiping Cai
Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Oncology in South China, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
Feifei Luo
The First School of Clinical Medicine, Southern Medical University, Guangzhou, China
Dongcheng Li
Eastern Department of Neurology, Guangdong Geriatrics Institute, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
Jinhai Duan
Department of Neurology, Jiangxi Provincial People’s Hospital, The First Affiliated Hospital of Nanchang Medical College, Clinical College of Nanchang Medical College, Nanchang, China
Renshi Xu
Department of Neurology, Maoming People’s Hospital, Maoming, China
Hao Li, Geng Liao & Xiong Zhang

Authors

Yifu Liao
View author publications
You can also search for this author in PubMed Google Scholar
Haiping Cai
View author publications
You can also search for this author in PubMed Google Scholar
Feifei Luo
View author publications
You can also search for this author in PubMed Google Scholar
Dongcheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Hao Li
View author publications
You can also search for this author in PubMed Google Scholar
Geng Liao
View author publications
You can also search for this author in PubMed Google Scholar
Jinhai Duan
View author publications
You can also search for this author in PubMed Google Scholar
Renshi Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xiong Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XZ: project design, data analyses, manuscript writing. YFL: project design, data analyses, manuscript writing. HPC: data analyses. FFL: data analyses. DCL: data analyses. HL: data analyses. GL: data analyses. JHD: data analyses. RSX: data analyses. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jinhai Duan, Renshi Xu or Xiong Zhang.

Ethics declarations

Ethics approval and consent to participate

Informed consent was obtained from all subjects and/or their legal guardians. This method was also carried out in accordance with the guidelines (Declaration of Helsinki).

Consent for publication

This study is approved by the research ethics committee in the Guangdong Provincial People’s Hospital with informed consent for publication of any potentially identifiable data or images (No. GDREC2019424H(R2)).

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Primer sequences genotyped and enrichment analysis of 5 nervous system-specific expressed genes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Liao, Y., Cai, H., Luo, F. et al. Three nervous system-specific expressed genes are potential biomarkers for the diagnosis of sporadic amyotrophic lateral sclerosis through a bioinformatic analysis. BMC Med Genomics 16, 15 (2023). https://doi.org/10.1186/s12920-023-01441-x

Download citation

Received: 15 September 2022
Accepted: 16 January 2023
Published: 27 January 2023
DOI: https://doi.org/10.1186/s12920-023-01441-x

Three nervous system-specific expressed genes are potential biomarkers for the diagnosis of sporadic amyotrophic lateral sclerosis through a bioinformatic analysis

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Identification of potentially functional modules and diagnostic genes related to amyotrophic lateral sclerosis based on the WGCNA and LASSO algorithms

Identifying Candidate Genes Associated with Sporadic Amyotrophic Lateral Sclerosis via Integrative Analysis of Transcriptome-Wide Association Study and Messenger RNA Expression Profile

Polymorphism of rs3737597 in DISC1 Gene on Chromosome 1q42.2 in sALS Patients: a Chinese Han Population Case-Control Study

Background

Methods

Gene expression data acquisition

Data processing and identification of differentially expressed genes (DEGs)

DEG visualization analysis

Tissue/organ-specific gene expression determination

Performance of the enrichment analysis

PPI network construction

Prediction of the target miRNAs of the DEGs

CeRNA network construction

Genomic DNA extraction and polymerase chain reaction (PCR)

Statistical analysis

Results

Identification of DEGs

Analysis of DEG expression in tissues/organs

DEG enrichment analysis

PPI network analysis and hub gene identification

Construction of mRNA–miRNA coexpression networks

Verification of five nervous system-specific expressed hub genes in three GEO datasets

ROC curve and prognosis prediction ability of 5 nervous system-specific expressed hub genes in sALS samples

Construction of mRNA–miRNA–ncRNA coexpression networks

Identification of the genetic association between 3 SNPs and sALS

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation