Introduction

Vitamin B12 (VitB12), also known as cobalamin, is an essential micronutrient synthesized exclusively by certain bacteria and archaea, which can enter the food chain of human beings due to their interaction with animals and plants (Salinas et al 2018; Watanabe and Bito 2018). The uptake of VitB12 is a multi-step process and several different alterations can impair its absorption. The estimated reserve of VitB12 in the human liver is roughly about 2–5 mg, in spite of its daily requirement being 1–4 μg. Therefore, the symptoms of VitB12 deficiency develop 3–5 years from the start of malabsorption and then a possible recovery from the deficiency may not be reversible (Henríquez et al 2007). VitB12 is of vital importance due to its significant role as the cofactor in two fundamental biochemical reactions. One such reaction takes place in the mitochondria, where VitB12, in the form of adenosylcobalamin, is responsible for the synthesis of succinyl-CoA from methylmalonyl-CoA and biotin. The other reaction occurs in the cytoplasm, where VitB12, in the form of methylcobalamin, participates in the synthesis of methionine from folic acid (VitB9) and homocysteine (Allen 2012; Kräutler 2012). Therefore, VitB12 deficiency leads to the accumulation of methylmalonic acid and homocysteine, respectively, which are ultimately toxic at high concentrations. VitB12 deficiency can be equally detrimental for both the older population and children (Fidaleo et al. 2021).

VitB12 deficiency can give rise to a broad range of, yet often unrecognized, symptoms (such as weakness, loss of appetite, and weight) and can severely affect human health by impairing fertility, embryo development and progressing into severe disease conditions such as megaloblastic anemia, cancer, and neurological modifications (namely numbness and tingling in limbs, depression, confusion, dementia, and optic neuropathy) (Andrès et al. 2007; Ata et al. 2020; Sangle et al 2020). Furthermore, VitB12 deficiency is associated with cognitive functions and can present as risk factor for Alzheimer’s disease (AD) (Calderón-Ospina and Nava-Mesa 2020). It is also linked to many other neuronal disorders, such as Wernicke’s encephalopathy, depression, subacute combined degeneration of the spinal cord, and peripheral neuropathy (Calderón-Ospina and Nava-Mesa 2020). In children, VitB12 deficiency due to the strict vegan diet of lactating mothers or malnourishment can affect their development and result in movement disorders and developmental delays. The various inborn errors of metabolism, also known as congenital metabolic diseases or inherited metabolic disorders, affecting absorption (intrinsic factor deficiency), transport (transcobalamin deficiency) and intracellular metabolism of VitB12 (combined methylmalonic acidemia and homocystinuria depending on the genes involved) rely on a specific requirement of supplementation/treatment with VitB12 that allow patients to survive and, in some cases, improve mental impairments (Watkins and Rosenblatt 2011). Furthermore, VitB12 deficiency can severely affect myelin formation, remyelination and regeneration of nerves after a peripheral injury and can also lead to the incorporation of abnormal fatty acids into neuronal cells and modify the level of some neurotransmitters (Duncan et al. 2018; Nielsen et al 2012; Nishimoto et al. 2015). Although medical evidence suggests a direct role of VitB12 in the CNS homeostasis, the detailed mechanisms are not yet characterized in depth. Inborn defects linked to VitB12 uptake and metabolism can help in formulating hypotheses, regarding the possible role of a lack of VitB12 in the pathology, that include byproduct toxicity and missing products of the reaction, impaired methylation capacity, alteration in the level of oxidative stress and modification non-directly linked to the enzymatic functions (Huemer et al. 2017).

Medical approaches are devoted to preventing death and severe damage to organs and the scarce knowledge currently available do not allow researchers to develop possible interventions for recovering neurological impairments, which are more common in the case of late diagnosis. Due to the crucial role VitB12 plays, together with its importance in being an essential micronutrient, a conserved and basal mechanism that may involve the gene expression can be hypothesized as a result of VitB12 deficiency or deficiencies in VitB12 metabolism. Therefore, keeping this in mind for the present study, databases were queried to search for RNA-Seq and microarray datasets regarding the CNS and were individually analyzed. The results obtained were then compared in order to highlight the possible common features and the findings reported in this study could provide insights into future research towards medical interventions.

Methods

Data acquisition and analyses

To proceed with the RNA gene expression data analyses, the datasets were obtained from the Gene Expression Omnibus (GEO) in the NCBI website (https://www.ncbi.nlm.nih.gov/) and ArrayExpress (https://www.ebi.ac.uk/arrayexpress/).

The RNA-Seq or microarray datasets were obtained from the GEO DataSets archive from the NCBI and ArrayExpress website. In order to obtain the datasets relevant to the role of VitB12 in CNS homeostasis, queries were initiated including the keywords “Cobalamin and brain” or “Vitamin B12 and brain” or “Cobalamin and CNS” or “Vitamin B12 and CNS.” Such queries gave 61 unique search results pertaining to different experiments conducted on different organisms, organs and tissues. From these search results, four datasets were chosen as the CNS models with alterations in the VitB12 metabolism, thereby making them suitable for the present study (Table 1).

Table 1 Datasets employed for the analyses

GSE103417 is an expression profiling by array obtained from the neuroblastoma cells (N1E-115). N1E-115 cells were engineered to express a chimeric protein by fusing oleosin to the N-terminal (TO) or C-terminal (OT) of transcobalamin. TO chimeric proteins had an increased binding affinity towards VitB12, whereas the OT chimeric proteins had no detectable VitB12 binding activity. Thus, the expression of OT chimeric proteins led to the sequestering of substrates and helped in mimicking an appropriate VitB12 deficiency condition. Furthermore, neuroblastoma cells can have a proliferative phenotype (tumor status) or can be differentiated towards a non-proliferative motor-like neuron phenotype (Battaglia-Hsu et al. 2009, 2018). Therefore, the datasets from the non-proliferative motor-like neuron phenotypes were considered for further analyses.

GSE99113 is a high throughput sequencing dataset of the astrocytes obtained from cerebral hemispheres of postnatal day 0 C57BL/6 J pups. In this experiment, VitB12 deficiency was induced by maintaining the astrocyte cultures under serum-free conditions (DMEM/F12) and VitB12-free medium (custom-synthesized medium) (Jonnalagadda et al 2022).

GSE161763 is a high throughput sequencing dataset obtained from the E16.5 brain of a mouse model carrying the mutated protein THAP11F80L/F80L. THAP11 is a transcription factor that regulates the transcription of Mmachc (a key gene responsible for the intracellular cobalamin metabolism). Here, the change of a phenylalanine to leucine at position 80 of THAP11 (THAP11F80L/F80L) determines a decrease in Mmachc gene transcription and depletion of MMACHC protein, thereby mimicking the VitB12 deficiency condition (Chern et al. 2022).

GSE104164 is an expression profiling by array obtained from the ground cerebellum of female rat pups under nutritional methyl donor deficiency (MDD). Nutritional MDD was generated by feeding female rats with a diet deficient in folate (VitB9) and VitB12 and lowered in choline, starting from one month before pregnancy until the pups were weaned at postnatal day 21 (P21) (Willekens et al. 2019).

For analyzing the RNA-Seq and microarray datasets, iDEP (integrated differential expression and pathway analysis), a web-based tool, was employed (http://ge-lab.org/idep/) to perform exploratory data analysis (EDA) (Ge et al 2018).

Gene expression matrices were built from TXT files downloaded from GEO DataSets. Datasets were filtered based on the parameters mentioned in Table S1. The quality of datasets was assessed by hierarchical clustering and principal component analysis (PCA). The differentially expressed genes (DEG) were detected by DESeq2 package and the co-expressed genes were used to identify Gene Ontology (GO) using cellular component as a gene set. Also, genes emerging from DEG analysis were used to conduct an enrichment of transcription factor (TF) binding motif in the upstream 600 base pairs (bps) of the promoters of the highlighted genes. The number of replicas for samples of each dataset, false discovery rate (FDR) cutoff, and minimal fold change and specific GO considered are listed in Table S1.

PGSEA (parametric gene set enrichment analysis) package was used to run the PAGE (parametric analysis of gene set enrichment) algorithm, which performs one-sample t-test on each gene included in the gene dataset chosen (GO cellular component), to identify the altered pathways.

ChIP-X enrichment analysis 3

TF enrichment analysis on the genes that were considered to build the “Ribosome” pathway (derived from the Enrichment analysis in the differentially expressed genes (DEG) analysis on mice considering GO cellular component gene set) was performed by ChIP-X enrichment analysis 3 (ChEA3) considering “Literature” dataset (https://amp.pharm.mssm.edu/ChEA3) (Keenan et al. 2019).

In silico analyses

Sequencing alignment was performed by the Pairwise Sequence Alignment—EMBOSS Needle that uses Needleman-Wunsch algorithm (https://www.ebi.ac.uk/).

Identification of the putative Transcription Factor Binding Sites (TFBSs) in the promoter regions were performed using PROMO (http://alggen.lsi.upc.es/) (Farré et al. 2003; Messeguer et al. 2002). Sequences were related to matrixes using Quandt and co-workers similarity algorithm, and the default threshold for dissimilarity is 15% (Quandt et al 1995). Sequences used for the alignment and identification of the putative TFBSs have been obtained from the NCBI website (https://www.ncbi.nlm.nih.gov/).

Results

Pre-processing and exploratory data analysis

The pre-process analyses and the parameters employed for all the datasets are reported in Fig. S1, S2, S3 and S4 and Table S1. The selection parameters filtered out 30–50% of the genes as mentioned in Table S2. The analysis involving the motor-like neuron model revealed a lack of correlation (Fig. S1, panels A-B) and also heterogenous distribution of the transformed data (Fig. S1, panel C) between samples of the same groups. Furthermore, the heatmap and hierarchical clustering tree built using the genes with maximum expression level at the top 75% did not achieve perfect discrimination among the groups (Fig. S1, panels D-E). Hence, the motor-like neuron model was excluded from the analyses. All other datasets showed high correlation and homogenous distribution of the transformed data among the various samples, with the reference groups and VitB12 deficiency mimicking groups distinctly grouped in the hierarchical clustering trees, although a little variation was observed among the technical replicates (Fig. S2, S3 and S4, panels A-E). Furthermore, the PCA for each dataset gave Principal Component (PC)1 and PC2 accounting for 55% and 28%, 63% and 16%, and 56% and 17% for the astrocyte, mouse and rat models, respectively, which led to cluster discrimination of the samples (Fig. S2, S3 and S4, panel F), whereas samples from the motor-like neuron model overlapped (Fig. 1S, panel F).

DEG analysis was performed on each dataset by comparing the VitB12 deficiency mimicking groups vs reference groups (see Table S1 for parameters and Fig. S5). The number of genes that were upregulated and downregulated were 191 and 308 in the astrocyte model, 844 and 132 in the mouse model and 880 and 1494 in the rat model, respectively (Fig. S5, panels A-C, left side). For each dataset, the gene expression levels of Mmachc and Cd320 (encoding for metabolism of cobalamin associated C and transcobalamin receptor proteins, respectively), were reported, as they were the key factors involved in the metabolism of VitB12 (Table 2).

Table 2 Gene fold change from DEG, transciption factor gene targets (according to ChEA3) and peak score from THAP11 ChIP-Seq, in accord with Chern et al. (2022)

Results from DEG analysis were used to conduct the enrichment analysis considering the GO cellular component gene set (Fig. S5 and Table S3). A hierarchical clustering tree map describes the difference in distances among the similar-termed GO, exploiting the percentage of overlapped genes involved in each pathway (Fig. S5, panels A-C, right panel). The comparison among the 3 tree maps revealed that the ribosomal biology was affected only in the mouse and rat models (ribosomal pathways modified in both the animal models showed low values of adjusted P-values). Surprisingly, in the rat model, although the modulation of the ribosome pathway was evident, the analysis revealed an opposite trend (upregulation of gene expression) compared to that of the mouse model (the ribosomal genes being globally downregulated) (Fig. S5, panels A-C, right panel).

With the aim to corroborate these observations, a different analysis was performed on the datasets. The PGSEA, considering GO cellular component (PAGE algorithm) as a gene set, performed one-sample t-test on each gene specifically involved in a specific biological process branch of GO, thereby obtaining adjusted P-values that were used to rank the pathways contributing to the determination of each principal component (see PCA analysis, Fig. S2, S3 and S4, Panel F). This analysis was more sensitive and brought to light the weaker modifications. Therefore, Fig. 1 shows the observations from the PGSEA analysis for each dataset considered. For each pathway, the load of PC1 and PC2 on the pathway was indicated by the appropriate colours (red and blue indicate activated and suppressed pathways, respectively). The Pathways were also labelled with the false discovery rate (FDR). Both for the mouse and rat models, alteration of genes involved in the ribosome gene dataset participates in the load of the first component by being negatively or positively correlated, respectively. This result was in accord with the DEG analysis. Although the genes involved in the “Ribosome” GO negatively and weakly loaded the PC2 for the rat model, the PC1 accounted for 56% of the PCs, thereby portraying a prominent role. Surprisingly, the PGSEA analysis performed on the astrocyte dataset also revealed some ribosomal alterations within this model (Fig. 1).

Fig. 1
figure 1

PGSEA analysis considering GO cellular component gene set. Red arrows indicate the GO: cellular component related to ribosomes. Red and blue in the square boxes indicate an activated pathway (corresponding to a positive z-score) and a suppressed pathway (corresponding to a negative z-score), respectively

A different analysis from DEG, called “Pathways,” was also performed. While DEG analysis is sensitive to arbitrary cut-offs, Pathways analysis helps in the identification of coherently altered pathways using fold-change values. On choosing the GAGE method and GO cellular component gene set, this analysis gave results very similar to that of the PGSEA analysis, thereby corroborating observations regarding the alterations in the ribosome pathways in all three datasets (Table S4).

THAP11 targets analysis in mouse model

Since the mouse model was obtained through genetic engineering and the other models by a nutritional deficiency of VitB12, it was questioned whether the alteration in “Ribosome” pathway was directly linked to THAP11 mutation. A ChEA3 analysis was performed on the genes that were considered to build the “Ribosome” pathway in the Enrichment analysis on mice from DEG results (obtained by considering the GO Cellular Component gene set) (Table S5). ChEA3 is a transcription factor enrichment analysis that ranks TFs associated with a specific list of genes. As a gene set library, “Literature” was chosen for the analysis. This library is built from the data of published experiments obtained from humans, mice and rats (Keenan et al. 2019). The analysis highlighted that THAP11 was ranked 29th place with an overlap of 4 genes, while MYC was the top-ranked one with 21 genes overlapping out of 34 (Table S6 and Fig. 2). Considering the genes that were included in “Ribosome” pathway common to both mice and rats (15 genes as reported in Table 2), only 2 of them were returned in the ChEA3 results. Furthermore, Chern and co-workers performed a ChIP sequencing analysis to establish the THAP11 target genes (Chern et al. 2022) and the peak scores (–log10 (P-value)) obtained were reported in Table 2. With the exception of Rps27a and Mrpl54, the other ribosomal genes had a low score. These results are in accord with ChEA3 analysis and collectively suggested that THAP11 did not have a global action on the ribosomal genes, but were limited to very specific ones.

Fig. 2
figure 2

ChIP-X enrichment analysis 3 (ChEA3). The analysis ranks the TFs that modulate the genes considered to build the “Ribosome” pathway in the Enriched pathways DEG analysis on mice, considering the GO cellular component gene set. The name at the base of histograms indicates the TFs while the number in the histograms is their place in the rank. FET refers to Fisher’s exact test

Next, it was to be understood whether the alterations in astrocytes and rats were possibly linked with a change in the THAP11 expression that, in turn, can affect the genes involved in the metabolism of VitB12. Considering the ChIP-seq results regarding THAP11, a high peak score was observed for both Mmachc and, interestingly, for Thap11, thereby suggesting a positive feedback regulation mechanism for this transcription factor in the mouse model. However, Thap11 and Hcfc1 (which encodes for a transcription factor and together with THAP11 jointly regulates the Mmachc expression) gene levels were not altered in the astrocyte and rat models on inducing the VitB12 deficiency, thereby excluding any alteration linked to these genes.

TF enrichment analysis in DEG

The observed difference in the ribosomal gene regulation for the models could be due to the transcription factors being regulated differently after the induction of VitB12 deficiency. Aiming at understanding this point, a TF Enrichment analysis in DEG was performed. This returns the more representative TFs that target the promoters of the genes involved in the phenom based on the greater variable ones without focusing on a specific pathway. Thus, although DEG analysis was unable to detect the alterations in ribosomal gene expression in the astrocyte model, the latter was included in the analysis. TF enrichment highlighted a complex panorama (Table S7). A possible overlap of TFs (without considering the direction, i.e., upregulation or downregulation) was evaluated in the results obtained from each group: as reported in the Venn diagram (Fig. 3) and Table 3, and four common TFs encoded by Patz1/Zfp278, Sp4, Egr1 and Zbtb7b genes were found. Surprisingly, none of these TFs were returned in the ChEA3 analysis performed on ribosomal genes. Furthermore, the intersection of the TF Enrichment analysis performed for each dataset showed an enrichment of genes having E2F1 as TFBSs in the promoter common for the astrocyte and rat models and EGR3 common for the mouse and rat models. Interestingly, ChEA3 analysis highly ranked E2F1 and ERG3 when genes from “Ribosome” pathway from mice were used as a query, with an overlap of 18 genes and 9 genes respectively (Table S6).

Fig. 3
figure 3

Venn diagram on TF enrichment analysis. The Venn diagram reports TF enrichment analysis, according to DEG analysis results, performed on all the models. The Venn diagram was made using https://bioinformatics.psb.ugent.be/webtools/Venn/

Table 3 TF enrichment analysis on DEG results (Venn diagram intersection)

Transcription factors and “Ribosome” pathway gene expression

Considering the common TFs returned by the TF enrichment from the DEG and ChEA3 analyses, the gene levels of E2f1 and Erg3 for all the models were evaluated. Since MYC was highly ranked in the ChEA3 analysis for “Ribosome” pathway, Myc was also included (Table 2). A significant increase in the gene expression of E2f1 under VitB12 deficiency mimicking conditions in the mouse and rat models (with the latter showing a greater induction) was observed, but not in the astrocyte model; however, the gene expression levels of Erg3 and Myc were not altered. Furthermore, the expression of the genes common to the mouse and rat models that take part in the “Ribosome” pathway establishment were selected (Table S5) and for each of them, it was annotated if they were returned in the ChEA3 analysis (Table S6) and the log2 fold change (log2FC) (Table 2). Comparing the genes involved in the “Ribosome” pathway with adjusted P-values < 0.05, it was observed that all of them were downregulated for the mouse model (Rps15a, Rps25, Rps27a, Mrpl14, Rps24, Rbm3, Mrpl54, Mrpl41, Rpl12, Rpl22l1, Chchd1, Rpl13a and Rpl41), whereas the significantly modulated genes were upregulated for the rat model (Mrpl14, Mrpl54, Rpl13a). A majority of the above-mentioned genes were observed to be associated with E2F1 in the ChEA3 analysis (mainly based on data from mice). In accord with the mouse model, astrocytes showed a downregulation trend in ribosome-related genes, although it was observed to be insignificant.

Putative transcription factor binding sites of the genes involved in the “Ribosome” pathway for rat model

Since E2F1 regulates the expression of ribosomal genes positively (Ayrault et al 2006; Gnanasundram and Fåhraeus 2018; van Riggelen et al 2010), it was evaluated whether E2F1 could occupy the promoter and modulate the upregulation of Mrpl14, Mrpl54, and Rpl13a genes observed in the rat model. The ChEA3 analysis performed on the genes involved in “Ribosome” pathway returned only one dataset for the rat model; therefore, an in silico analysis was needed to evaluate the putative TFBSs in rats.

For the promoter analysis, a region of 600 bps upstream of the gene was considered. Firstly, an alignment of both mRNA transcripts and promoter regions for each gene was performed for assessing the similarities in transcripts and gene promoter regions using Pairwise Sequence Alignment. A comparison of the mRNA transcripts of Mrpl14, Mrpl54, and Rpl13a from mice and rats gave a similarity and gaps of 67.3% and 29.1%, 81.0% and 8.6%, and 57.4% and 39.8%, respectively, thereby indicating a certain degree of structural relationship (Fig. S6, S7 and S8 and Table 4). The same analysis performed on the Mrpl14, Mrpl54, and Rpl13a promoters highlighted a similarity and gaps of 42.1% and 31.2%, 45.7% and 21.1%, and 48.1% and 25.8%, respectively, thereby suggesting a low structural conservation of the promoter regions due to the similarities being lower than 50% (Fig. S9, S10 and S11 and Table 4).

Table 4 Pairwise sequence alignment

To investigate whether putative E2F1 TFBSs were present in the promoters of Mrpl14, Mrpl54, and Rpl1 for the rat model, an in silico analysis was performed using PROMO, an online software which could predict TFBSs (Farré et al. 2003; Messeguer et al. 2002). This software highlighted multiple putative E2F1 TFBSs in the promoters of Mrpl14, Mrpl54, and Rpl13a for the rat model (Fig. 4 and Table 5). In spite of all the genes involved in “Ribosome” pathway exhibiting an upregulation trend for the rat model, the expression of Chchd1 and Rbm3 genes were reduced during the VitB12 deficiency mimicking conditions (Table 2). A comparison of the Chchd1 and Rbm3 mRNA transcripts for the mouse and rat models highlighted a similarity and gaps of 82.7% and 9.7% and 33.3% and 62.6%, respectively (Fig. S12 and S13, Table 4), whereas, with regard to the promoter regions, the similarity and gaps were observed to be 39.9% and 20.3% and 39.3% and 47.1%, respectively (Fig. S14 and S15 and Table 4). The pairwise sequence alignment analysis on Rbm3 revealed very low similarity for the gene transcripts and also wide gaps for both the gene transcripts and promoter sequences, thereby making their comparison unreliable. Therefore, Rbm3 was excluded from the subsequent analysis. Furthermore, PROMO analysis performed on the Chchd1 promoter does not identify any putative E2F1 TFBSs, thereby hypothesizing that the observed downregulation could be associated with the absence of E2F1 TFBSs in the promoter of Chchd1 gene.

Fig. 4
figure 4

PROMO analysis of promoters. Factors predicted within a dissimilarity margin less or equal to 15%. The progressive numbers on the top of the grid indicate the distance from the 5′ of the DNA sequences considered

Table 5 PROMO analysis

Discussion

The crucial role of VitB12 in brain homeostasis, including neurodevelopment and neurodegenerative/neurological diseases, is known, although the underlying mechanisms remain poorly understood. Therefore, with the aim of gathering new insights on the errors in VitB12 metabolism with specific reference to gene expression response, RNA-Seq and microarray datasets generated from experimental models involving VitB12 metabolism defects were considered in this study. Datasets were obtained from the GEO DataSets archive and by limiting the inquiry to CNS or CNS-related models, although a great heterogeneity in methods were employed to generate VitB12 mimicking deficiency conditions in cell and animal models, it was possible to identify four suitable datasets. Three of these datasets were analysed as part of previous publications (GSE99113, GSE161763, and GSE104164) (Chern et al. 2022; Jonnalagadda et al. 2022; Willekens et al. 2019), whereas any analysis regarding the last dataset (GSE103417) was not published, although the authors of the dataset submitted a manuscript reporting the same experimental design (Battaglia-Hsu et al. 2009).

The DEG analysis performed on each dataset compared the VitB12 deficiency mimicking groups vs reference groups. For each dataset, about 18,000 genes were taken into account. DEG analysis revealed a difference in the number of genes that were modulated: considering the upregulated or downregulated gene, astrocytes gave the lowest number (191 and 308, respectively), whereas, the rat model gave the highest (880 and 1494). Enrichment Pathways analysis, followed by the DEG analysis, highlighted alterations in the regulation of the genes involved in ribosomes as a consequence of the VitB12 metabolism modification, in both the animal models with opposite trends. Although different similar-termed GO pathways involving the ribosomes emerged (as underscored by hierarchical clustering tree maps), the “Ribosome” pathway from DEG analysis was used as a reference for subsequent investigations in this study. In the mouse model, a general downregulation of the ribosomal gene expression was observed, whereas, on the contrary, there was an upregulation of the ribosomal gene expression in the rat model. However, there were no significant alterations observed for mentioned pathways in the astrocyte model. With regard to the mouse astrocytes, it is possible that being a cellular model, the maintenance of the culture, e.g., periodically changing the culture medium, reduces the accumulation of toxic byproducts (such as homocysteine) and can mitigate the alterations due to VitB12 deficiency, thereby leading to a weak gene modulation (in accord with the lower number of altered genes detected) and hindering the identification of many emerging alterations in the ribosomal pathways from DEG analysis. Alternatively, since DEG analysis is based on variance, the experimental design of the astrocytes dataset (made up of 2 replicas for each condition) could affect results by flatting the possible differences between groups. As a result of more sensitive analyses of all models, including PGSEA and Pathways, a decrease in the expression of the ribosomal genes in astrocytes of a lesser magnitude was observed, thus confirming the hypothesis that culture maintenance attenuated the effects of VitB12 deficiency on the cellular model.

The defects arising primarily due to VitB12 deficiency or alteration in proteins linked to the VitB12 metabolism pathway could be exacerbated by a general downregulation of Mmachc or Cd320. Indeed, the downregulation or a defect in MMACHC, the pivotal enzyme in the metabolism of VitB12, and in the transcobalamin receptor (encoded by Cd320) are associated with VitB12-related defects (Chern et al. 2022; Fidaleo et al. 2021; Pappas et al 2022). Therefore, to characterize the models chosen for this study, Mmachc and Cd320 gene expression levels were evaluated. A highly significant downregulation was observed for the mouse model, whereas the mouse astrocyte and rat models showed a small downregulation of Mmachc, however not significant. With regard to Cd320, a reduction in the gene expression level was observed in the mouse and astrocyte models (although without significance), but not in the rat model. Such observations could be plausible, considering the approaches used to induce VitB12 deficiency: in the mouse model, Mmachc expression was altered following mutation in the THAP11 transcription factor which participates in the Mmachc gene regulation; whereas, VitB12 deficiency in the astrocyte and rat models was induced by removing VitB12 from the food or culture medium which does not lead to an alteration in the expression of Mmachc, at least under the tested conditions (Battaglia-Hsu et al. 2009; Chern et al. 2022; Jonnalagadda et al. 2022; Willekens et al. 2019).

The selected models included in the present study were very heterogenic considering the cell types (astrocytes, neuroblastoma, brain and cerebellum) and the species (mouse and rat) (Table 1). Furthermore, in the mouse astrocyte and rat models, the deficiency was induced by removing VitB12 from the culture medium or food, whereas the mouse model was obtained by genetic manipulations of Thap11, thus resulting in a downregulation of Mmachc. THAP11F80L/F80L mice exhibited a complex disease phenotype including craniofacial deformity. Despite the controversy, defects in cranial formation and VitB12 metabolism may be linked. Chern and colleagues generated a mouse model overexpressing Mmachc — in the presence of THAP11F80L/F80L — in the neural crest cells (which differentiated into a portion of the craniofacial skeleton) showing normal cranial formation thus demonstrating that the impairment in bone development was not due to VitB12 metabolism defect, but mainly related to the other THAP11 target genes. On the contrary, a direct involvement of cobalamin metabolism in cranial development was evident in the zebrafish model (Quintana et al. 2014). Currently, an exhaustive description of the involvement of VitB12 defects and cranial formation is unavailable (Chern et al. 2022; Quintana et al. 2014). Furthermore, regarding the THAP11F80L/F80L mouse model, Chern and colleagues observed ribosomopathy and postulated that it was mainly linked to THAP11 rather than MMACHC alteration. Even though they concluded that the mechanism of action of THAP11 (and HCFC1) in ribosome biogenesis remains unsolved (for instance they observed tissue-specific phenotypes), their observations from the ChIP sequencing suggested a direct binding of THAP11 to some of the ribosomal gene promoters (Chern et al. 2022). Since an overall modification was observed in the expression of genes linked to ribosomes in all the datasets chosen for this study, the possible occupancy of THAP11 in the promoter of “Ribosome” pathway genes, that were commonly modulated in all the models, was evaluated by assessing the peak scores from the ChIP sequencing results from Chern and colleagues (Chern et al. 2022). A high peak score was observed for only two genes involved in ribosome biogenesis in mice. Therefore, considering the ChEA3 analysis – many other TFs are higher ranked compared to THAP11, including the MYC family which are considered as the master regulators of ribosome biogenesis (Arabi et al. 2005; Boon et al. 2001; Cole and Cowling 2008; Grandori et al. 2005; Grewal et al 2005; van Riggelen et al. 2010) — and considering the common overall misregulation of the ribosomal genes in all the datasets considered in this study, it is possible to hypothesize that the modulation of genes related to the “Ribosome” pathway was not strictly associated to the THAP11 mutation, but more generally due to the VitB12 metabolism alteration.

In order to identify a common TF for all the models that could be responsible for the ribosomal gene regulation associated with the VitB12 metabolism alteration and more specifically, the one responsible for the divergence in the mouse and rat models, a TFs enrichment analysis was performed. The analysis took into account the genes that were globally upregulated and downregulated (without focusing on specific gene clusters) and highlighted four common TFs, for all the models, encoded by Patz1/Zfp278, Sp4, Egr1 and Zbtb7b genes that were not returned by the ChEA3 analysis performed on ribosomal genes (the latter is mostly based on the datasets from mouse). This suggested that the above-mentioned TFs could take part in a more general gene regulation process and not be related to the “Ribosome” pathway alteration, at least for the mouse model. Interestingly, E2F1 and EGR3 emerged as TFs modulated following the VitB12-metabolic defect (not common to all models) and are returned by the ChEA3 analysis, thereby providing a hint about their involvement in the ribosomal gene regulation in all the models. Furthermore, the ChEA3 analysis revealed MYC as the highest-ranked TF which modulated the ribosome genes selected. As reported in Table 2, E2F1 was associated with the majority of the selected genes as compared to MYC and EGR3, at least for the mouse model. In mice, the fact that the Enrichment TF analysis did not allow to emerge E2F1 could be linked to several features associated with the model and the analysis: (i) Enrichment TF analysis is based on the more altered genes; (ii) astrocyte and rat models are obtained by inducing nutritional VitB12 deficiency; and (iii) in mice, THAP11 mutation can have a lower and indirect role in “Ribosome” pathway and a prominent direct role on several other different pathways. These can lead to the detection of ribosome gene alteration without highlighting the change in E2F1 TFBSs enrichment in mice. Thus, a different analysis approach was needed.

The regulation of a TF target gene can be regulated by its TF content. Hence, we evaluated the gene expression levels of Myc, E2f1/Necab3 and Egr3 and the expression levels of E2f1/Necab3 were observed to be significantly upregulated in both the animal models, although rats showed a greater increase and higher significant upregulation due to the induction of the VitB12 metabolism defect. In spite of having very little information regarding ERG3 and its role in ribosomal gene regulation, the involvement of E2F1 and MYC in the positive regulation of ribosomal gene expression is quite known (Ayrault et al. 2006; Gnanasundram and Fåhraeus 2018; van Riggelen et al. 2010). The fact that E2F1 was greatly upregulated in rats as compared to that of mice after the induction of the VitB12-metabolic defect suggested its possible role in the divergent modulation of the ribosomal genes.

To reinforce the hypothesis on the possible involvement of E2f1 in the divergent regulation of ribosomal genes in mice and rats, the promoter sequence of genes involved in the “Ribosome” pathway exhibiting an opposed behaviour were taken into account: these were Mrpl14, Mrpl54 and Rpl13a. Before examining the promoter region, the transcripts of these genes were compared to establish a certain similarity, thereby allowing a genuine comparison. All the gene transcripts exhibited a similarity of over 50% and gaps below 50%, indicating an overall similar structure. The same analysis performed on the promoters revealed a lower similarity between mice and rats, thus underscoring a difference for the considered DNA portions. On comparing the promoter and transcript alignment results, it can be inferred that an evolutionary modification in the promoter could be responsible for the difference in gene regulation. Interestingly, using PROMO analysis, putative E2F1 TFBSs were predicted in the rat promoters. As a result of this and together with the higher E2f1 gene expression, it can be concluded that rats had a greater amount of E2F1 protein which could bind Mrpl14, Mrpl54 and Rpl13a, thereby activating their upregulation. To provide more evidence regarding this mechanism, the same analyses were performed on Chchd1 and Rbm3, which were part of the “Ribosome” pathway and were both observed to be downregulated in rats after the induction of VitB12 deficiency. However, only Chchd1 exhibited similarity after performing the alignment, thus allowing genuine comparison. PROMO analysis did not show any E2F1 TFBSs in the Chchd1 promoter, thereby suggesting its non-involvement in E2F1 regulation. This could explain the Chchd1 downregulation in rats after the induction of the VitB12-metabolic defect, irrespective of the increase in E2f1 (Fig. 5).

Fig. 5
figure 5

Molecular hypothesis. VitB12 deficiency leads to an upregulation of E2f1 in animal models (with rats showing a higher level). In rats, some genes involved in the ribosome pathway show putative E2F1 TFBSs. The greater expression of E2F1 and a higher TFBSs frequency in ribosome genes could sustain the upregulation of ribosome-related genes observed in rats

The alteration of the ribosome-related gene expression can affect ribosome biogenesis leading to ribosomopathies. Although ribosomes are essential organelles, the manifestation of the defects is extremely cell- and tissue-specific and includes mRNA translation, cell cycle control and signalling pathways (e.g., TP53, MYC and mTOR) (Armistead and Triggs-Raine 2014). In general, non-dividing differentiated cells have a reduced ribosome biogenesis rate, whereas post-mitotic neurons demonstrate an intense rate of ribosome biogenesis, particularly in neurites, for the increased synthesis of local proteins to promote neuronal morphogenesis. Increased ribosome biogenesis can be achieved via the ERK1/2 signalling cascade by neurotrophic factors, including brain-derived neurotrophic factors (BDNF). Additionally, data suggest a strong relationship between the increase in ribosome biogenesis and neurite outgrowth during nerve regeneration following injury (Turi et al. 2019). Of note, ERK1/2 and E2F1 are involved in the same pathway regulating cell cycling (Korotayev et al. 2008), thus speculating a possible association of ribosome biogenesis in CNS to be validated.

In this study, the presented comparative analysis revealed ribosome-related gene modulation specific to cell and animal models and suggested the important role played by E2F1 in maintaining ribosome homeostasis in the CNS on inducing VitB12 deficiency mimicking conditions. Overall, literature data and the results reported from this study indicate ribosome biogenesis/E2F1 as a potential target for reversing the VitB12 deficiency side effects.

Conclusion

There are ample pieces of evidence that suggest the role of VitB12 in CNS homeostasis, however, the precise mechanisms regarding the impairments due to its deficiency have not been elucidated. In this study, the available gene expression datasets with gene misregulation, following the induction of VitB12 metabolic defects, were considered to evaluate the possible underlying mechanisms and the mode of action. An alteration in the ribosome regulation was highlighted in all the models. The huge differences among the models and the protocols for mimicking the VitB12 deficiency considered in this study, strengthen the hypothesis of a conserved and basal mechanism involved in the regulation of ribosomal gene expression related to VitB12 metabolism. Furthermore, a possible role in the above-mentioned ribosomal regulation can be ascribed to the transcription factor E2F1 and cognate TFBSs. Although this study requires further validation employing the wet laboratory procedures, the results drawn from this comparative dataset analysis have provided new insights into the basic molecular mechanisms that might be involved in CNS homeostasis regarding its association with VitB12 deficiency.