An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx)

Jia, Kaiwen; Cui, Chunmei; Gao, Yuanxu; Zhou, Yuan; Cui, Qinghua

doi:10.1038/s41420-018-0093-y

An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx)

Article
Open access
Published: 20 August 2018

Volume 4, article number 91, (2018)
Cite this article

Download PDF

You have full access to this open access article

Cell Death Discovery

An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx)

Download PDF

Kaiwen Jia ORCID: orcid.org/0000-0003-4941-6456¹,
Chunmei Cui¹,
Yuanxu Gao¹,
Yuan Zhou¹ &
…
Qinghua Cui^1,2

7326 Accesses
25 Citations
7 Altmetric
Explore all metrics

A Correction to this article was published on 10 July 2019

This article has been updated

Abstract

Aging is a complex biological process that is far from being completely understood. Analyzing transcriptional differences across age might help uncover genetic bases of aging. In this study, 1573 differentially expressed genes, related to chronological age, from the Genotype-Tissue Expression (GTEx) project, were categorized as upregulated age-associated genes (UAGs) and downregulated age-associated genes (DAGs). Characteristics in evolution, expression, function and molecular networks were comprehensively described and compared for UAGs, DAGs and other genes. Analyses revealed that UAGs are more clustered, more quickly evolving, more tissue specific and have accumulated more single-nucleotide polymorphisms (SNPs) and disease genes than DAGs. DAGs were found with a lower evolutionary rate, higher expression level, greater homologous gene number, smaller phyletic age and earlier expression in body development. UAGs are more likely to be located in the extracellular region and to occur in both immune-relevant processes and cancer-related pathways. By contrast, DAGs are more likely to be located intracellularly and to be enriched in catabolic and metabolic processes. Moreover, DAGs are also critical in a protein–protein interaction (PPI) network, whereas UAGs have more influence on a signaling network. This study highlights characteristics of the aging transcriptional landscape in a healthy population, which may benefit future studies on the aging process and provide a broader horizon for age-dependent precision medicine.

Synchronized age-related gene expression changes across multiple tissues in human and the link to complex diseases

Article Open access 19 October 2015

A network-based meta-analysis for characterizing the genetic landscape of human aging

Article Open access 21 December 2017

Identification of genes associated with accelerated biological ageing through computational analysis: a systematic review

Article 04 June 2024

Introduction

Aging is considered to be a dominating risk factor for many fatal diseases, including cancer, cardiovascular diseases and neurodegenerative diseases^1,2,3,4,5,6. A large number of studies have found that aging is associated with telomere attrition, mitochondrial dysfunction, DNA damage, immune system impairment etc., and can be inhibited by calorie restriction^7,8,9,10,11. However, the detailed mechanisms involved in aging remain unclear. In recent years, rapidly developing high-throughput omics have provided a broader insight, with the identification of a number of longevity-relevant loci based on genome-wide association studies(GWAS) and epigenome analyses^12,13. As previous studies have shown, aging is distinct at molecular, cellular and tissue levels¹⁴, which indicates that the relatively dynamic transcriptome might also provide important clues for the study of aging. A large number of human age-associated genes have been identified in previous transcriptomic studies, based on specific tissues like muscle, blood, skin, adipose, brain etc., and have been compared across tissues^{15,16,17,18,19,20,21,22,23,24}. The identified age-associated genes in these studies vary from each other, which could be partly due to differences in the health condition of donors, sample size, sample quality, tissue, platform and the method of identifying age-associated genes. Also, the divergent results may be accounted for by the generally low repeatability of microarray data that most of the analyses were based on. In addition, given that aging is likely to confound with other factors, profiling the aging factor alone can be difficult, which may lead to biases in screening age-associated genes. Among these studies, the age-associated gene set from the Genotype-Tissue Expression (GTEx) project²⁵ is of high quality. This set of aging-associated genes was screened out by using a regression model on large-sample RNA-Seq data, collected from >40 tissues from hundreds of healthy individuals; whereas sex, race, and tissue were controlled as a covariate to avoid biases²⁴. However, characteristics of these genes still remain unexplored. A comprehensive analysis of these genes might help to improve the understanding of aging process and provide valuable clues for strategies in anti-aging interventions. Here, we comprehensively explored characteristics of the age-associated genes derived from GTEx. The results revealed that differences and interactions exist in evolution, expression, function, associated diseases and molecular network between the upregulated age-associated genes (UAGs) and downregulated age-associated genes (DAGs).

Result

The human transcriptional age-associated genes from GTEx

To characterize the age-associated genes, we extracted a protein-coding transcriptional age-associated gene set from a GTEx transcriptional analysis (see Materials and methods section). The gene set analysis procedure is summarized in Fig. 1a. Ultimately, we extracted 710 UAGs and 863 DAGs across tissues. The age-associated genes account for 7.71% of the protein-coding genes (Fig. 1b and Supplementary Table 1). Genomic information of UAGs and DAGs is shown in Supplementary Tables 2 and 3, respectively.

**Fig. 1: The transcriptional age-associated genes.**

Distribution of the age-associated genes across chromosomes

To characterize the age-associated genes at the genomic level, we investigated the chromosomal distribution pattern of the age-associated genes. Results showed age-associated genes are widespread across chromosomes. Over 97% of the age-associated genes are located on autosomes, where the proportion of DAGs is higher than that of UAGs, with the exception of five autosomes. Although no age-associated genes were found on the Y chromosome, possibly due to the adjustment for sex in the regression model used to discover age-associated genes, some UAGs (2.86%) and a smaller number of DAGs (1.83%) were observed on the X chromosome (Figs. 1c, d).

Genes with similar functions are likely to locate adjacently on chromosomes²⁶. To study whether the age-associated genes are close to each other on chromosomes, we calculated the genomic distances of each pair of genes on the same chromosome, within one age-associated gene group only (UDs, DDs), and between groups (UDDs). Results showed these three distances are significantly different from each other (median values of UDs, DDs and UDDs are 3.52e + 7, 4.67e + 7 and 4.42e + 7, respectively). Of the three distances, UDs possess the smallest value compared with DDs (P = 7.52e-56, Wilcoxon test) and UDDs (P = 1.17e-57, Wilcoxon test). Surprisingly, UDDs show smaller value than DDs (P = 0.0038, Wilcoxon test) (Fig. 1e). These results suggest that on the same chromosome, UAGs are more clustered than DAGs and non-age-associated genes (NAGs); DAGs on the other hand are relatively dissociated, whereas some of them tend to be more adjacent to UAGs than other DAGs.

Evolutionary characteristics and expression profile of age-associated genes

To gain a better insight into the evolutionary background of genes, the evolutionary characteristics of age-associated genes were investigated in three aspects: evolutionary rate, homologous gene number and phyletic age. Results showed that DAGs have the lowest dN/dS ratio (P = 1.43e-50, Wilcoxon test), the highest homologous gene number (P = 5.73e-55 Wilcoxon test) and have a relatively earlier origin in phyletic age compared with UAGs (P = 7.37e-24, chi-squared test). The corresponding values for NAGs are P = 5.02e-45, P = 3.39e-60 and P = 4.74e-32. Comparatively, UAGs evolves most rapidly, with the smallest homologous gene number, and originated later than DAGs but earlier than NAGs (Figs. 2a–c; Table 1). These results revealed that DAGs are more conserved in evolution, suggesting the DAGs are more crucial to fundamental functions in humans. However, UAGs are less stable and have a shorter history, which implies they are likely to function in more specific and advanced functions.

**Fig. 2: Evolutionary characteristic and expression profile of age-associated genes.**

Table 1 Statistical results for characteristic analyses in evolution and expression

Full size table

Essential genes are those that function in basic biological processes and must exist for an organism to survive²⁷. As expected, DAGs are significantly more enriched in human essential genes (15.3% of DAG genes), compared with NAGs (11.8%; P = 0.0035, chi-squared test) (Table 1). UAGs (11.8%) and NAGs (11.9%) have no significant difference.

The expression profile also provides important characteristics for a gene and often gives valuable clues to potential gene function. Therefore, we explored the expression characteristics of age-associated genes from three aspects: average expression level across tissue, tissue specificity and the earliest expression stage. We found DAGs have the highest expression level and lowest tissue specificity compared with UAGs and NAGs, suggesting DAGs have a more global function, providing basic support to the whole body. UAGs are significantly higher in expression level and tissue specificity than NAGs, which suggests UAGs are also important in function, but more tissue specific (Figs. 2d, e; Table 1). As for the earliest expression stage, although the stage of embryoid body is predominant for expression in all three groups, DAGs are far more expressed than UAGs and NAGs at this stage. UAGs on the other hand outnumber DAGs and NAGs in the following stages of blastocyst and fetus (Fig. 2f; Table 1). Altogether the age-associated genes tend to express in earlier stages than other genes, underlining they are likely to play important roles in early development.

Functional annotation for age-associated genes

Functional enrichment analyses were carried out to explore the functions of UAGs and DAGs. First, we investigated the subcellular locations of UAGs and DAGs to see if they are located differently. Genes located in nucleus (NU), cytoplasm (CY), membrane (ME) and extracellular region (ER) were filtered from gene ontology (GO) terms and the age-associated genes were mapped into these cellular locations. The number of UAGs and DAGs in these four locations were, 78 versus 118 (10.99% versus 13.67%, NU), 163 versus 231 (22.96% versus 26.77%, CY), 67 versus 123 (9.44% versus 14.25%, ME) and 85 versus 15 (11.97% versus 1.74%, ER), respectively, see Fig. 3a. The distributions of UAGs and DAGs in subcellular locations are significantly different (P = 1.971e-16, chi-squared test). Noted that there is a striking difference in the fraction of UAGs and DAGs in ER, suggesting the UAG group includes far more secretory protein genes.

**Fig. 3: Functional enrichment analysis for age-associated genes.**

Next, we focused on the biological processes and pathways. GO biological process analysis was carried out for 626 UAGs and 772 DAGs mapped in DAVID. A total of 759 and 549 enriched terms were filtered for UAGs and DAGs with the threshold of P-value < 0.05. Most terms enriched with UAGs are relevant to the immune system, such as processes related to immune cells, ‘response to stimulus’ (54.8%, P = 1.10e-10) or ‘defense response’ (15.10%, P = 1.20e-10). In all, 178 UAGs are most significantly enriched in the term ‘immune system process’ (25.1%, P = 1.40e-19), whereas 292 are enriched in the term ‘cell communication’, with a fairly high number (41.1%, P = 1.10e-6) (Supplementary Fig. 1). Apart from processes related to cell migration, the data showed that cell proliferation, cell adhesion and cell differentiation are also highly enriched with UAGs, hinting that UAGs are likely to be involved in processes relevant to tumorigenesis.

As for DAGs, terms related to catabolic processes are enriched most significantly (with 31 terms in total); whereas metabolic process-relevant terms appear in most genes (with 80 terms in total). In total, 87 DAGs (10.1%, P = 5.20e-19) are most significantly enriched in the term ‘proteolysis involved in cellular protein catabolic process’, whereas 571 genes are enriched in the term ‘metabolic process’ (66.2%, P = 6.40e-7). In addition, 37 terms are relevant to mitochondria, and 19 terms are associated with cell cycle (111 genes are enriched in term ‘cell cycle’, P = 6.30e-5). Moreover, 11 terms with the keyword ‘immune’ were also observed in DAGs enriched processes, 5 terms have the keyword ‘innate immune’ and 37 DAGs are enriched in the term ‘positive regulation of innate immune response’ (3.8%, P = 4.80e-6) (Supplementary Fig. 2).

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis revealed 26 and 32 enriched terms (with the threshold of P < 0.05) for 257 UAGs and 355 DAGs (Figs. 3b, c). Notably, apart from being enriched in pathways related to inflammation and infection, several pathways enriched with UAGs are oncogenic pathways. DAGs are enriched in foundational metabolic pathways, which is similar with the result of the enrichment analysis for biological processes. In addition, there are pathways related to neurodegenerative diseases enriched with DAGs, as a relevant study mentions²⁸. Considering DAGs are downregulated across age, this result indicates that healthy old individuals may suffer from a functional decline in relevant pathways, but this result may not serve as direct evidence for the relation between healthy aging and these diseases.

UAGs have higher SNP density and are more enriched in disease genes

In the human genome, single-nucleotide polymorphism (SNP) is the most common genetic variation. To investigate the difference between SNP in age-associated gene and other genes, we calculated the SNP density (the number of SNPs in a gene divided by the gene’s length) for UAGs, DAGs and NAGs, and compared their distributions across chromosomes (Fig. 4a). As expected, DAGs bear the lowest SNP density compared with UAGs and NAGs (P = 0.0012 and 0.0011, respectively, Wilcoxon test, the median value of UAGs, DAGs and NAGs are 0.037, 0.012 and 0.025, respectively; Fig. 4b), which is consistent with the evolutionary characteristic analysis that DAGs tend to be more conserved.

**Fig. 4: SNP distribution and disease relation of age-associated genes.**

To further study whether UAGs have more disease-relevant SNPs (dSNPs) compared with other groups, we mapped the dSNPs to each gene, and calculated the fraction of the number of dSNPs, versus total number of SNPs, in each gene group (Fig. 4c). Surprisingly, no significant difference was found between the age-associated genes (data not shown), whereas NAGs have a higher dSNP fraction than UAGs and DAGs (P = 0.011 and 0.0045, respectively, Wilcoxon test, the median value for the fraction of dSNP to total SNP for UAGs, DAGs and NAGs is 0.31%, 0.29% and 0.52%, respectively; Fig. 4d). Given that the individuals contributing age-associated genes are healthy, it is not likely that they have many fatal mutations as reflected in the UAGs and DAGs.

In addition, 7291 disease genes from DisGeNET database were used to analyze the enrichment of the three gene groups to see whether there are differences between groups (Fig. 4e). Results showed that UAGs are enriched the most in disease genes, with the median proportion of 47.69% across chromosomes, compared with DAGs (38.36%, P = 0.0031, Wilcoxon test) and NAGs (35.82%, P = 1.02e-4, Wilcoxon test). No significant difference was observed between DAGs and NAGs (Fig. 4f). Disease genes in UAGs and DAGs are listed in Supplementary Table 4.

Moreover, for 495 UAGs, 93 items are significantly enriched in Genetic Association Database (GAD) diseases (with threshold P < 0.05). In all, 134 UAGs are most significantly enriched in term ‘Type 2 Diabetes|edema|rosiglitazone’ (18.9%, P = 9.30e-9), whereas ‘hypertension’, ‘multiple sclerosis’, various infectious diseases and cancers were also observed. As for the GAD disease class, 162 UAGs are significantly enriched in ‘pharmacogenomics’ (22.8%, P = 5.10e-10), also there are 172, 153, 106 and 182 UAGs enriched in terms ‘immune’, ‘cancer’, ‘infection’ and ‘cardiovascular’, respectively. DAGs are only enriched in one GAD item ‘infection’, with the proportion of 19.2% (P = 1.10e-14) (Supplementary Fig. 3).

Age-associated genes in PPI and signaling networks

Genes cooperate with each other to maintain homeostasis in the human body. To investigate the network characteristics of age-associated genes, we first estimated the centrality difference for each gene node in a non-directional protein–protein interaction (PPI) network through three metrics: degree, betweenness centrality and closeness centrality. These serve as important characteristics to evaluate whether a gene node in the network is ‘well-connected’ to other nodes. Among the three gene groups, DAG nodes have the highest degree, betweenness centrality and closeness centrality, compared with UAG nodes and NAG nodes, whereas UAG nodes have lower degrees than NAG nodes (Figs. 5a–c). Statistical results are shown in Table 2. To further explore how closely UAGs and DAGs are linked to each other in the PPI network, first we performed a permutation test, which revealed that the number of direct DAG-DAG links (DLs) and UAG-DAG links (UDLs) tend to be greater than when gene nodes are randomly distributed to the three gene groups (P = 0 and 6e-4, respectively) (Figs. 5e, f), whereas no significant results were observed for UAG-UAG links (ULs). Similar results, revealed by network distance analysis, are that DAGs have the shorter distances from each other than NAG nodes (median value of average D-D distances and D-N distances are 2.68 and 2.88, respectively), UAGs have the longest distances within the group, with no significant difference between the distance to NAGs (median value of average U-U distance and U-N distance are 3.07 and 3.06, respectively), and the UAG-DAG distances remain shorter than UAG-UAG distances (Fig. 5d and Table 3). These results suggest DAGs are of great connectivity and are cross-functioning, whereas UAGs are likely to stay in smaller groups and be more specialized in function. The fact that these two groups are fairly connected to each other in the PPI network, implies some of the UAGs and DAGs function similarly and may be attached to the same pathways.

**Fig. 5: Characteristics of age-associated genes in the PPI network.**

Table 2 Statistical results for topological characteristic analyses in the networks

Full size table

Table 3 Statistical results for network distance analyses

Full size table

In addition, we explored the characteristics of UAGs and DAGs in a cellular signaling network, in which 299 UAGs, 319 DAGs were mapped, with 5506 other genes considered as NAG nodes²⁹. Centrality and distance analysis for UAGs and DAGs, in the signaling network, revealed different results, compared to PPI network, which UAGs have higher betweenness and closeness centrality value than DAGs and NAGs (Figs. 6c, d and Table 2). Concomitantly, DAGs have higher values than NAGs. Besides, UAGs have more positive links and negative links, compared with DAGs, respectively in out-degree and in-degree (Figs. 6a, b and Table 2). As for the distance analysis, more U-U links, D-D links and U-D/D-U links were observed in the UAG and DAG node groups compared with randomized groups (P = 0, 0, 0.021, respectively, permutation test) (Supplementary Fig. 4), whereas U-U distances remain the shortest when compared with U-D, D-D and U-N. D-D and U-D distances are also shorter, in comparison with distances to other nodes (Fig. 6e and Table 3). These results showed UAGs and DAGs both have high connectivity in the signaling network; nonetheless UAGs are more clustered and of great centrality, suggesting that UAGs are more interconnected in the signaling network.

**Fig. 6: The signaling network provide more detailed characteristics for age-associated genes.**

Discussion

Aging is a process characterized by progressive loss of physiological integrity, which leads to impaired function and increased vulnerability to death³⁰. By analyzing the aging differences revealed by gene expression, important clues can be gained to better understand the process of aging at the transcriptional level. Here, we systematically characterized the characteristics of transcriptional age-associated genes in multiple aspects. To sum up, genes that increase expression with age are less conserved in evolution, more tissue specific in expression and more enriched in SNPs and disease genes. They are more active in the signaling network and are located more in the extracellular region, with clustered functions mostly involving the immune system and are likely to play crucial roles in various cancer-related pathways. Genes with a decreased level of expression with age have a longer phyletic history, tend to be stable over time, expressed globally across tissues and are expressed early in a human lifetime. These genes function mostly in basic metabolic or catabolic process that are vital for human survival, and play important roles in gene networks, especially in the PPI network.

Notably, these age-associated genes, in the up and downregulated directions, tend to be close to each other in genomic distances and distances in both PPI and signaling networks. To further study potentially aging-related processes, 323 interacting UAGs and DAGs nodes were extracted from the signaling network, visualized (Supplementary Fig. 5) and functionally annotated (results for GO BP and pathway enrichment analysis with threshold FDR < 0.05 are shown in Supplementary Tables 5 and 6, respectively) by Cytoscape³¹. Examples for the interacting age-associated gene nodes enriched in process ‘T-cell receptor signaling pathway’ and enriched in pathway ‘pathways in cancers’ are shown in Supplementary Figure 6.

Apart from this, some DAGs are overlapped in function with UAGs, other DAGs are more dispersive with functions in multiple aspects, whereas UAGs are consistently more clustered and co-functioned. These dispersive distributed DAGs are enriched in terms related to catabolic, metabolic, mitochondrion-related and innate immune processes, which is consistent with previous studies that aging is a process with mitochondrial and immune dysfunction^9,32. UAGs are mostly clustered in immune-related function and more located in the extracellular region, indicating the tendency of senescent cells to secrete pro-inflammatory cytokines, which lead to a chronic inflammatory state in healthy aging individuals. Besides, these immune-relevant UAGs and some of their interacted DAGs are also highly enriched in oncogenic signaling pathways, including Ras, nuclear factor kappa-light-chain-enhancer of activated B cells(NF-kappa B), phosphatidylinositol 3-kinase(PI3K)/AKT, Notch and P53 signaling pathways, which implies that senescent cells resemble cancer cells. This needs to be further verified by comparing aging profiles to specifically precancerous profiles. When mapping the 1020 cancer genes from OncoKB³³ to all gene groups, UAGs were observed to bear the highest cancer genes proportion (7.75%) compared with DAGs (4.17%, P = 0.0036, chi-squared test) and NAGs (4.97%, P = 0.0013, chi-squared test). DAGs have the lowest proportion and no significant difference is shown between DAGs and NAGs. The cancer genes mapped in UAGs and DAGs are listed in Supplementary Table 7. In addition, we observed 15 and 14 cancer driver genes, respectively, from a 299 pan-cancer driver gene set provided by a recent study of TCGA’s Pan-Cancer³⁴ in UAGs and DAGs, such as ERBB2 (level 1 driver gene in OncoKB³³), TNFAIP3 in UAGs, and MAPK1, PIK3CB in DAGs (driver genes mapped in UAGs and DAGs are listed in Supplementary Table 8). As these driver genes are mostly increased in expression in tumorigenesis process, the driver genes found in DAGs may serve as a protective factor in this driving process. However, a previous study showed some driving processes are not likely to co-occur in cancer³⁵, therefore the decreased level in these driver genes may accompany an increased level of other driver genes.

Aging is widely considered to be caused by accumulated cellular damage^36,37, which is led by random mutations. SNP enrichment analysis shows more variations in UAGs, which means these genes are likely to be the positions that random mutation occurred. There is a possibility that the mutation in some UAGs cause them to increase in expression, and interfere with other interactive DAGs and UAGs. These changes may contribute to part of the aging transcriptional landscape. Considering DAGs experience less selection with lower evolutionary rates, mutations in DAGs are likely to be fatal, which would cause more disruptive outcomes that would be hard to observe in healthy individuals. Besides, as previous studies noted^13,38,39, the decreased expression in DAGs may be mainly caused by age-associated epigenetic drift. The analyses in evolutionary characteristics, expression pattern and connectivity in the PPI network, showed consistent results with previous studies that UAGs are younger genes than DAGs, as studies showed that younger genes evolve more rapidly⁴⁰, are more likely to present different temporal and spatial expression patterns⁴¹ and have fewer interactions in the PPI network⁴². Although UAGs and DAGs are both of high connectivity in the signaling network, UAGs remain more interactive, which suggests aging may accompany more accumulated in signaling crucial proteins.

Moreover, when performing the functional enrichment analysis, it was of note that the number of enriched terms that have a positive and a negative regulating function are about the same. This suggests those differentially expressed genes in healthy old individuals are not likely to cause imbalances that lead to conspicuously disruptive outcomes. As for an effort in precise medicine, 82 and 88 FDA-approved target genes from DrugBank⁴³, mapped in UAGs and DAGs (Supplementary Table 9), are provided, respectively, which hopefully can help future research in different drug use for the young and old. Overall, our findings provide multiple biological implications for further study in healthy aging.

Materials and methods

The age-associated gene set

The age-associated gene set was extracted from a dataset of genes expressed differentially across age, as given in the GTEx project. These genes were screened out using a linear mixed model where sex, race and tissue were controlled to avoid the biases²⁴. Our study focused on protein-coding genes, thus noncoding RNA genes were excluded from the gene set and the remaining genes were then categorized into two groups according to the regression coefficient. Genes with positive and negative coefficients were described as UAGs and DAGs, respectively. The original Ensembl IDs in the dataset were transformed into Entrez gene IDs and official gene symbols for further analysis. Apart from the age-associated genes, the rest of the protein-coding genes were regarded as NAGs.

Chromosomal distribution and genomic distance analyses

Chromosomal distribution of age-associated genes was revealed by calculating and comparing the proportion of both UAGs and DAGs in the totality of genes on each chromosome. The genomic distances of every pair of UAGs/DAGs (UDs/DDs), as well as the distances between a UAG and a DAG (UDDs) on the same chromosome, were calculated. The Wilcoxon rank-sum test was used to compare the distribution of UDs, DDs and UDDs across chromosomes.

Evolutionary and expression characteristic analyses

The average expression level across tissues and tissue specificity of each gene were calculated based on a gene atlas database from Su et al.⁴⁴. The dN/dS ratio dataset of each human–mouse homolog was derived from the Ensembl database (release 83) to illustrate gene evolutionary rate. The homologous gene number dataset was obtained from the Homologene database⁴⁵ (build 68), the phyletic evolution and earliest expression stage datasets were obtained from Online Gene Essentiality database⁴⁶ and the essential gene dataset was obtained from the DEG database (version 10.6)²⁷. The Wilcoxon test was performed to statistically compare the evolutionary rate, expression level, expression specificity and homologous gene number of UAGs, DAGs and NAGs, whereas a chi-squared test was used to compare the proportion of earliest expression stage, phyletic evolution and the essential genes between the three gene sets.

Functional enrichment analysis for age-associated genes

We calculated the number of genes in four terms of the GO: NU, CY, ME and ER, and compared the numbers between UAGs, DAGs and NAGs by performing chi-squared tests. Meanwhile, functional enrichment analysis of GO biological process and KEGG pathway annotations were both performed using the DAVID Web server⁴⁷.

Analysis of SNP densities

A dataset of SNPs in human protein-coding genes (SNPs and indels, excluding flagged variants) (GRCh38.p2) with genome coordinates was derived from the Ensembl database⁴⁸. SNP density of each gene was defined as the total number of mapped SNPs, divided by gene length. SNPs were mapped to all protein-coding genes, and the average SNP densities on each chromosome of UAGs, DAGs and NAGs were compared using the Wilcoxon test.

The enrichment analysis of disease-related SNP and disease genes

The disease-related SNPs (dSNPs) were obtained from the ClinVar database⁴⁹ and Human Gene Mutation Database;⁵⁰ data without a dbSNP ID or labeled as ‘protective’, ‘(likely) benign’, ‘uncertain significance’, ‘conflicting data from submitters’, ‘other’ and ‘not provided’ were excluded. dSNPs were mapped to each chromosome and the fractions of dSNPs to total SNPs in UAGs, DAGs and NAGs were calculated and compared using the Wilcoxon test. As for disease gene analysis, the DAVID Web server⁴⁷ was used to perform functional enrichment analysis for terms of specific disease and disease class in the GAD, and the curated gene–disease association dataset downloaded from the DisGeNET database⁵¹ was also used to compare the fraction of disease genes for UAGs, DAGs and NAGs across chromosomes by performing the Wilcoxon test.

Network analysis

The human PPI network was downloaded from the BioGRID database (build 3.4.140) with the deletion of links that include non-human protein⁵². The human cellular signaling network was taken from our previous study²⁹. The Wilcoxon test was performed to investigate the degree, betweenness centrality and closeness centrality differences for the UAG, DAG and NAG nodes in the PPI network. For the degree analysis in the signaling network, the proportion of positive links to the sum of positive links and negative links in UAGs and DAGs was compared for both in-degree and out-degree by performing chi-squared tests. As for network distance analysis, permutation tests were performed to compare the numbers of UAG-UAG links (ULs), DAG-DAG links (DLs) and UAG-DAG/DAG-UAG links (UDLs) in our identified networks with the numbers of those three types of links for 10,000 randomized node groups. The P-value was calculated as the frequency of times when link number in randomized groups was greater than that in identified groups. In addition, the average length of the shortest paths for UAG and DAG nodes in their own group, between each group and with the NAG nodes were also compared by performing the Wilcoxon test. The degree, betweenness centrality, closeness centrality and length of the shortest path values were calculated by using the python package NetworkX⁵³. For the signaling network, distances were counted from both directions in each distance group. Network visualization and functional enrichment analysis for interacting age-associated gene nodes in the signaling network were carried out by Cytoscape³¹.

Change history

10 July 2019
Due to a technical error, content intended for publication in Volume 4 (2018) published in Volume 5 (2019). The content has been moved into the correct volume, and the citation information was updated accordingly.

References

Niccoli, T., & Partridge, L. Ageing as a risk factor for disease. Curr. Biol. 22, R741–R752 (2012).
Article CAS PubMed Google Scholar
Finkel, T., Serrano, M. & Blasco, M. A. The common biology of cancer and ageing. Nature 448, 767–774 (2007).
Article CAS PubMed Google Scholar
Jousilahti, P., Vartiainen, E., Tuomilehto, J. & Puska, P. Sex, age, cardiovascular risk factors, and coronary heart disease. Circulation 99, 1165 (1999).
Article CAS PubMed Google Scholar
North, B. J. & Sinclair, D. A. The intersection between aging and cardiovascular disease. Circ. Res. 110, 1097–1108 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lindsay, J. et al. Risk factors for Alzheimer’s disease: a prospective analysis from the Canadian Study of Health and Aging. Am. J. Epidemiol. 156, 445–453 (2002).
Article PubMed Google Scholar
Phillipson, O. T. Management of the aging risk factor for Parkinson’s disease. Neurobiol. Aging 35, 847–857 (2014).
Article PubMed Google Scholar
Anderson, R. M. & Weindruch, R. The caloric restriction paradigm: implications for healthy human aging. Am. J. Human. Biol. 24, 101–106 (2012).
Article Google Scholar
Blackburn, E. H., Epel, E. S. & Lin, J. Human telomere biology: a contributory and interactive factor in aging, disease risks, and protection. Science 350, 1193 (2015).
Article CAS PubMed Google Scholar
Theurey, P. & Pizzo, P. The aging mitochondria. Genes 9, 22 (2018).
Hoeijmakers, J. H. J. DNA damage, aging, and cancer. New Engl. J. Med. 361, 1475–1485 (2009).
Article CAS PubMed Google Scholar
Bandaranayake, T., & Shaw, A. C. Host resistance and immune aging. Clin. Geriatr. Med. 32, 415–432 (2016).
Article PubMed PubMed Central Google Scholar
Jeck, W. R., Siebold, A. P., & Sharpless, N. E. Review: a meta-analysis of GWAS and age-associated diseases. Aging Cell 11, 727–731 (2012).
Article CAS PubMed Google Scholar
Sen, P., Shah, P. P., Nativio, R., & Berger, S. L. Epigenetic mechanisms of longevity and aging. Cell 166, 822–839 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kenyon, C. J. The genetics of ageing. Nature 464, 504–512 (2010).
Article CAS PubMed Google Scholar
Welle, S. et al. Skeletal muscle gene expression profiles in 20-29 year old and 65-71 year old women. Exp. Gerontol. 39, 369–377 (2004).
Article CAS PubMed Google Scholar
Park, S. K., & Prolla, T. A. Gene expression profiling studies of aging in cardiac and skeletal muscles. Cardiovasc. Res. 66, 205–212 (2005).
Article CAS PubMed Google Scholar
Harries, L. W. et al. Human aging is characterized by focused changes in gene expression and deregulation of alternative splicing. Aging Cell 10, 868–878 (2011).
Article CAS PubMed Google Scholar
Peters, M. J. et al. The transcriptional landscape of age in human peripheral blood. Nat. Commun. 6, 8570 (2015).
Article CAS PubMed Google Scholar
Glass, D. et al. Gene expression changes with age in skin, adipose tissue, blood and brain. Genome. Biol. 14, R75 (2013).
Article PubMed PubMed Central Google Scholar
Somel, M. et al. MicroRNA, mRNA, and protein expression link development and aging in human and macaque brain. Genome Res. 20, 1207–1218 (2010).
Article CAS PubMed PubMed Central Google Scholar
Tan, Q. et al. Genetic dissection of gene expression observed in whole blood samples of elderly Danish twins. Hum. Genet. 117, 267–274 (2005).
Article PubMed Google Scholar
Berchtold, N. C. et al. Gene expression changes in the course of normal brain aging are sexually dimorphic. Proc. Natl. Acad. Sci. USA 105, 15605–15610 (2008).
Article CAS PubMed PubMed Central Google Scholar
de Magalhaes, J. P., Curado, J. & Church, G. M. Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics 25, 875–881 (2009).
Article PubMed PubMed Central Google Scholar
Mele, M. et al. Human genomics. The human transcriptome across tissues and individuals. Science 348, 660–665 (2015).
Article CAS PubMed PubMed Central Google Scholar
Consortium, G. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article Google Scholar
Cohen, B. A., Mitra, R. D., Hughes, J. D. & Church, G. M. A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression. Nat. Genet. 26, 183 (2000).
Article CAS PubMed Google Scholar
Zhang, R., Ou, H. Y. & Zhang, C. T. DEG: a database of essential genes. Nucleic Acids Res. 32, D271–D272 (2004).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Synchronized age-related gene expression changes across multiple tissues in human and the link to complex diseases. Sci. Rep. 5, 15145 (2015).
Article PubMed PubMed Central Google Scholar
Cui, Q. et al. A map of human cancer signaling. Mol. Syst. Biol. 3, 152 (2007).
Article PubMed PubMed Central Google Scholar
Lopez-Otin, C., Blasco, M. A., Partridge, L., Serrano, M., & Kroemer, G. The hallmarks of aging.Cell 153, 1194–1217 (2013).
Article CAS PubMed PubMed Central Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Shaw, A. C., Goldstein, D. R. & Montgomery, R. R. Age-dependent dysregulation of innate immunity. Nat. Rev. Immunol. 13, 875–887 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chakravarty, D. et al. OncoKB: a precision oncology knowledge base. JCO Precision Oncol. 1, 1–16, https://doi.org/10.1200/PO.17.00011 (2017).
Article Google Scholar
Bailey, M. H. et al. Comprehensive characterization of cancer driver genes and mutations. Cell 173, 371–385 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sanchez-Vega, F. et al. Oncogenic signaling pathways in the cancer genome atlas. Cell 173, 321–337 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gems, D. & Partridge, L. Genetics of longevity in model organisms: debates and paradigm shifts. Annu. Rev. Physiol. 75, 621–644 (2013).
Article CAS PubMed Google Scholar
Vijg, J. & Campisi, J. Puzzles, promises and a cure for ageing. Nature 454, 1065–1071 (2008).
Article CAS PubMed PubMed Central Google Scholar
Teschendorff, A. E., West, J. & Beck, S. Age-associated epigenetic drift: implications, and a case of epigenetic thrift? Hum. Mol. Genet. 22, R7–R15 (2013).
Article CAS PubMed PubMed Central Google Scholar
Booth, L. N., & Brunet, A. The aging epigenome. Mol. Cell 62, 728–744 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wolf, Y. I., Novichkov, P. S., Karev, G. P., Koonin, E. V. & Lipman, D. J. The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages. Proc. Natl. Acad. Sci. USA 106, 7273–7280 (2009).
Article CAS PubMed PubMed Central Google Scholar
Popadin, KonstantinY. et al. Gene age predicts the strength of purifying selection acting on gene expression variation in humans. Am. J. Hum. Genet. 95, 660–674 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhang, W., Landback, P., Gschwend, A. R., Shen, B. & Long, M. New genes drive the evolution of gene interaction networks in the human and mouse genomes. Genome Biol. 16, 202 (2015).
Article PubMed PubMed Central Google Scholar
Wishart, D. S. et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018).
Article CAS PubMed Google Scholar
Su, A. I. et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl. Acad. Sci. USA 101, 6062–6067 (2004).
Article CAS PubMed PubMed Central Google Scholar
NCBI. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 44, D7–D19 (2016).
Article Google Scholar
Chen, W. H., Minguez, P., Lercher, M. J. & Bork, P. OGEE: an online gene essentiality database. Nucleic Acids Res. 40, D901–D906 (2012).
Article CAS PubMed Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Kinsella, R. J. et al. Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database 2011, bar030, https://doi.org/10.1093/database/bar030 (2011).
Article CAS PubMed PubMed Central Google Scholar
Landrum, M. J. et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 44, D862–D868 (2016).
Article CAS PubMed Google Scholar
Stenson, P. D. et al. The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine. Hum. Genet. 133, 1–9 (2014).
Article CAS PubMed Google Scholar
Pinero, J. et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, bav028 (2015).
Article PubMed PubMed Central Google Scholar
Chatr-aryamontri, A. et al. The BioGRID interaction database: 2017 update. Nucleic Acids Res. 45, D369–D379 (2017).
Article CAS PubMed Google Scholar
Hagberg, A., Swart, P. & S Chult, D. Exploring Network Structure, Dynamics, and Function Using NetworkX. (Los Alamos National Lab (LANL), Los Alamos, NM, USA 2008).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work has been supported by the grants from the Special Project on Precision Medicine under the National Key R&D Program (2016YFC0903003), the Natural Science Foundation of China (81670462).

Author information

Authors and Affiliations

Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Road, 100191, Beijing, China
Kaiwen Jia, Chunmei Cui, Yuanxu Gao, Yuan Zhou & Qinghua Cui
Center of Bioinformatics, Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, 610054, Chengdu, China
Qinghua Cui

Authors

Kaiwen Jia
View author publications
You can also search for this author in PubMed Google Scholar
Chunmei Cui
View author publications
You can also search for this author in PubMed Google Scholar
Yuanxu Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Qinghua Cui
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qinghua Cui.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Edited by N. Barlev

Electronic supplementary material

Supplementary Figures 1–6

Supplementary Tables 1–9

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jia, K., Cui, C., Gao, Y. et al. An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx). Cell Death Discov. 4, 91 (2018). https://doi.org/10.1038/s41420-018-0093-y

Download citation

Received: 30 May 2018
Revised: 26 June 2018
Accepted: 26 July 2018
Published: 20 August 2018
DOI: https://doi.org/10.1038/s41420-018-0093-y
Springer Nature Limited

This article is cited by

Cognitive and immunological effects of yoga compared to memory training in older women at risk for alzheimer’s disease
- Adrienne Grzenda
- Prabha Siddarth
- Helen Lavretsky
Translational Psychiatry (2024)
OXR1 maintains the retromer to delay brain aging under dietary restriction
- Kenneth A. Wilson
- Sudipta Bar
- Pankaj Kapahi
Nature Communications (2024)
Tissue-specific impacts of aging and genetics on gene expression patterns in humans
- Ryo Yamamoto
- Ryan Chung
- Peter H. Sudmant
Nature Communications (2022)
Functional conservation in genes and pathways linking ageing and immunity
- Daniel K. Fabian
- Matías Fuentealba
- Janet M. Thornton
Immunity & Ageing (2021)
Molecular evolution and the decline of purifying selection with age
- Changde Cheng
- Mark Kirkpatrick
Nature Communications (2021)

An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx)

Abstract

Similar content being viewed by others

Introduction

Result

The human transcriptional age-associated genes from GTEx

Distribution of the age-associated genes across chromosomes

Evolutionary characteristics and expression profile of age-associated genes

Functional annotation for age-associated genes

UAGs have higher SNP density and are more enriched in disease genes

Age-associated genes in PPI and signaling networks

Discussion

Materials and methods

The age-associated gene set

Chromosomal distribution and genomic distance analyses

Evolutionary and expression characteristic analyses

Functional enrichment analysis for age-associated genes

Analysis of SNP densities

The enrichment analysis of disease-related SNP and disease genes

Network analysis

Change history

10 July 2019

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation