Introduction

According to the World Health Organization (WHO), the prevalence of high Body Mass Index (BMI) is rising rapidly worldwide1,2. By 2050, overweight and obesity (excessive weight) are expected to affect 50–60% of the adult population3. In tandem with this increase, the proportion of the population above age 65 is growing and, consequently, so is the prevalence of individuals who are both obese and elderly4. These conditions are associated with a higher risk of cardiovascular diseases, type 2 diabetes (T2D), and many other chronic conditions5.

Excessive weight is a multifactorial condition characterized by abnormal and/or excessive accumulation of fat cells in specific depots of the body6,7. It is genetically complex, resulting from the interaction between environmental factors and an at-risk genetic profile8. The genetic component of obesity accounts for 40% to 50% of the variability in body weight status, but it varies across BMI classes, being lower among individuals with normal weight (about 30%) and higher in individuals with mild to severe obesity (60–80%). Moreover, two-thirds of the heritability of BMI can be attributed to common DNA variants. According to Bouchard, obesity-promoting alleles exert minimal effects in individuals with normal weight but may have higher penetrance in individuals prone to obesity9. In this context, several genes and pathways have been identified in animal studies, as well as in linkage and association studies10.

Notch signaling is a conserved pathway that regulates cell proliferation, differentiation, self-renewal potential, apoptosis, inflammatory response, and cell-fate decisions11,12. This complex pathway involves four Notch receptors encoded by the NOTCH1-4 genes, and five ligands of the Jagged/Delta-like families (JAGGED1/2, DLL1/3/4)13. In the last decade, NOTCH1 has been highlighted as a key regulator of metabolism and a player in adipogenesis, browning of adipocytes, and resistance to high-fat diet-induced obesity, both in vitro and in mouse experiments14,15,16. Recently, Yamaguchi et al.17 incidentally found that Notch1 heterozygous-deficient (N1+/−) mice gain weight easily.

The NOTCH1 gene (9q34) comprises 34 exons and is expressed in stem cells and most adult tissues. Cryptic changes in this cytogenetic location cause syndromic obesity in childhood18,19,20. NOTCH1 variants have been identified in cardiovascular malformations21, health consequences of early-life stress conditions22, and cancer23. However, to the best of our knowledge, the association between NOTCH1 variants and excessive weight has not yet been explored. Considering the above experimental evidence of the involvement of NOTCH1 in metabolism and adipogenesis, we hypothesize that genetic variations in this gene are associated with the excessive weight phenotype. Therefore, our main goal is to verify whether genetic variations in NOTCH1 are associated with excessive weight and related traits.

Methods

Study cohort

All participants were unrelated and were selected from the interdisciplinary project named SABE (Saúde, Bem Estar e Envelhecimento; Health, Well-Being, and Aging), which was coordinated by the Pan American Health Organization (PAHO/WHO) as a multicenter survey of the health and well-being of older people in seven urban centers in the Caribbean and Latin America, including São Paulo, Brazil. The SABE project was approved by the Institutional Review Board of the University of São Paulo School of Public Health and CEP/CONEP (Brazilian local and national Ethical Committee Boards, CAAE: 47683115.4.0000.5421, Review: 3.600.782). All methods were performed in accordance with the Declaration of Helsinki. The collection contains individual-level genomic data sets from all individuals who agreed to participate and signed the written informed consent form. Since then, this cohort has been analyzed in genetic studies24.

Data collection

Data collection was carried out by trained staff and has been described elsewhere25. A standardized questionnaire (C10) focusing on medical history, lifestyle, and sociodemographic characteristics was administered to all individuals. The C10, proposed by the PAHO, was translated and adapted for use in the Brazilian cohort. We collected blood samples from all participants by venipuncture for biochemical and genomic analysis.

The following demographic and health variables were recorded: gender, age, systolic blood pressure (SBP) (mmHg), diastolic blood pressure (DBP) (mmHg), HDL cholesterol (mg/dL), LDL cholesterol (mg/dL), total cholesterol (TC) (mg/dL), fasting triglyceride (TG) (mg/dL), fasting plasma glucose (FPG) (mg/dL), glycated hemoglobin (HbA1c) (%), high-sensitivity C-reactive protein (hsCRP) (mg/L), BMI (kg/m²), and waist circumference (WC) (cm).

T2D was identified by a positive answer to the question “Has a doctor or nurse ever told you that you have diabetes mellitus, that is, high blood sugar levels?”, by the use of medication, or by FPG > 126 mg/dL or HbA1c > 6.5%. Hypertension was identified by the question “Has a doctor or nurse ever told you that you have hypertension, that is, high blood pressure?”. For both questions, the response alternatives were yes, no, do not know, and did not answer; the last two were treated as missing. Regarding smoking, we used the self-reported information on current smoking (yes/no) and on ex-smokers (those who had stopped smoking five or more years earlier).

Weight and height were measured using a portable scale (Seca, Germany) with a capacity of 150 kg (sensitivity of 1 kg) and an anthropometer (Harpenden, England), respectively. WC was measured midway between the lower margin of the last palpable rib and the top of the iliac crest using an inelastic measuring tape, to the nearest 0.1 cm after inhalation and exhalation. BMI was calculated as weight in kilograms divided by the square of height in meters (kg/m²). Measurement techniques were standardized according to the literature; measurements were performed in triplicate, and the mean values were used for the analysis. Using the WHO classification26, we split the individuals by BMI into an excessive weight group (≥ 25.0 kg/m², including obesity) and a normal weight group (≤ 24.9 kg/m²). We excluded individuals with incomplete clinical or genetic data.
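The BMI calculation and the two-group split described above can be sketched as follows. This is a minimal illustration rather than the authors' code: the function names and the example measurements are hypothetical, and rounding BMI to one decimal place before applying the cut-offs is an assumption made so that the ≥ 25.0 and ≤ 24.9 kg/m² groups leave no gap.

```python
from statistics import mean


def bmi(weight_kg, height_m):
    """Return BMI (kg/m^2): weight in kilograms divided by height in meters squared."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


def weight_group(bmi_value):
    """Classify a BMI value into the two groups used in the text.

    Assumption: BMI is rounded to one decimal place first, so that the
    '>= 25.0' and '<= 24.9' cut-offs cover every possible value.
    """
    return "excessive weight" if round(bmi_value, 1) >= 25.0 else "normal weight"


# Hypothetical triplicate measurements for one participant; the mean of the
# three readings is used, as described in the text.
weights_kg = [80.0, 80.0, 81.0]
heights_m = [1.75, 1.75, 1.75]
participant_bmi = bmi(mean(weights_kg), mean(heights_m))
print(round(participant_bmi, 1), weight_group(participant_bmi))
```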

Next-generation sequencing data and tag SNPs selection

DNA extraction, whole-genome sequencing, and quality control of variants were performed as described elsewhere25. Genome sequence data were deposited in the public archive ABraOM (Arquivo Brasileiro Online de Mutações; Brazilian Online Mutations Archive, http://abraom.ib.usp.br). We used the individual European, African, Native American, and East Asian ancestries inferred by Naslavsky et al.25 as covariables in our models. In summary, Naslavsky et al. sequenced a total of N = 1200 individuals and conducted kinship analyses, which led to the identification and exclusion of 29 closely related individuals. This process resulted in a final cohort comprising 1171 unrelated individuals. The majority of the cohort demonstrates admixture, with average global ancestry proportions of 72.6 ± 26.3% European, 17.8 ± 20.9% African, 6.7 ± 6.6% Native American, and 2.8 ± 16.2% East Asian. These proportions show partial correlation with self-declared race/ethnicities. Here, we calculated Fst for rs9411207 using the formula Fst = (Ht − Hs)/Ht, where Ht is the expected heterozygosity in the total population and Hs is the mean expected heterozygosity within subpopulations, considering the subpopulations White, Black, Mixed, Asian, and Native-American. The result was Fst = 0.1057, indicating moderate differentiation between populations.
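As a worked illustration of the formula Fst = (Ht − Hs)/Ht, the sketch below computes Fst for a single biallelic SNP from subpopulation allele frequencies. The function name, the sample-size weighting, and the example frequencies are assumptions made for illustration; they are not the cohort's data, and the source does not state how the authors weighted the subpopulations.

```python
def fst(subpops):
    """Fst = (Ht - Hs) / Ht for one biallelic SNP.

    subpops: list of (allele_frequency, sample_size) pairs, one per subpopulation.
    Hs is the sample-size-weighted mean of the within-subpopulation expected
    heterozygosity 2*p*(1-p); Ht is 2*p_bar*(1-p_bar), where p_bar is the
    pooled allele frequency.
    """
    total_n = sum(n for _, n in subpops)
    if total_n <= 0:
        raise ValueError("total sample size must be positive")
    p_bar = sum(p * n for p, n in subpops) / total_n
    h_t = 2 * p_bar * (1 - p_bar)
    if h_t == 0:
        return 0.0  # monomorphic SNP: no differentiation to measure
    h_s = sum(2 * p * (1 - p) * n for p, n in subpops) / total_n
    return (h_t - h_s) / h_t


# Hypothetical allele frequencies and sample sizes for five subpopulations.
example = [(0.35, 600), (0.45, 100), (0.40, 250), (0.20, 50), (0.30, 24)]
print(round(fst(example), 4))
```

With identical allele frequencies in every subpopulation the function returns 0, and the value grows as the frequencies diverge.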

We selected SNPs located between the start and end positions of NOTCH1, plus 50 kb on both sides, spanning Chr9:136,440,101 to 136,599,978 of the human reference sequence (GRCh38:NC_000009.12). After excluding INDELs, we first retained SNPs with a Minor Allele Frequency (MAF) of at least 0.01 that were in Hardy–Weinberg equilibrium (HWE; p > 0.05). SNPs were then selected with a tag SNP approach (pairwise tagging algorithm with a threshold of r² ≥ 0.8) using HapMap genotype data (release 28) in Haploview 4.2 software27. Tag SNPs were included in the association analysis. The allele and genotype frequencies of associated SNPs were compared with those of the Allele Frequency Aggregator (ALFA) project from the National Center for Biotechnology Information (NCBI) database28.
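The two filters described above (MAF ≥ 0.01 and HWE p > 0.05) can be sketched from genotype counts. This is an illustrative sketch under stated assumptions: it uses a 1-degree-of-freedom chi-square goodness-of-fit test for HWE, whereas the source does not say which HWE test was applied (an exact test is also common). All function names and genotype counts are hypothetical.

```python
import math


def minor_allele_frequency(n_aa, n_ab, n_bb):
    """MAF from genotype counts (AA, AB, BB)."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)
    return min(p, 1 - p)


def hwe_chi2_pvalue(n_aa, n_ab, n_bb):
    """P-value of a 1-df chi-square goodness-of-fit test for HWE.

    Assumption: chi-square test; the test used in the source is not specified.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    if min(expected) == 0:
        return 1.0  # monomorphic SNP: HWE cannot be rejected
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 1 df.
    return math.erfc(math.sqrt(chi2 / 2))


def passes_filters(n_aa, n_ab, n_bb, maf_min=0.01, hwe_alpha=0.05):
    """Keep a SNP if MAF >= maf_min and the HWE p-value exceeds hwe_alpha."""
    return (minor_allele_frequency(n_aa, n_ab, n_bb) >= maf_min
            and hwe_chi2_pvalue(n_aa, n_ab, n_bb) > hwe_alpha)


# Hypothetical genotype counts for three SNPs.
for counts in [(420, 470, 134), (500, 24, 500), (1020, 4, 0)]:
    print(counts, passes_filters(*counts))
```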

Statistical analysis

Individuals' characteristics were compared using descriptive statistics. Categorical variables are presented as frequencies and percentages, N(%), and continuous variables are expressed as median and extreme values (minimum and maximum) for non-parametric data. The one-sample Kolmogorov–Smirnov test was used to test the normality of the distribution. Comparisons between excessive-weight and normal-weight individuals were carried out using the chi-square test and the non-parametric Mann–Whitney test.

The SNP-based association analysis was performed using the R package “SNPassoc” under different genetic models (codominant, dominant, recessive, overdominant, and log-additive)29. The analysis was adjusted for age, gender, and ancestry (Model 1), and for all confounding variables (Model 2). Odds ratios (OR) and 95% confidence intervals (CI) were calculated by multinomial logistic regression. Haplotype blocks were defined based on Gabriel et al.30, and linkage disequilibrium (LD) plots were generated using Haploview 4.2 (ref. 25). Haplotype frequencies were estimated by the Expectation-Maximization (EM) algorithm using the R statistical package “haplo.stats”31.
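The five genetic models named above differ only in how a genotype is converted into regression predictors. The sketch below shows the standard coding for a biallelic SNP, given the number of copies of the risk allele. It is an illustration of the general technique, not a reproduction of SNPassoc internals; the function names are hypothetical.

```python
def encode_genotype(risk_allele_count, model):
    """Code a genotype (0, 1, or 2 copies of the risk allele) under a genetic model.

    Returns a tuple of predictors for a logistic regression:
      codominant   -> two indicators (heterozygote, risk homozygote)
      dominant     -> 1 if at least one risk allele
      recessive    -> 1 if two risk alleles
      overdominant -> 1 if heterozygote
      log-additive -> number of risk alleles (per-allele effect)
    """
    c = risk_allele_count
    if c not in (0, 1, 2):
        raise ValueError("risk_allele_count must be 0, 1, or 2")
    if model == "codominant":
        return (int(c == 1), int(c == 2))
    if model == "dominant":
        return (int(c >= 1),)
    if model == "recessive":
        return (int(c == 2),)
    if model == "overdominant":
        return (int(c == 1),)
    if model == "log-additive":
        return (c,)
    raise ValueError(f"unknown model: {model}")


def odds_ratio_for_copies(per_allele_or, copies):
    """Under the log-additive model, each extra risk allele multiplies the odds."""
    return per_allele_or ** copies


for model in ("codominant", "dominant", "recessive", "overdominant", "log-additive"):
    print(model, [encode_genotype(c, model) for c in (0, 1, 2)])
```

One consequence of the log-additive coding is that a per-allele OR applies multiplicatively: `odds_ratio_for_copies(1.5, 2)` gives the implied OR for carrying two risk alleles relative to none.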

We adopted a significance level of p < 0.05 and applied the Bonferroni correction for multiple comparisons where necessary (p = 0.05/N, where N is the number of tag SNPs or haplotypes tested). Statistical analysis was performed using SPSS version 27.0 (IBM, Armonk, NY, USA) and the computing environment R version 4.0.0 (R Development Core Team, 2020).
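The Bonferroni threshold is simple arithmetic, sketched below with the 161 tag SNPs reported in the Results as the number of tests. The function name is hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under the Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


threshold = bonferroni_threshold(0.05, 161)  # 161 tag SNPs tested
print(f"{threshold:.5f}")

# A p-value survives the correction only if it falls below the threshold.
for p_value in (0.0002, 0.0005):
    print(p_value, p_value < threshold)
```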

A power analysis was performed using G*Power version 3.1.9.2 to verify the association of rs9411207 with excessive weight. With a sample size of 1,024, the following parameters were considered: a significance level of 0.05, an OR of 1.5, a statistical power of 90%, and an expected squared coefficient of multiple correlation (R²) of 0.25 (moderate association).

In silico functional analysis

Functional annotation of the associated SNPs was obtained from the functional prediction websites rVarBase32, HaploReg33, RegulomeDB34, and the GTEx (Genotype-Tissue Expression) portal35. The rVarBase database (version 2.0 of rSNPBase) was used to describe the regulatory features of the SNPs in terms of chromatin states, overlapping regulatory elements, and potential target genes32. HaploReg v4.133 and RegulomeDB34 were used to annotate the SNPs by systematic mining of comparative, regulatory, and epigenomic data, based on the Encyclopedia of DNA Elements (ENCODE) project. The RegulomeDB score was used to identify and compare potential regulatory variants; lower scores are associated with a wider range of data supporting functional importance. The GTEx portal35 was used to identify significant expression quantitative trait loci (eQTL) for the associated SNPs. We also searched for trait associations in the GWAS Catalog.

Results

Study cohort

Of the individuals with phenotype and genotype data (N = 1024), 280 had normal weight (27.34%) and 744 were cases (72.66%), including 424 with overweight (41.4%) and 320 with obesity (31.25%). The median age was 71.31 years (range 59 to 99 years) and 64.28% were women. The clinical, anthropometric, and socio-demographic characteristics of the individuals, as well as the frequencies of European, African, Native American, and East Asian ancestries, are shown in Table 1. Compared with normal-weight individuals, cases showed higher DBP and a higher prevalence of hypertension, as well as higher FPG, HbA1c, hsCRP, and TG levels (p < 0.001). Decreased levels of HDL cholesterol were observed in cases (p < 0.001). Excessive weight was less frequent in current smokers (p < 0.001) and ex-smokers (p < 0.05).

Table 1 Sociodemographic, genetic ancestry, anthropometric, and clinical characterization of the SABE cohort.

Association of individual SNPs with excessive weight

Of a total of 3816 SNPs in the NOTCH1 region and borders (SEC16A, C9orf163, and NALT1), 566 were common variants. Of these, 453 were in HWE and 161 tag SNPs were included in the association analysis (Supplemental Table 1).

We observed an association between the SNP rs9411207 and excessive weight, after adjustment for age, gender, and ancestry (Model 1). Using Akaike’s Information Criterion (AIC), the log-additive model for this SNP best fit the data (OR 1.49; 95% CI 1.21–1.85; p = 0.0002), surviving the Bonferroni correction [p ≤ 0.0003 (0.05/161)]. We also observed, in the log-additive model, a nominal association between excessive weight and the SNPs rs2229971 (OR 1.46; 95% CI 1.17–1.81; p = 0.0005), rs11574891 (OR 1.43; 95% CI 1.15–1.77; p = 0.0012), rs3125005 (OR 1.36; 95% CI 1.10–1.67; p = 0.0038) and rs3812604 (OR 1.35; 95% CI 1.10–1.65; p = 0.0041) (Supplemental Table 2).
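As a rough consistency check on the figures reported above, a Wald-type p-value can be approximated from an OR and its 95% CI. The sketch below is a back-of-the-envelope approximation under the assumption that the CI is symmetric on the log-odds scale; the reported p-values may come from a different test (for example a likelihood-ratio test), so only approximate agreement should be expected. The function name is hypothetical.

```python
import math


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Approximate the standard error, z statistic, and two-sided p-value.

    Assumption: the 95% CI is exp(beta +/- 1.96 * SE), so the SE can be
    recovered from the width of the interval on the log scale.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p_two_sided


# Values reported for rs9411207 under the log-additive model (Model 1).
se, z, p = wald_from_or_ci(1.49, 1.21, 1.85)
print(f"SE = {se:.3f}, z = {z:.2f}, p = {p:.5f}")
```

For rs9411207 the approximation lands below the Bonferroni threshold of 0.05/161, in line with the reported result.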

The SNPs rs9411207, rs11574891, and rs3125005 are located in intron 13, and rs3812604 in intron 19. The rs2229971 (c.2265T>G; p.N775N) is a synonymous variant located in exon 14. All of these SNPs lie within the NOTCH1 gene, in the region corresponding to the epidermal growth factor (EGF)-like domain, as shown in Fig. 1.

Figure 1

Schematic diagram of the human NOTCH1 gene and its protein domain organization. (A) NOTCH1 gene and the positions of the investigated SNPs in our association study. Horizontal black arrows indicate the direction of transcription. The white bar represents the exonic region according to the National Center for Biotechnology Information (NCBI) and the University of California at Santa Cruz Browser (UCSC). (B) Domain organization of human NOTCH1. The NRR consists of the LNR and HD domains. HD N- and C-terminal portions of the heterodimerization domain, ANK ankyrin repeats, EGF repeats epidermal growth factor-like repeats, LNR LIN-12/Notch repeats, PEST proline/glutamic acid/serine/threonine rich domain, RAM RBP-Jκ-associated module, TM transmembrane domain, TAD transactivation domain, ICN intracellular NOTCH1, NEC N-terminal extracellular, NTM C-terminal transmembrane.

The genotypic distributions in individuals with normal weight and in those with excessive weight are in HWE. The worldwide frequency of the rs9411207 T allele is around 0.35 (ALFA project), and in our cohort it varied from 0.36 to 0.43 among the groups. The other variants analyzed also showed frequencies similar to those of the ALFA project (Supplemental Table 3). Genotype distribution for rs9411207 differed according to ancestry, LDL, and FPG. Among the marginally associated SNPs, genotype distribution also differed according to these variables (Supplemental Table 4). Concerning anthropometry, the GG genotype of rs3125005 and rs3812604 and the AA genotype of rs11574891 showed higher values for BMI (p < 0.0001, p < 0.0001, and p = 0.0186, respectively). The AA genotype of rs11574891 was also associated with WC (p = 0.0395). Concerning metabolic variables, the rs3125005 genotypes differed in LDL cholesterol levels; with the exception of rs11574891, all SNPs varied for FPG levels, and the rs2229971, rs3125005, and rs3812604 genotypes varied for HbA1c. Moreover, rs3125005 and rs3812604 showed variation for hypertension (p = 0.0168) and SBP (p = 0.0307), respectively.

Considering that genotypes differed in relation to ancestry and the above clinical variables, we performed a multinomial logistic regression. As shown in Fig. 2, after adjusting for all confounding variables in Model 2 (age, gender, ancestry, HDL, TG, hsCRP, HbA1c, and hypertension), the TT genotype of rs9411207 remained associated with excessive weight (OR 1.50, 95% CI 1.20–1.88; p = 0.0002) (Table 2). The genetic model analysis, along with p-values for all tag SNPs, is shown in Supplemental Table 5. Supplemental Table 6 presents the corresponding OR, AIC, and p-values for the selected SNPs.

Figure 2

Regional association plot of the NOTCH1 gene with excessive weight. −log10(p-value) is shown in the upper panel; the white circle represents the associated rs9411207, and the black circles are the marginally associated SNPs. The size of the black circles is directly proportional to the LD (r²) with rs9411207. The dashed line represents the Bonferroni-corrected p-value and the dotted line the nominal p-value. In the middle, the dashed vertical lines indicate the position of the associated SNPs in the LD structure. LD blocks are shown as black triangles, representing high LD. The lower panel shows the genomic structure. The LD map was created using HaploView software.

Table 2 SNP association under the log-additive genetic model adjusted for confounding variables (Models 1 and 2).

We also included rs3124603, which showed a marked increase in significance in Model 2 and is in moderate LD with rs9411207. In this last model, the inclusion of the smoking variable did not alter the association for rs9411207 (OR 1.51, 95% CI 1.20–1.89; p = 0.0003) (Supplemental Table 7).

LD structure and haplotype analysis

The rs9411207 showed high LD with rs2229971 (D′ = 0.93 and r² = 0.79) and rs11574891 (D′ = 0.99 and r² = 0.74). We investigated the combined effect of these SNPs in a haplotype analysis. Of the eight possible haplotypes containing these SNPs, seven were identified (Supplemental Table 8). Of these, three had a frequency above 5%, and the GAT haplotype, encompassing the risk alleles, was more frequent in individuals with excessive weight (32.30%) than in those with normal weight (25.66%) (OR 1.42; 95% CI 1.14–1.78, p = 0.003) (Table 3).
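The two LD measures quoted above, D′ and r², can both be derived from one haplotype frequency and the two allele frequencies. The sketch below uses the standard definitions for a pair of biallelic SNPs; the function name and the example frequencies are hypothetical and are not the cohort's data.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D, D', r^2) for two biallelic SNPs.

    p_ab: frequency of the haplotype carrying allele A at SNP 1 and B at SNP 2
    p_a:  frequency of allele A at SNP 1
    p_b:  frequency of allele B at SNP 2
    """
    for freq in (p_a, p_b):
        if not 0 < freq < 1:
            raise ValueError("both SNPs must be polymorphic")
    d = p_ab - p_a * p_b
    # D' rescales |D| by its maximum possible value given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


# Hypothetical frequencies: allele B occurs only on haplotypes carrying A,
# but A is more common than B.
d, d_prime, r2 = ld_stats(p_ab=0.30, p_a=0.40, p_b=0.30)
print(f"D = {d:.3f}, D' = {d_prime:.2f}, r2 = {r2:.2f}")
```

The example illustrates why D′ can be close to 1 while r² is noticeably lower, as in the pairs reported above: D′ reaches 1 whenever one of the four haplotypes is absent, whereas r² reaches 1 only if the allele frequencies of the two SNPs also match.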

Table 3 Haplotype analysis of NOTCH1 tag SNPs.

In silico functional analysis

rVarBase predicted that rs9411207, rs2229971, rs11574891, rs3125005, rs3812604, and rs3124603 are located in a region that might be involved in distal regulation and in RNA-binding protein-mediated regulation. This database suggests that these SNPs interact with NOTCH1, AGPAT2, C9orf163, NALT1, and the microRNAs miR4673 and miR4674, both originating from the NOTCH1 gene sequence. These SNPs are located in a chromatin interactive region with a predominantly strong transcription function in various cell types, including adipose-derived mesenchymal stem cells and adipose nuclei (Table 4). In addition, rs9411207 has a RegulomeDB score of 3a, and rs11574891 and rs3125005 have a score of 2b, which predicts that they are likely to affect gene expression levels. Moreover, the region encompassing these SNPs is a binding site for RFX1, a transcriptional regulator. The HaploReg database further highlighted these SNPs as overlapping enhancer histone marks in brain and muscle tissues. Of these, only rs2229971 is likely to influence histone marks in adipose nuclei. Using the GTEx portal, we found significant eQTL activity for rs3125005 related to CARD9 in pancreas tissue and SEC16A in the stomach, for rs3812604 related to SDCCAG3 in subcutaneous adipose tissue, and for rs3124603 related to NALT1 in whole blood and other tissues. Furthermore, the GWAS Catalog showed an association between the T allele of rs9411207 and NOTCH1 protein level measurement (beta = 0.107 unit increase; CI 0.08–0.134, p = 8 × 10⁻¹⁵).

Table 4 In silico functional analysis of NOTCH1 tag SNPs.

Discussion

The evolutionarily conserved Notch pathway has emerged as a regulator of metabolism in both in vitro and animal studies14,15,16,36. Based on this, we evaluated the genetic association of SNPs in the NOTCH1 gene and its borders with excessive weight in a Brazilian elderly cohort. We showed an independent association between rs9411207 and excessive weight: the risk conferred by the TT genotype was 1.5-fold higher than that of the other genotypes (Table 2). This SNP is in high LD with rs2229971 and rs11574891, and the GAT haplotype, constituted by these three SNPs, might be a risk factor for excessive weight. The rs9411207 is associated with NOTCH1 protein levels. Using in silico functional analysis, we found that these SNPs are located in a region involved in the transcriptional regulation of NOTCH1 and other genes. Moreover, rs2229971 is located in an enhancer region.

Synonymous variants might affect nucleic acid stability, the secondary conformation of RNA, and also protein structure, function, and levels37. In turn, SNPs located in intronic regions might have a strong functional impact through mechanisms such as alterations in mRNA stability, activation of cryptic splice sites, and/or loss of regulatory repressor elements38,39. Intronic SNPs have been associated with several diseases, including obesity, T2D, and other disorders, e.g. variants in FTO40,41,42 and TCF7L2 (refs. 23,43,44). Although intronic variants in NOTCH1 have been associated with cardiac developmental defects45, to date we have not identified any studies of the association between genetic variants in NOTCH1 and excessive weight or related traits.

The possibility of LD with other truly functional variants or non-synonymous variants associated with the phenotype should be considered. We did not identify LD with non-synonymous variants; however, rs9411207 and rs11574891 are in high LD with the synonymous rs2229971, which has been associated with bicuspid aortic valve46 and cancer47,48. In line with this, both haplotype studies and in silico functional analysis provide crucial information to enhance our comprehension of the interaction between genetic variations and disease traits, as well as of the regions responsible for regulating gene expression49. Thus, it is plausible that one of these variants is in a regulatory region and that the risk allele increases or decreases the transcription rate50, deregulating the Notch1 signaling pathway.

In this context, according to the in silico functional analysis, rs9411207 interacts with the chromatin of NOTCH1 and other genes. rVarBase suggests that this region interacts with AGPAT2, which triggers the synthesis of triglycerides inside the adipocyte51 and regulates adipogenesis52. In vitro and animal model studies showed that AGPAT2 is essential for postnatal development and maintenance of white and brown adipose tissue (WAT and BAT, respectively), along with insulin signaling53,54,55. Also, AGPAT2 mutations cause human lipodystrophy (OMIM 603100), a condition in which individuals, although thin, have metabolic syndrome similar to that found in common obesity52.

Moreover, we verified that these SNPs might interact with the long noncoding RNA gene NALT1 and the microRNAs miR4673 and miR4674, all involved in NOTCH1 expression by different mechanisms. While overexpression of NALT1 is associated with up-regulation of the Notch1 signaling pathway56, miR4673 and miR4674 might be involved in the inhibition or degradation of NOTCH157. In addition, miR4673 is involved in oxaguanine-DNA repair and inflammation58, and miR4674 in the regulation of γ-synuclein, an adipocyte-neuron gene with increased activity in obesity and a role in the control of body lipid metabolism59,60, and in angiogenesis61.

The SNPs rs11574891 and rs3125005 showed a score of 2b in RegulomeDB, indicating their potential to affect gene expression levels. Also, these SNPs seem to be in a binding site for RFX1, a transcription factor important for adipogenesis and implicated in Alström syndrome (OMIM 203800), which is considered a human model for obesity and other metabolic disorders62,63. It is important to note that rs3125005, rs3812604, and rs3124603 are in eQTL regions. The GG genotype of rs3125005 increases the expression of CARD9 in the pancreas and of SEC16A in the stomach, and the GG genotype of rs3812604 increases the expression of SDCCAG3 in subcutaneous adipose tissue. CARD9 plays a role in multiple metabolic diseases, such as obesity, insulin resistance, and atherosclerosis64. SEC16A is a RAB10 effector required for insulin-stimulated GLUT4 trafficking in adipocytes65. SNPs in SEC16B, a SEC16A paralog, are consistently associated with obesity risk in different populations66.

The role of the Notch pathway in adipogenesis is not fully understood. There is crosstalk between the Notch1 and Wnt signaling pathways, which in turn negatively regulates adipogenesis15,67. Several studies report a positive role of NOTCH1 in adipocyte differentiation68,69,70,71, energy metabolism, and adipocyte browning14,72,73,74. Inhibition or deletion of Notch1 reduces WAT mass and increases the expression of BAT-signature genes, promoting the formation of beige adipocytes7. Furthermore, it increases energy expenditure, improves insulin sensitivity, and protects mice from obesity induced by a high-fat diet75. Conversely, activation of Notch1 signaling in adipocytes is sufficient to promote a whitening phenotype in perivascular adipose tissue35.

More recently, Wan et al. (2021)76 showed that adipogenesis promoted by period circadian regulator 3 (PER3) was mediated by Notch1 pathway inhibition, and Yamaguchi et al.17 verified that haploinsufficiency of Notch1 promotes fat accumulation and adipogenesis. Yamaguchi et al. discussed that the difference between the investigations could be due to the timing of activation and dose effects on Notch1 signaling downstream transcription factors, as well as differences in the study protocols (e.g., pharmacological or genetic interference, and time course). Further analysis will be required to gain insight into the underlying mechanisms.

Our study has limitations. Older adults are particularly susceptible to sarcopenic obesity, which involves a decrease in muscle and bone mass alongside an increase in fat mass. Thus, BMI values may appear stable and might be overestimated in elderly individuals with sarcopenic obesity. We did not evaluate important environmental factors such as physical activity and diet. Despite these limitations, the strengths of the present study include the median age of our population, which exceeds the age of onset of obesity and common comorbidities, thus minimizing a typical bias in the selection of the control group77. Moreover, we evaluated a multiethnic population, which might increase the potential for the identification of new genes and variants78.

Together with previous experimental findings on Notch1 signaling in adipocytes, adipogenesis, and metabolism, the results of our association study and in silico functional SNP analyses provide insights into the human excessive weight phenotype. Thus, considering the advances in the knowledge of synthetic and natural NOTCH1 modulators in cancer therapies79, as well as a better refinement of overweight-related phenotypes, the validation of these results in other populations is important and might contribute to precision medicine.

Conclusion

In summary, our data suggest that the T allele and TT genotype of rs9411207, as well as the GAT haplotype in the NOTCH1 gene, are associated with an increased risk of excessive weight in the Brazilian population. Although the exact mechanism accounting for their influence on excessive weight remains to be determined, our data suggest that these NOTCH1 genetic variants might affect the transcription of genes relevant to adipogenic pathways and corroborate the possibility of abnormal NOTCH1 activity, opening a new perspective for the investigation of overweight and obesity-related traits.