Introduction

The circadian rhythm, which is generated by endogenous oscillators, forms the temporal structure of human physiology by regulating energy metabolism in accordance with the natural light and dark cycle. When the circadian rhythm, gut microbial activities, and geophysical environmental signals are out of sync, energy metabolism can be disrupted, and conditions like insulin resistance may develop (Stenvers et al. 2019). Insulin sensitivity in skeletal muscle is remotely modulated by light and Sirtuin 1 (SIRT1) dependent pathway (Aras et al. 2019). It has been reported that certain polymorphic variants of SIRT1, as well as other circadian genes such as Circadian Locomotor Output Cycles Kaput (CLOCK), Brain and Muscle Arnt-Like 1 (BMAL1, also known as aryl hydrocarbon receptor nuclear translocator-like 1, ARNTL1), Cryptochrome 1 (CRY1) may lead to insulin resistance and increase the risk of metabolic and mood disorders (Dong et al. 2011; Dashti et al. 2014; Kovanen et al. 2015; Angelousi et al. 2018).

Various genome-wide association studies (GWASs) and meta-analyses of GWASs have identified single nucleotide polymorphisms (SNPs) from various loci associated with insulin resistance and type 2 diabetes (Hong et al. 2014). However, identifying the genes associated with disease phenotypes in complex, multifactorial disorders can be more challenging compared to that in simple Mendelian diseases. It is generally thought that the combined effects of multiple genetic variants with small effect sizes may constitute such complex traits. However, the genome-wide significance threshold at the level of p < 5.0 × 10− 8, which is applied in GWAS to prevent type 1 error, might filter SNPs with small effect sizes unless the sample size is extremely large. Therefore, additional methodologies might be necessary in order to compensate for the limitations of traditional GWASs (Stringer et al. 2011).

Breuer et al. (2018) addressed such issues of concern for GWAS of complex traits and hypothesized that analyzing the joint effect of genetic variants via the association rule mining (ARM) technique could be an alternative solution. ARM technique is a data mining technique that aims to extract frequent patterns from large databases. The technique is generally applied to analyze customer habits from transaction databases, and gene expression from microarray data. Breuer et al. considered this technique can unravel unknown associations between genome-wide genetic variants. Besides the genetic variation of specific SNPs, minor allele content (MAC) can be calculated as a quantitative measurement. Genome-wide accumulation of MAC in individuals might play a role in the pathogenesis of complex diseases such as type 2 diabetes and Alzheimer’s disease (Lei and Huang 2017; Chen et al. 2020). According to Chen et al. (2014) and Kido et al. (2018), it can be hypothesized that as the result of a natural selection process, risk alleles, compared to protective alleles might face negative selection and be kept at lower proportions. Therefore, risk alleles are more likely to be found among minor alleles.

From the above literature, we have concluded that individuals who carry certain variants of circadian genes may be more vulnerable to metabolic disorders. For the circadian genes, considering the joint effects of multiple variants and MAC might further enhance the prediction power of genome-wide association models. In this study, we will examine sex differences in circadian gene polymorphisms, and demographic and clinical characteristics associated with insulin resistance. Our hypothesis is: carrying certain sets of homozygous minor alleles or more minor alleles in the circadian and metabolic sensing genes may be metabolically riskier in the long term thus contributing to the pathogenesis of insulin resistance.

Materials and methods

Study design and settings

In this retrospective cross-sectional study, we analyzed the data of Korean adults (n = 1404) provided by the Korean medicine Data Center (KDC). The study protocol conformed to the declaration of Helsinki. Due to the retrospective nature of the work, this study was granted an exemption by the local ethics committee with a waiver of consent (Daejeon University institutional review board exemption: 1040647-202104-HR-019-03). The study population were relatively healthy adults recruited from the local community from 2017 to 2019, and had complete data on demographics, vital signs, anthropometric and bioelectric impedance measurements, hematological profile, and single nucleotide polymorphisms (SNPs). All data were anonymized, and the SNP data were analyzed on-site at the data center using a computer completely isolated from the internet and secured with a password and a locking device.

Study population

For this study, we used minimal inclusion and exclusion criteria so that the study population could be representative of the general population. The inclusion criterion was: relatively healthy adults recruited from the local community, who had complete data on demographics, vital signs, anthropometric and bioelectric impedance measurements, hematological profiles, and SNPs. The exclusion criterion was: those with a history of malignant tumor or cardio-cerebrovascular diseases.

Hematological profile

Overnight fasting blood samples (about 22.5 ml) were collected between 8 AM and 12 PM from the median cubital vein of each participant. The samples were centrifuged for 10 min at 3450 rpm and then transported to the laboratory (Seoul Clinical Laboratories, Yongin, Korea) within 24 h of collection for hematological examination. The blood test includes complete blood count, liver function, lipid profile, fasting plasma glucose and insulin levels, inflammatory markers, and differential count. Granulocyte count was calculated by summing the neutrophil, eosinophil and basophil counts. The granulocyte to lymphocyte ratio (GLR) was calculated by dividing the granulocyte count by the lymphocyte count, the neutrophil to lymphocyte ratio (NLR) was calculated by dividing the neutrophil count by the lymphocyte count, and the lymphocyte to monocyte ratio (LMR) was calculated by dividing the monocyte count by the lymphocyte count.

In this study, we used the homeostasis model assessment of insulin resistance (HOMA-IR), an indirect measurement of insulin resistance. HOMA-IR was calculated from fasting blood insulin and glucose levels. The primary cut-off point used for HOMA-IR in this study was 2, which corresponds to about the 75th percentile in the Korean population aged 40–69 years in Kim et al. (2018)’s study. The formula used for calculating HOMA-IR was as follows:

$$\text{H}\text{O}\text{M}\text{A}-\text{I}\text{R} = \frac{\text{f}\text{a}\text{s}\text{t}\text{i}\text{n}\text{g} \text{g}\text{l}\text{u}\text{c}\text{o}\text{s}\text{e} (\text{m}\text{g}/\text{d}\text{L})\times \text{f}\text{a}\text{s}\text{t}\text{i}\text{n}\text{g} \text{i}\text{n}\text{s}\text{u}\text{l}\text{i}\text{n}(\text{u}\text{U}/\text{m}\text{L})}{405}$$

Measurements

Anthropometric measurements were obtained and recorded according to a standardized protocol. Height, weight, waist and hip circumferences were measured to the nearest ± 0.1 cm and kg. Body mass index (BMI) was calculated as body weight in kilograms divided by the square of height in meters (kg/m2). Mass and proportion of body fat and muscle were measured with the InBody 770 bioelectrical impedance analyzer (Biospace, Seoul, Korea).

Gene analysis

SNP selection

A total of 750,050 SNPs were analyzed using the Theragen Precision Medicine Research Array (PMRA) chip (TheragenEtex Bio Institute, Suwon, Korea). Following genes that are involved in the circadian cycle and metabolic sensing (Rijo-Ferreira and Takahashi 2019; Stenvers et al. 2019) were selected for this study: AKT1, AKT2, ARNTL1, ARNTL2, CLOCK, CRY1, CRY2, DEC1, FOXO1, IRS1, JAK2, LIPE, LPL, MAPK8, NFE2L2, NFKB1, NPAS2, NR1D1, NR1D2, PBP4, PCK2, PER1, PER2, PER3, PNPLA2, RORA, RORB, RORC, SIRT1, SIRT6, SLC2A1, SLC2A2, SLC16A1, SLC16A4, SLC16A7, SREBF1, STAT3, STRA6, and TIMELESS. A list of SNPs for each gene was downloaded from the Single-Nucleotide Polymorphism database (National Center for Biotechnology Information (NCBI) dbSNP). Only common, intersecting SNPs from the selected genes and the Theragen PMRA chip were used for the analysis. All SNPs were autosomal SNPs. Before data analysis, SNPs were pruned using PLINK (v1.07) software (Purcell et al. 2007). SNPs with minor allele frequency and Hardy-Weinberg equilibrium of greater than 0.05 were selected. Then, SNPs with missingness per marker greater than 10%, and missingness per individual greater than 5%, were excluded. This resulted in 478 SNPs that were passed on to later processes.

Linkage disequilibrium (LD) in a specific population refers to a non-random association of alleles at different loci. LDs for the SNPs used in this study were calculated using PLINK software. In this study, we pruned SNPs with LD above the r2 threshold of 0.5 with PLINK option --indep-pairwise 20 3 0.5. This left us with 353 SNPs that were passed on to later processes. Further processes for selecting and pruning SNPs are shown in Fig. 1.

Fig. 1
figure 1

Schematic diagram of the SNP selection process for cluster analysis and exploratory association rule mining. SNP, single nucleotide polymorphism; PMRA, Precision Medicine Research Array; MAC, minor allele content

Exploratory association rule mining

For association rule mining, if the input data are too scarce, then rules can hardly be mined. Meanwhile, if all the SNPs including those with higher MAF are included in the association rule mining, then the homozygous minor allele genotypes with higher MAF will have a higher chance to be randomly associated with each other compared to those with lower MAF. Since risk alleles, in many cases, tend to be minor alleles with low MAF (Kido et al. 2018), we had to find a point of compromise by further pruning process. Therefore, from the SNPs used above, those with high MAF, in which homozygous minor allele genotype presented in more than 10% of the entire population were removed.

Genotype data of the 305 SNPs from 1404 individuals were obtained with PLINK software. The Association rule mining algorithm was programmed with Python 3.9.1 (Python Software Foundation, Wilmington, DE, USA) using the Mlxtend library (Raschka 2018). For association rule mining, minor allele-minor allele pairs were coded as 1 and minor allele-major allele pairs and major allele-major allele pairs were all coded as 0. All missing values in genotype data were assumed to be non-homozygous minor genotypes and thus were coded as 0. We created dummy variables and dropped the minor-major and major-major allele categories so that we could come up with association rules between homozygous minor allele genotypes as a result.

We divided the SNP dataset according to males and females with high and normal HOMA-IR values. From the entire dataset of n items, we can generate \(\sum _{k=2}^{n}\left(\genfrac{}{}{0pt}{}{n}{k}\right)\times ({2}^{k}-2)\) association rules. We then applied the apriori algorithm to the dataset to obtain candidates for the later association rule mining step. We used minimum support of 0.05 which is how often the item is present in the whole dataset. For the association rules, we used a threshold of 0.1 for minimum confidence. For the rules consisting of SNP sets of A and B, the support S is defined as follows: \(S\left(A\to B\right)=P\left(A\cap B\right)=N(A\cap B)/n\), where P stands for the probability and N stands for the number of sets. The support is used to select the frequently occurring sets from the entire dataset. Meanwhile, the confidence C is defined as follows: \(C\left(A\to B\right)=P\left(B|A\right)=N(A\cap B)/N\left(A\right)\). The confidence, in this case, is the probability of SNP B appears given the SNP A appears. The association rule mining algorithm looks for sets with a high level of support and confidence in relation to their threshold.

Minor allele content

MAC of each individual was counted from the above dataset of n = 478 SNPs for circadian genes, and n = 133 SNPs from core clock genes including ARNTL1, ARNTL2, CLOCK, CRY1, CRY2, DEC1, NPAS2, PER1, PER2, and PER3. Furthermore, we pruned the SNPs using different MAFs of 0.1 or 0.3.

Statistical analysis

The Hardy-Weinberg equilibrium, missing data, MAF, and LD were processed and calculated using PLINK software. The rest of the statistical analysis was performed using SPSS 23.0 for Windows (International Business Machines Corporation, Armonk, NY, USA) and Python 3.9.1 (Python Software Foundation, Wilmington, DE, USA). We applied log-normalization before analysis when appropriate. Males and females were analyzed separately. Differences were considered significant at P < .05. Correlation coefficients were interpreted as being negligible (0.00-0.10), weak (0.10–0.39), moderate (0.40–0.69), and strong (0.70 to 1.00).

Results

Population characteristics

Table 1 contains the demographic and clinical characteristics of the study population. The age of the population ranged from 30 to 55 years. Among the 1404 subjects, 960 (68.4%) were women and 444 (31.6%) were men. Most of the clinical characteristics differed significantly between men and women. Therefore, we conducted statistical analyses on men and women separately. Furthermore, because the number of male subjects was lower than that of female subjects, the statistical power of males in this study was lower than that of females. The mean ages of men and women were also found to be significantly different. The actual difference in the mean age, however, was about 1.5 years, so we considered that it was clinically negligible.

The distribution of HOMA-IR in men and women in this study population is shown in Table 2. HOMA-IR of 2, which was used as a cut-off point in this study, was at the 82nd percentile in the entire population, 85th percentile in women, and 76th percentile in men. Among the study population, 144 (15.0%) women and 108 (24.3%) men had HOMA-IR values above 2. There were significant differences in most of the clinical characteristics between the groups with HOMA-IR values above and below 2 (Table 1).

Table 1 Demographic and Clinical Characteristics of the Study Population
Table 2 Distribution of HOMA-IR in Men and Women

The hematological characteristics of the study population are shown in Table 3. Most of the hematological characteristics differed significantly between men and women, and between the groups with high and low insulin sensitivity (i.e., HOMA-IR value below and above 2). Subjects with a HOMA-IR value of 2 or higher had higher levels of high-sensitivity C-reactive protein (hsCRP), neutrophils, lymphocytes, monocytes, red blood cell (RBC) count, hemoglobin, and hematocrit, and lower mean corpuscular volume (MCV) and mean corpuscular hemoglobin (MCH) in both men and women. In addition, women with HOMA-IR 2 or higher had higher GLR and NLR.

Table 3 Hematological Characteristics of the Study Population

The body impedance measurements of the study population are shown in Table 4. Compared to men, women had a greater visceral fat area (VFA), percentage of body fat (PBF), body fat mass (BFM) of the upper and lower limbs, fat to lean mass ratio (FLMR), fat to skeletal muscle mass ratio (FSMMR), and lean fat to trunk fat ratio (LFTFR). Meanwhile, men had greater soft lean mass (SLM) and skeletal muscle mass (SMM). BFM of the trunk did not differ between men and women. Subjects with HOMA-IR of 2 or higher had higher VFA, PBF, BFM, FLMR, SLM, SMM, FSMMR, and LFTFR.

Table 4 Clinical Differences in Fat and Muscle Distributions and Ratios Based on Body Impedance Measurements

Figure 2 summarizes the contents in Tables 1 and 3, and 4. Men and Women in the study population demonstrated both similar and different pathophysiological features of insulin resistance. The shared pathophysiological features of insulin resistance in men and women were increased levels of hsCRP and BFM of the trunk. The features that were found significant only in insulin-resistant women were elevated levels of low-density lipoprotein cholesterol (LDL-C), GLR, NLR, and LFTFR. Meanwhile, BMI, blood pressure, fasting plasma insulin and glucose, liver enzymes, triglyceride, neutrophils, lymphocytes, monocytes, RBC count, hemoglobin, and hematocrit were all found to be significantly elevated in the insulin-resistant population and they were also significantly higher in men. VFA, PBF, FLMR, and FSMMR were significantly elevated in the insulin-resistant population and they were also significantly higher in women. Furthermore, correlations between levels of inflammatory markers, trunk fat mass, low-density lipoprotein cholesterol, liver enzymes, hemoglobin, and hematocrit in men and women are shown in Fig. S1. Correlations between HOMA-IR and clinical parameters in men and women in different age groups are shown in Table S13.

Fig. 2
figure 2

Shared and unique features of insulin resistance between men and women. LDL-C, low-density lipoprotein cholesterol; GLR, granulocyte to lymphocyte ratio; NLR, neutrophil to lymphocyte ratio; LFTFR, lean fat to trunk fat ratio; hsCRP, high sensitivity C-reactive protein; BMI, body mass index; RBC, red blood cell; MCV, mean corpuscular volume; MCH, mean corpuscular hemoglobin, HDL-C, high-density lipoprotein cholesterol; VFA, visceral fat area; PBF, percent body fat; FLMR, fat to lean mass ratio; FSMMR, fat to smooth muscle mass ratio

Exploratory association rule mining

N = 283, 309, 143, 411 candidate rules were stratified for the group of (1) females and (2) males with high insulin sensitivity (normal HOMA-IR values), and (3) females and (4) males with low insulin sensitivity (high HOMA-IR values), respectively. The cut-off value of ≥ 2 was used to differentiate between normal and high HOMA-IR values. Supports ranged from 0.050 to 0.10 and confidence ranged from 0.46 to 1 in the outcome association rules.

Breuer et al. (2014) used binary variables in their work by coding homozygous minor alleles and heterozygous major-minor allele pairs as a single variable, and homozygous major alleles as another variable. Their resulting association rules consisted of sets of SNPs from different genes. However, in our study, we adopted a slightly different approach by coding homozygous minor alleles as one variable, and heterozygous minor-major allele pairs and homozygous major alleles into another variable, and dropped the latter variable when generating dummy variables. Therefore, in this study, we searched for the association patterns of homozygous minor alleles of the circadian and metabolic sensing genes. This resulted in the rules consisting of SNPs from the same genes only. It may be due to the relatively small sample of genes examined in this study, and linkage disequilibrium as proposed by Zondervan and Cardon (2004). Since some of the association rules were shared among all groups while other rules only appeared in certain groups, we compiled all the rules from groups with different clinical features altogether. Among the rules, those that were shared but displayed different frequencies between the groups (examined by the Chi-square test with a significance level of 0.10) are shown in Table 5. The characteristics of the SNPs in Table 5 are shown in Table 6. Considering the significance level of 0.05, only one set of SNPs in the PER3 gene was significant. However, the statistical power of this study was low and this significance would vanish if we consider the Bonferroni correction. Detailed results are attached as supplementary data, Tables S112.

Table 5 Results for association rule mining on insulin sensitive and resistant men and women
Table 6 Position, MAF, and Variant Type of SNPs rs10930781, rs2364720, rs10188107, rs4243387, rs17031578, and rs228669 in NFE2L2 and PER3 genes

Homozygous minor allele genotype sets of SNPs rs10930781, rs2364720, rs10188107, and rs4243387 (NFE2L2 gene) and rs17031578 and rs228669 (PER3 gene) could be more frequently found among females with HOMA-IR ≥ 2 (p = .072 for homozygous minor genotypes in SNP clusters of NFE2L2 and p = .014 for that of PER3) although the estimated odds ratio for those variations were not high (1.8 for homozygous minor genotypes in SNP clusters of NFE2L2 and 2.1 for that of PER3 in women). Those sets of SNPs could be more frequently found among males with HOMA-IR ≥ 2 as well, although the results were not statistically significant.

Effect of MAC in the pathophysiology of insulin resistance

MAC of specific genes for each individual was calculated as described in the methods and in Fig. 1. Pearson’s correlation analysis was performed between HOMA-IR and different MAC categories constructed using circadian genes and core clock genes as previously mentioned in the methods section. We performed the correlation analysis on groups of different ages and sex. Log-normalized HOMA-IR values and MAC categories did not show any correlation in female groups in their 30s, 40s, and 50s. Meanwhile, in males, different age groups showed different patterns of correlation between HOMA-IR values and MAC categories. For males in their 50s, homozygous minor/minor genotype count of core clock genes with MAF < 0.3 had a weak positive correlation (r = .2) with HOMA-IR values at the significance level of 0.05. For males in their 30 and 40 s, however, homozygous minor/minor genotype count of core clock genes with MAF < 0.3 was negatively correlated (r=-.12 without statistical significance, r=-.13 with marginal significance, respectively). The results are displayed in Table 7.

Table 7 Correlations (r) between minor allele content counts, homozygous minor/minor genotype counts and HOMA-IR values in men and women in different age groups

Discussion

Insulin, an anabolic peptide hormone, is produced, stored, and secreted by the pancreatic beta cells in response to a glycemic stimulus. In healthy adults, β-cell responsiveness and insulin sensitivity are both higher in the morning and lower in the evening as they are under circadian control (Saad et al. 2012). In addition, plasma glucose and hepatic gluconeogenesis are also regulated by circadian metabolic rhythm (Simcox et al. 2015). Insulin resistance is a complex phenomenon that can be caused by diverse pathways. Proteomics studies revealed that there are different pathological processes of insulin resistance in skeletal muscle, adipose tissue, and liver (Li et al. 2020). Men and women have different body composition and physiology of fat and muscle, and the difference may lead to sex differences in the pathophysiology of insulin resistance. In general, women rely more on lipid metabolism while men tend to prefer utilizing glucose in a situation where energy is needed. Exercising induces a greater degree of lipolysis in women compared to that in men. Meanwhile, the carbohydrate oxidation rate is higher in men compared to that in women (Hedrington and Davis 2015). It might be due to the fact that the skeletal muscle is a major site for glucose uptake and decomposition. Men on average, compared to women, have greater skeletal muscle mass. It is well known that greater skeletal muscle mass is associated with increased insulin sensitivity while sarcopenia is known to be related to insulin resistance (Moon 2013).

In addition to metabolic preferences, lipid metabolism itself displays sexual dimorphism. For instance, the aquaglyceroporin (AQP) expression pattern is different in males and females. AQPs are channel protein that facilitates glycerol transportation across the cell membrane. AQPs are generally expressed in adipocytes and hepatocytes. In rodents, males have higher hepatic AQP9 protein, and thus greater hepatic glycerol permeability. Similar tendencies are observed in humans as men have higher hepatic glycerol permeability compared to women (Rodríguez et al. 2015). In this study, serum levels of liver enzymes were significantly higher in men and in subjects with HOMA-IR values above 2. In addition, there were stronger correlations between BFM of the trunk and ALT and AST in men compared to those in women. It could be possible that in insulin-resistant men, hepatic cell damage and the resulting oxidative stress and inflammation could be more prominent. In a community-based study by Cho et al., an increase in liver enzyme activities was accompanied by an increase in C-reactive protein in diabetic patients, and both of them were independent predictors for type 2 diabetes (Cho et al. 2007). In men, the accumulation of fat in the liver, central obesity, and inflammation may be one of the common pathophysiologies of insulin resistance and type 2 diabetes. These results may partially explain the higher incidence of fatty liver disease in men as well (Rodríguez et al. 2015).

On the other hand, the accumulation of body fat is related to chronic inflammatory conditions (Fernández-Sánchez et al. 2011). In this study, women, on average, had a higher percentage of body fat and FLMR compared to those of men. Parameters such as LDL-C, GLR, and NLR were elevated and LFTFR was decreased in insulin-resistant women but not in insulin-resistant men. In addition, insulin-resistant women had a higher level of hsCRP in their plasma compared to insulin-resistant men. There were stronger correlations between hsCRP and NLR and BFM of the trunk in women compared to those in men (Fig. S1). From these results, it could be postulated that women are more prone to systemic inflammation and oxidative stress due to higher adiposity.

Besides adiposity, elevated RBC count, hemoglobin levels and hematocrit were all related to insulin resistance in this study population. Meanwhile, hsCRP was weakly correlated with Hb and Hct in women only. Many studies have suggested that insulin has an erythropoietic effect and insulin receptor is engaged in RBC proliferation. It was previously described by Choi et al. that increased erythropoiesis and chronic subclinical inflammation are related to insulin resistance in elderly Koreans (Choi et al. 2003). Iron is a major source of hydroxyl radical (•OH) which can cause a wide range of biological damage. Insulin resistance is associated with iron overload as insulin can stimulate ferritin synthesis and facilitate iron uptake by cells. It has been recently discovered that iron overload can predict early death in a dose-dependent manner in the general population (Fernández-Real et al. 2015). Prolonged iron overload can lead to decreased lysosomal pools, autophagy failure, glucose intolerance and insulin resistance. Interventions such as an iron restriction diet, blood-letting, and chelation therapy could all increase insulin sensitivity (Jahng et al. 2019).

From the exploratory association rule mining, we could see that sets consisting of homozygous minor allele genotypes of SNPs rs10930781, rs2364720, rs10188107, and rs4243387 (from NFE2L2 gene) and rs17031578 and rs228669 (from PER3 gene) could be more frequently found among females with HOMA-IR ≥ 2 although the estimated odds ratio for those variations (1.8 for homozygous minor genotypes in SNP clusters of NFE2L2 and 2.1 for that of PER3; all odds ratios may be applied to women only) are not that high. SNPs rs10930781, rs2364720, rs10188107, and rs4243387 are all intron variants of the NFE2L2 gene. They have been reported in the studies on acute respiratory distress syndrome and adult-onset cognitive deficits induced by Phencyclidine as possible functional variants in the NFE2L2 gene (Acosta-Herrera et al. 2015; Shirai et al. 2015) NFE2L2 (Nuclear factor-erythroid 2 like, also known as NRF2) is a transcription factor that regulates redox metabolism in cells. NFE2L2 activity increases during various stressful situations, including inflammation and redox perturbation to direct metabolic reprogramming of the cells and protect cells from oxidative damage. NFE2L2 is necessary for antioxidant function and nicotinamide adenine dinucleotide phosphate (NADPH) production in cells and tissues (Hayes and Dinkova-Kostova 2014; Uruno et al. 2015). In addition, NFE2L2 can regulate circadian rhythm and integrate redox metabolism into the tissue-specific circadian cycle by binding to enhancer regions of CRY2 and the stress response element (STRE) motif of PER3 and regulating their expression (Wible et al. 2018).

The cytoprotective and antioxidative activity of Kelch Like ECH Associated Protein 1 (KEAP1)-NRF2 system is important for glucose homeostasis as it can protect pancreatic beta cells from oxidative damage. Moreover, NRF2 plays an important role in iron metabolism. Iron can mediate glucose metabolism at multiple levels, and increased iron load can promote oxidative stress via the Fenton reaction (Kerins and Ooi 2018). NRF2 can restrict inflammation and protect cells from oxidative damage by regulating the pentose phosphate pathway (PPP). The PPP is also important for glutathione reduction, NADPH production, and nucleic acid biosynthesis (Early et al. 2018). Therefore, dysregulation of NRF2 may render cells vulnerable to oxidative stress and chronic inflammation. NRF2 can be a potential target for the treatment of chronic inflammation, insulin resistance, and type 2 diabetes (David et al. 2017). It has been recently discovered that Nrf2 can restore leptin and insulin sensitivity by reducing the oxidative stress produced by the hypothalamus in mice (Yagishita et al. 2017).

Meanwhile, rs228669 SNP in the PER3 gene is a synonymous variant while rs17031578 is an intron variant. In Kovac et al.’s study, SNP rs228669 was significantly associated with increased levels of triglycerides (TG) and risk of pre-term birth (Kovac et al. 2019). In Zhang et al.’s study, rs228669 SNP was significantly associated with the overall survival of patients with hepatocellular carcinoma. SNP rs228669 is located in the exonic splicing enhancer (ESE) region. Therefore, when there is a variation, it is possible that the sequence of mRNA, and structures and functions of the product protein can be altered (Zhang et al. 2014). Although it is not directly related to those SNPs above, PER3 length polymorphism in its exon 18 has been reported in patients with type 2 diabetes (Karthikeyan et al. 2014). In addition, the transcript level of PER3 was reduced in type 2 diabetes patients (Ando et al. 2009). It has been recently discovered that adipogenesis is regulated by the circadian rhythm generated by Per3 in adipocyte precursor cells (APCs), and deletion of the Per3 gene is related to increased adipogenesis in vivo in mice. In APCs in mice, the Per3-Bmal1 complex can regulate adipogenesis by modulating Klf15 (Kruppel-like factor 15) and Pparγ (peroxisome proliferator-activated receptor γ). Meanwhile, in human cells, there are conflicting results. Some studies reported that human PER3 repressed adipogenesis in human mesenchymal stem cells and downregulation of human PER3 immortalized bone marrow-derived Scp-1 cells and patient adipose-derived stromal cells. On the other hand, PER3 enhanced adipogenesis in human adipose tissue-derived stromal cells (Aggarwal et al. 2017; Wan et al. 2021). Further research may be necessary to clarify the relationship of the PER3 variations to the development of insulin resistance in humans.

Recently, Jakubiak et al. (2021) discovered that obesity and insulin resistance had the strongest relationship with oxidative stress among metabolic syndrome components such as “obesity and insulin resistance,” “dyslipidemia,” and “blood pressure.” It is worth noting that NRF2 and PER3 are related to the redox cycle and adipogenesis. According to Rey et al. (2016), inhibiting the PPP can alter circadian rhythm through mechanisms independent of the transcription-translation feedback loop and NRF2 is a key mediator linking the redox signals and circadian oscillations. 6-aminonicotinamide (6AN) treatment, which inhibits PPP and NADPH production, could delay the circadian phase in a reversible manner. Circadian genes, including PER3, displayed phase delays in transcription profiles as well. 6AN treatment also induced NRF2 activation and subsequent PER3 response. NRF2 has been shown to bind to the STRE site and mediate Per3 expression (Wang et al. 2012).

Meanwhile, the roles of NRF2 and PER3 in human adipogenesis still remain uncertain. Despite the fact that evidence from the majority of studies suggests that oxidative stress can upregulate Nrf2, which in turn promotes adipogenesis. Nrf2 inhibition can reduce lipid accumulation in response to oxidative stress. However, conflicting evidence suggests that Nrf2 can inhibit adipogenesis through the aromatic receptor pathway. More research is needed to clarify the pathway and fully comprehend the relationship between circadian rhythm, oxidative stress, obesity, and insulin resistance.

For the MAC of the circadian genes, it was found that homozygous minor genotype count of the circadian genes with MAF below < 0.3 were significantly correlated with HOMA-IR only in men in their 50s, although the correlation was very modest (r = .200, p < .05). For women, MAC or homozygous minor genotype count of the circadian genes did not display any significant correlation with HOMA-IR. It was interesting to note that homozygous minor/minor genotype count for the core clock genes with MAF < 0.03 was negatively correlated with HOMA-IR in men in their 30s (r=-.12) and 40s (r=-.13). However, for men in their 50s, the variables were positively correlated (r = .2). It might be due to aging-associated changes in gene expression. Further investigation is required. In a previous study conducted on the British population, it was found that enrichment of minor alleles of SNPs was found in patients with type 2 diabetes (Lei and Huang 2017). SNPs from genes other than circadian and metabolic sensing genes may need to be included in the model to enhance the predictive capability of MAC for insulin resistance and type 2 diabetes.

The goal of this study was to discover the minor alleles that might be related to the pathogenesis of insulin resistance. From this study, we have found that body composition and physiological differences in men and women may lead to sex differences in the pathophysiology of insulin resistance. Insulin-resistant women, compared to insulin-resistant men, had a greater degree of systemic inflammation and adiposity. Meanwhile, insulin-resistant men had greater levels of liver enzymes, plasma insulin and glucose, and hematological parameters such as hematocrit compared to those in insulin-resistant women.

From association rule mining results, we could find that homozygous minor allele contents in a circadian gene such as PER3 may predispose individuals to be more prone to insulin resistance although additional evidence from further research is necessary. In this study, homozygous minor alleles of PER3 could be more frequently found among women with decreased insulin sensitivity (p = .014). It may be due to the fact that women tend to rely more on lipid metabolism and alterations in the PER3 gene may affect lipid metabolism. Meanwhile, homozygous minor alleles of NFE2L2 could also be more frequently found among women with insulin resistance, although the result was not statistically significant (p = .072). The degree of systemic inflammation measured with hsCRP, GLR, and NLR was greater in insulin-resistant women compared to that in insulin-resistant men. Due to the higher level of systemic low-grade inflammation, women with decreased insulin sensitivity may be more prone to oxidative stress. However, because the number of female subjects was greater than that of male subjects, females had greater statistical power in this study than males. As a result, the greater significance observed in female groups could be a result of the power difference as well.

To conclude, oxidative stress due to adiposity and iron overload is related to insulin resistance in early to middle adulthood. Homozygous minor alleles of genes such as PER3 and MAC of the circadian genes may predispose individuals to insulin resistance. However, the population in this research may be relatively too young and small to draw a meaningful conclusion. Therefore, further investigation of a larger population is necessary.

Limitations

There are several limitations to this study. First, since the number of subjects was not large, our data have to be interpreted with careful consideration. We did not apply multiple comparison corrections for gene analysis because we only applied planned comparisons on specific SNPs. The results may no longer be significant if we apply Bonferroni correction. In this study, we primarily used the PLINK program for data processing and association rule mining for the main analysis. The reason for this is that the number of subjects was small and insufficient to perform GWAS. Rather than searching for SNPs associated with insulin resistance throughout the entire pool, the authors focused on the genes associated with the mechanism of insulin resistance and then looked for association rules within the SNPs of those genes. Searching for SNP sets rather than individual SNPs will also result in a larger effect size. Duplication studies with a larger population may be necessary to further verify and supplement the present investigation. In addition, there were fewer men compared to women in this study. So, the power of statistical tests may differ between men and women. The age of the population of this study is 30 to 55, which may be relatively young compared to other similar studies. Further research on the older population is necessary to apply the results to the population older than 55 years. Finally, we did not check the smoking status, drug history, and sleep and dietary patterns of the subjects in this study, which might have affected the health status of the participants. Further studies may be needed on those aspects as well.