figure b

Introduction

Type 2 diabetes has been traditionally considered as one disease characterised by both insulin resistance and insulin deficiency. Nevertheless, the disease is heterogeneous [1]. A formal description of five distinct subgroups was proposed in a large Swedish cohort [2] and replicated in other populations [2,3,4,5]: severe autoimmune diabetes (SAID) severe insulin-deficient diabetes (SIDD); severe insulin-resistant diabetes (SIRD); mild obesity-related diabetes (MOD); and mild age-related diabetes (MARD). The subgroups differed not only in their clinical characteristics at diagnosis but also in their pathophysiological mechanisms and susceptibility to complications.

India is referred to as one of the diabetes capitals of the world, and Indian individuals with type 2 diabetes differ from Europeans in that they develop diabetes at a younger age and are thinner [6, 7]. Indians also differ in body composition, having higher fat and lower lean proportions at the same BMI [8]. Given the role of adiposity in insulin resistance, it has therefore been assumed that type 2 diabetes in Indians is primarily driven by insulin resistance [9]. However, it is increasingly recognised that insulin deficiency may be a significant driver of diabetes in Indians [10]. Recent studies show that both the increase in the diabetes prevalence and the characteristics of the affected individuals vary in different parts of India [11, 12]. Lean type 2 diabetes is prevalent in India, especially in undernourished regions [13]. Recent studies have shown that subgroups of type 2 diabetes in Indians show partial concordance with those in Europeans [14, 15].

Type 2 diabetes is diagnosed at a younger age in India and its pathophysiology and heterogeneity warrants further investigation. Younger age at diagnosis has distinct implications for treatment, long-term complications and mortality as well as socioeconomic burden [16]. Therefore, early identification of subclasses may be vital for appropriate treatment to reduce adverse outcomes [17]. To address this, we implemented the Swedish algorithm [2] to identify subgroups of young Indians with type 2 diabetes diagnosed before 45 years of age from the WellGen cohort from Pune, India [18]. We then compared Indian and European type 2 diabetes subgroups to obtain information on the relative distributions and characteristics in the two populations. We also performed de novo clustering of individuals from the WellGen study to assess whether clusters obtained were similar in proportion and characteristics to those from the European-derived centroids. Finally, we investigated the subclassification of type 2 diabetes in cohorts from two other geographical regions across India.

Methods

Study population

WellGen (Pune, Maharashtra, western India)

The WellGen study includes patients visiting the Diabetes Unit, KEM Hospital, Pune and associated clinics for routine diabetes management between 2004 and 2006 [18]. Individuals diagnosed with diabetes below 45 years of age using the WHO guidelines were included [1]. Diagnosis of type 2 diabetes was based on clinical criteria: age at diagnosis >20 years; no history of ketoacidosis; central obesity (waist–hip ratio: WHR >0.80 in women and >0.90 in men); and response to treatment with oral glucose-lowering agents. Individuals with a clinical diagnosis of type 1 diabetes (diagnosis before 20 years of age, history of ketoacidosis, continuous insulin treatment since diagnosis), fibrocalculous pancreatic diabetes (FCPD) or fulfilling criteria for monogenic diabetes were excluded. In total, 1612 individuals were included (Table 1).

Table 1 Clinical and biochemical characteristics of participants enrolled in WellGen and ANDIS study with age at diagnosis less than 45 years

Clinical information including age, sex, age at diabetes diagnosis, family history and socioeconomic status was obtained through a standardised questionnaire. Height, weight, waist and hip circumferences, and BP were measured using standardised methods [18, 19]. Fasting plasma glucose, total cholesterol, HDL-cholesterol, triacylglycerols and HbA1c were measured using standard laboratory assays as described previously [18, 19]. Fasting C-peptide was measured by ELISA (Diagnostic Biochem Canade, ON, Canada). Fasting glucose and C-peptide measurements were used to calculate Homeostatic model assessment 2 estimates of β-cell function (HOMA2-B) and insulin resistance (HOMA2-IR) values [20, 21]. Details of treatment (insulin, oral glucose-, BP- and lipid-lowering medication) were recorded.

Cardiovascular complications were defined by ICD-10 codes (http://apps.who.int/classifications/icd10/browse/2016/en): Coronary artery disease (CAD) I20–21, I24, I251 and I253–259; stroke I60–61 and I63–64. Nephropathy was diagnosed by urine strip albumin measurement (nil, trace, and +), and by eGFR calculation (ml min−1 [1.73 m body surface area]−2) by the Modification of Diet in Renal Disease (MDRD) formula (>90 ml min−1 [1.73 m]−2 normal; and 90–60 ml min−1 [1.73 m]−2 mild, 60–30 ml min−1 [1.73 m]−2 moderate and <30 ml min−1 [1.73 m]−2 severe chronic kidney disease [CKD]). Diagnosis of retinopathy was based on dilated fundus examination performed by an ophthalmologist and was classified as non-proliferative diabetic retinopathy or proliferative diabetic retinopathy. Peripheral neuropathy was diagnosed by biothesiometer (non-perception of vibration sense at 15 or higher amperes at two or more sites on the feet) (Table 2).

Table 2 Complications and current treatment for participants of the WellGen and ANDIS studies stratified by sex

Ahmedabad (Gujarat, western India)

Patients with a clinical diagnosis of type 2 diabetes visiting the DiaCare Clinic, Ahmedabad during 2018–2019 diagnosed below 45 years of age and duration of diabetes less than 2 years were invited to participate and 187 individuals consented. Measurements included anthropometry, HbA1c levels and a fasting measurement of plasma glucose and C-peptide (MAGLUMI C-peptide; CLIA, Shenzhen, China) (electronic supplementary material [ESM] Table 1).

PHENOEINDY-2 (Dibrugarh, Assam, North-east India)

Patients with clinical diagnosis of type 2 diabetes attending the medical outpatient department of Assam Medical College, Dibrugarh during 2017–2019 if diagnosed below 40 years of age were invited to participate and 205 individuals consented. Measurements included anthropometry, HbA1c levels, a fasting measurement of plasma glucose and C-peptide (ELISA, Diagnostic Biochem Canade, ON, Canada) (ESM Table 2).

ANDIS (Scania, Southern Sweden)

The ANDIS project comprises newly diagnosed diabetic individuals aged >18 years in Scania County, Sweden between 2008 and 2016 [2]. Biochemical and anthropometric measurements and presence of complications were recorded as described elsewhere [2]. For the current study, 962 individuals diagnosed with type 2 diabetes before 45 years of age were included. We excluded any person with known type 1 diabetes, monogenic diabetes and GAD antibody positivity (so-called SAID) to maintain concordance with the WellGen study. The prevalence of complications (diagnosed as described previously [2]; Table 2) was recorded ~4.2 years after diagnosis.

DIREVA (Vaasa, Western Finland)

DIREVA includes 5107 individuals with diabetes recruited from 2009 to 2014 in the Vaasa Hospital District. For the current study, 424 individuals with type 2 diabetes diagnosed below 45 years of age were included; exclusion criteria were similar to those for ANDIS (ESM Table 3). Biochemical and anthropometric measurements have been described elsewhere [2]. No treatment or complication data from DIREVA have been included in the current study.

Ethics statement

All studies were approved by the local/regional Institutional Ethics Committees, and all participants gave written informed consent.

Statistical methods

Participants with measurements above or below 5 SDs from the mean for the clustering parameters were excluded from the analysis and values outside the limits for HOMA2 calculation (fasting glucose <3 mmol/l or >25 mmol/l; C-peptide <0.2 ng/ml or >3.5 ng/ml) were capped to the proximal upper or lower limits. To perform supervised clustering in relation to the European-derived cluster coordinates, phenotypes (age at diagnosis, HbA1c, HOMA2-B, HOMA2-IR and BMI) were scaled using the same scaling parameters (mean and SD) as described previously [2]. Due to the unavailability of GAD autoantibody data in the Indian study (WellGen), we only included clusters 2–5 (SIDD, SIRD, MOD and MARD). Participants were assigned to the predetermined clusters on the basis of which ANDIS cluster they were most similar to, calculated as their Euclidean distance from the nearest cluster centre derived from ANDIS coordinates. The nearest centroid method was used to find the nearest centroid (as measured with Euclidean distances) for each individual. This resulted in each participant being assigned to any of the four clusters: 2/SIDD; 3/SIRD; 4/MOD; or 5/MARD. Given the wide range of duration of diabetes in WellGen, we performed a sensitivity analysis by separately assessing the type 2 diabetes subgroups among those within 5 years of diagnosis and those above.

To perform unsupervised clustering, all previously mentioned variables were used in a separate analysis. Given that the results from the supervised clustering analysis showed a strong bias towards the 2/SIDD and 4/MOD cluster, and the silhouette analysis indicated that two was the most stable number of clusters, we performed k-means clustering into two clusters. All phenotypes were scaled to have a mean of 0 and an SD of 1, this time with scaling parameters derived from the data itself. k-means clustering was then performed separately for female and male participants using the k-means runs algorithm from the fpc package version 2.1–11.1 (https://CRAN.R-project.org/package=fpc) in R version 3.4 [22].

We investigated differences between groups (male and female participants or between four subgroups) by t test or ANOVA for clinical characteristics and by χ2 test for complications. To compare differences in the rate of complications between subgroups, we also used logistic regression with adjustment for age at diagnosis, sex and duration of diabetes. We used SPSS version 22.0 for these analyses.

Type 1 diabetes genetic risk scores

In the absence of GAD autoantibody data, we applied a previously validated ‘type 1’ genetic risk score (GRS) (ESM Table 4) to 560 WellGen participants with available data to estimate the proportion of those carrying autoimmune risk alleles. The characteristics of these individuals did not differ from the characteristics of those for whom genotyping was not available (ESM Table 5). A positive control group comprised 261 individuals with type 1 diabetes, as described previously [23]. A negative control group comprised 461 participants with normal glucose tolerance (75 g OGTT; WHO 1999 criteria) from the Pune Maternal Nutrition Study (PMNS) [24].

Genotyping

Genome-wide genotyping data was generated on WellGen and PMNS participants using Affymetrix SNP 6.0 Chips (Affymetrix, CA, USA) and the Infinium Global Screening Array V1 B37 (Illumina, San Diego, CA, USA) for the type 1 diabetes cohort. Quality control and imputation were performed as described in ESM Methods.

GRS

A previously described set of nine SNPs was used for type 1 diabetes GRS calculations (ESM Table 4) [23, 25, 26]. In the absence of genotyping data for rs7454108, a proxy SNP rs3957146 (linkage disequilibrium [LD]: r2 = 1, D′ = 1) was used. LD is the non-random association of alleles at multiple DNA markers resulting from their close proximity to one another within a chromosome and are therefore inherited together. Classical LD measures include D′ and r2, where D′ and r2 values >0.8 indicate a higher degree of co-inheritance. The haplotype was constructed using rs2187668 + rs3957146 as described previously [23] and GRS scores were computed on PLINK 1.09 (http://pngu.mgh.harvard.edu/~purcell/plink/) [27] using weighted scores. Logistic regression was performed to assess the discriminatory power of GRS between type 1 diabetes and other subgroups.

Results

We first sought to investigate the subgroups of individuals with young-onset type 2 diabetes in the Indian WellGen study and compare them with Swedish subgroups from the ANDIS study.

The WellGen study comprised individuals diagnosed with type 2 diabetes before 45 years of age, and all relevant data required for clustering were available for 1624 participants. After applying exclusion criteria, 1612 individuals (56% men) with mean age at diagnosis of 37 years, duration of diabetes ~10 years and BMI 26.18 kg/m2 were included (Table 1 and ESM Fig. 1).

For comparison, we selected 962 participants with type 2 diabetes (58.9% men, mean age at diagnosis 38.83 years) from the ANDIS study, after excluding 577 individuals belonging to cluster 1 (SAID) [2]. The Indian participants were younger at diagnosis, had lower BMI, higher fasting plasma glucose and lower fasting C-peptide, HOMA2-B and HOMA2-IR compared with the ANDIS sub-cohort (Table 1). The proportion of participants receiving lifestyle management alone, glucose-lowering oral agents and insulin treatment was broadly similar in both cohorts (Table 2). In the WellGen study, men had a higher prevalence of cardiovascular events and nephropathy compared with women, whereas the prevalence of neuropathy was higher in women. There was no difference with respect to these complications between men and women in the ANDIS study (Table 2). We did not compare the complication rates between the two cohorts because of the difference in duration of diabetes.

SIDD predominates in India, MOD in Sweden

In the absence of GAD autoantibody data, we obtained the four expected clusters, albeit with different proportions. In the WellGen study, the SIDD cluster was the largest subgroup (52.8%), followed by the mild obesity-related MOD (37.7%), while severe insulin-resistant SIRD (1.1%) and mild age-related MARD (8.4%) were less common (Fig. 1, Table 3). In a sensitivity analysis, with increasing duration of diabetes (from <5 years to >5 years), the proportion of participants in the SIDD subgroup increased (from 45.5% to 56.9%) while that in MOD group decreased (from 44.7% to 33.8%) (ESM Table 6).

Fig. 1
figure 1

Distribution of participants from the WellGen and ANDIS study in the predefined clusters. (a–c) Distribution of WellGen participants, showing all participants (n=1612) (a), men with diabetes (n=902) (b) and women with diabetes (n=710) (c). (d–f) Distribution of ANDIS participants, showing all participants (n=962) (d), men with diabetes (n=567) (e) and women with diabetes (n=395) (f)

Table 3 Characteristics of participants enrolled in WellGen and ANDIS study by clusters for all participants, and for male and female participants

In the sex-stratified analysis, SIDD (60.6%) remained the predominant cluster in men whereas the main subgroup was MOD (53.2%) in women; MARD was more common in men (13.3% vs 2.1%) (Fig. 1, Table 3).

Concordant with the diabetes subgroups in ANDIS, among the two predominant subgroups, Indian individuals in the SIDD subgroup had the lowest insulin secretion (HOMA2-B) and the highest blood glucose levels while those in the MOD subgroup had the highest BMI. Individuals in the SIRD subgroup were the most insulin resistant, whereas those in the MARD subgroup were the oldest at diagnosis, with the lowest blood glucose levels and lowest degree of insulin resistance (Fig. 2, Table 3). These results support the pathophysiological basis for subclassification in a population which has a different genetic and socioeconomic background compared with the Swedish population.

Fig. 2
figure 2

Box plot of cluster characteristics in the WellGen and ANDIS studies. Distribution of age at diagnosis (a, f), BMI (b, g), HbA1c (c, h), HOMA2-B (d, i) and HOMA2-IR (e, j) in the WellGen (ae) and ANDIS (fj) studies for each cluster. The central line within each box represents the median and the upper and lower limits of the box represent the IQR. The whiskers are the most extreme values within 1.5× the IQR from the first and second quartiles. k-means clustering was done separately for men and women; data are shown for each sex separately. F, female sex; M, male sex

The distribution of subgroups in ANDIS differed from that of WellGen; MOD was the most predominant cluster (67.57%), followed by SIDD (26.09%), SIRD (3.64%) and MARD (2.70%) (Fig. 1, Table 3). These distributions were similar in men and women. The pathophysiological characteristics of these Swedish individuals with young-onset type 2 diabetes were similar to those in the parent ANDIS cohort (Fig. 2, Table 3).

Treatment

In both WellGen and ANDIS, insulin treatment (alone or in combination with OHAs) was most commonly prescribed to individuals in the SIDD subgroup (38.3% in WellGen, 51.0% in ANDIS) (Table 4).

Table 4 Treatment and complications by cluster in the WellGen and ANDIS study

Complications

In WellGen, we compared the prevalence of complications in the two major subtypes, SIDD and MOD. Small numbers in the SIRD and MARD groups precluded comparison of complications. Retinopathy and nephropathy were most common in the SIDD subgroup whereas neuropathy was more prevalent in MOD (Table 4). The prevalence of macrovascular complications was similar in these two subtypes. Of the less common subgroups, SIRD had a high prevalence of retinopathy while MARD had a high prevalence of nephropathy and macrovascular disease.

In the ANDIS cohort, nephropathy (70.4%) and retinopathy (44.4%) prevalence were highest in SIRD whereas neuropathy was most common in SIDD (Table 4). Consistent with previous findings, SIRD also showed the highest prevalence of CKD (47.1%) while macroalbuminuria (14.9%) was most common in MOD.

De novo clusters show a high degree of concordance with SIDD and MOD

We also applied the de novo k-means clustering to assess the subgroups obtained in the Indian study population and compared them with those obtained using the previously published algorithm. Two was the optimum number of clusters (ESM Fig. 2a); cluster 1 had a prevalence of 66.6% while cluster 2 had a prevalence of 33.4% (ESM Table 7). Cluster 1 showed 88.8% concordance with SIDD (86.8% in male participants, 92.4% in female participants) while cluster 2 had an overlap of 62.5% with MOD (83.9% in male participants, ~49.5% in female participants) (ESM Fig. 2b).

Both clusters had the same cluster characteristics as seen using the centroid method, thereby providing a technical replication (ESM Table 7). The similarity also extended to complication rates, with nephropathy and retinopathy being prevalent in cluster 1 compared with cluster 2 whereas neuropathy was more prevalent in cluster 2 (ESM Table 8).

Low prevalence of genetic type 1 diabetes in WellGen type 2 diabetes subgroups

In the absence of GAD autoantibody data, a previously established type 1 diabetes GRS [25] comprising nine SNPs (ESM Table 4), which was validated in the Indian population [23], was applied to estimate the proportion of participants with autoimmune diabetes in a subset of the WellGen cohort (ESM Table 5). The GRS was associated with the positive control participants with type 1 diabetes compared with SIDD and MOD. The same GRS did not associate with either SIDD or MOD compared with non-diabetic control individuals (ESM Fig. 3, ESM Table 9). The proportion of participants with GRS ≥90% and 80% was ~5% and 28.7%, respectively, in type 1 diabetes (positive controls) whereas it was 0% and 1.4% in SIDD, 0% and 4.7% in MOD, and 0% and 1% in control participants. The same GRS was associated with SAID in young ANDIS participants (β = 7.3 ± 0.72, p < 2 × 10−16).

Indian diabetes subgroups are similar to subgroups in European cohort DIREVA with longer diabetes duration

Given the difference in duration of diabetes in ANDIS and WellGen, we compared the WellGen subgroups with those in the Finnish cohort DIREVA (n = 420) with duration of diabetes 14.4 years (ESM Table 10). The differences in proportion of subgroups and cluster characteristics between WellGen and DIREVA were similar to those between WellGen and ANDIS. Similar to ANDIS, SIDD was less common compared with MOD in DIREVA (23.6% vs 70.8%) (ESM Table 10, ESM Fig. 4).

Subgroups of diabetes in other regions of India

We applied the Swedish algorithm [2] to two studies from different geographical regions in India, Ahmedabad, Gujarat, western India (N = 187) and Dibrugarh, Assam, north-eastern India (N = 205; PHENOEINDY-2 cohort) (ESM Tables 11, 12). Concordant with findings in WellGen, the Ahmedabad cohort had the highest proportion of participants in the SIDD subgroup (56.68%) followed by MOD (33.15%) and MARD (10.16%) (ESM Table 11). The similarity extended to the subgroup distributions in the two sexes: the SIDD subgroup was most prevalent in male participants (61.59%) while MOD was most prevalent in female participants (53.06%) (ESM Table 11, ESM Fig. 5). The PHENOEINDY-2 cohort was the youngest and the thinnest cohort of all; the proportion of SIDD (66.66%) was the highest, followed by MOD (23.20%), MARD (7.72%) and SIRD (1.40%) (ESM Table 12, ESM Fig. 6).

Discussion

We showed that the clusters described in the newly diagnosed unselected European type 2 diabetes patients [2, 3] are also seen in the younger and thinner Indians. SIDD and MOD were the two predominant subgroups, while MARD was less common and SIRD the least common in both populations. SIDD was the predominant cluster in Indians whereas MOD was the most prevalent in Europeans. The predominance of SIDD was replicated in two independent, geographically distinct Indian cohorts of individuals with young-onset type 2 diabetes.

The distribution of the clusters suggests that deficient insulin secretion, rather than the often-purported insulin resistance, is the driver of young-onset type 2 diabetes in India. In contrast, in the young Swedish and Finnish type 2 diabetes populations, obesity and insulin resistance seemed to be the primary pathophysiological drivers. The proposed prominent role of insulin resistance was based on previous demonstrations of higher insulin resistance in Indians compared with Europeans at a given BMI possibly due to relatively more adipose body composition [7, 28, 29]. Despite the differences in age, BMI and duration of diabetes, the characteristics of the clusters themselves in Indians broadly reflected those in the European studies [2] Our new findings suggest a paradigm shift for the understanding of the pathophysiology of type 2 diabetes in young Indians, albeit they do not preclude the role of insulin resistance.

De novo k-means subclassification validated the two major diabetes subgroups obtained from the European-derived centroids. The concordance was greater in men for both subgroups, while it was lower in women for the newly obtained cluster 2 with MOD. While this increases our confidence in the classification, reclassification of a proportion of women with MOD to a SIDD equivalent cluster 1 highlights the role of insulin deficiency in the pathogenesis of type 2 diabetes in young Indians.

We applied the European-derived centroids to two smaller cohorts of individuals with young-onset type 2 diabetes from western (Ahmedabad, Gujarat) and north-eastern (Dibrugarh, Assam) India. The proportion of subclasses in the Ahmedabad participants was nearly identical to those in Pune, while the proportion of individuals in the SIDD subgroup was highest in Dibrugarh. Gujarat is a more affluent state while Assam has a lower development index and high prevalence of undernutrition.

Physicians in India have long realised the phenotypic differences of Indian diabetes patients compared with those described in patients of European origin [30, 31]. Interestingly, lean type 2 diabetes has been prominently reported in the impoverished states of Orissa and north-east India, where malnutrition-related diabetes (MRDM) has been described [32]. A proportion of SIDD patients from Assam could well be characterised similarly. There is an increasing recognition that early-life undernutrition could lead to smaller beta cell mass and insulin secretion defects demonstrable from early childhood in serially studied birth cohorts and could manifest as prediabetes or type 2 diabetes in young adulthood [33, 34]. Animal studies have clearly demonstrated poor beta cell development and islet dysfunction in offspring born to malnourished pregnant mothers [35,36,37]. It is intriguing that the highest rise in the prevalence of type 2 diabetes in India over the last 25 years has been demonstrated in states that have suffered chronic environmental, socioeconomic and nutritional deficits [38]. On such a background of intergenerational deprivation, a relatively small socioeconomic development appears enough to precipitate diabetes at a young age. It is of note that the prevalence of diabetes in those above 20 years of age has increased from 5.5% to 7.5% between 1990 and 2016 in the state of Assam.

Individuals in the diabetes subgroups displayed different sensitivities to micro- and macrovascular complications. Microvascular disorders of the retina and kidney were more prevalent in SIDD compared with MOD, while peripheral nerve damage was more prevalent in MOD. Possible reasons for these differences may lie in the pathophysiological mechanisms driving the subgroups and this needs to be studied further. Prevalence of macrovascular disease was similar in two subgroups. In the original Swedish classification, the SIRD subgroup generated a lot of interest given the high propensity of affected individuals for developing nephropathy. SIRD was the smallest subgroup in Indians with high insulin resistance as well as insulin secretion, although it was heterogeneous between the Indian cohorts. Intriguingly, the MARD subgroup had a strikingly high rate of macrovascular disease. The unique profiles of these subgroups could well represent population-specific differences and highlight the need for customisation of the clustering algorithm.

Other studies have investigated the heterogeneity of type 2 diabetes in Indians. The INSPIRED study from a chain of private diabetes clinics in India reported four subclasses, two of which were similar to the Swedish study (SIDD and MARD) [5] and two of which were new (insulin-resistant obese diabetes [IROD] and combined insulin resistance and deficient diabetes [CIRDD]) [14]. However, the clustering parameters were different and therefore not directly comparable with our study. The MASALA-MESA study reported subclasses in a mixed population in the USA, including migrant South Asian Indians (n = 217) [15]. They found an excess of younger, thinner and severe hyperglycaemic individuals among South Asian Indians, supporting our findings.

This study has strengths and limitations. This is the first attempt in India to subclassify patients with a diagnosis of type 2 diabetes at a young age. The presence of subgroups in our study comparable with those in a genetically and historically distinct European population validates the subgroups. While the power of the study is limited, validation by de novo clustering increases confidence in classification. The Indian patients are clinic-based and enrolled many years after diabetes diagnosis while on glucose-lowering treatment, which may affect the proportions of subclasses. We cannot rule out the possibility that some individuals might shift to different subgroups over time; however, only a small proportion did so in another study [3]. A sensitivity analysis in WellGen showed that the proportions of the subgroups varied with increasing diabetes duration, although SIDD remained the predominant subgroup (45.5%) even in those with less than 5 years of diagnosis. Observations in other two Indian cohorts further validated these findings. Another limitation is that this study is an opportunistic comparison of existing data and therefore laboratory measurements are not fully harmonised between cohorts. However, C-peptide measurements in different cohorts were calibrated against the same WHO standard, facilitating comparisons. Given the lack of GAD autoantibody data, it might be suspected that the SIDD group in WellGen includes individuals with autoimmune diabetes (latent autoimmune diabetes in adults [LADA]). However, the low prevalence of individuals with high type 1 diabetes GRS scores in WellGen deems a large contribution of autoimmune diabetes extremely unlikely.

In summary, we demonstrate the applicability of a European algorithm for subclassifying type 2 diabetes in young Indian patients. Our results demonstrate a prominent role for insulin secretion defects in the pathophysiology of diabetes in this group. These results could potentially influence treatment strategies for achieving optimal metabolic control, with possible benefits for long-term health [4, 39]. Translation to personalised medicine will come from carefully designed prospective studies including genetic and epigenetic investigations to elucidate pathophysiological mechanisms underlying the subgroups.