Abstract
Associations between HLA class I alleles and HIV progression in populations exhibiting Amerindian and Caucasian genetic admixture remain understudied. Using univariable and multivariable analyses we evaluated HLA associations with five HIV clinical parameters in 3,213 HIV clade B-infected, ART-naïve individuals from Mexico and Central America (MEX/CAM cohort). A Canadian cohort (HOMER, n = 1622) was used for comparison. As expected, HLA allele frequencies in MEX/CAM and HOMER differed markedly. In MEX/CAM, 13 HLA-A, 24 HLA-B, and 14 HLA-C alleles were significantly associated with at least one clinical parameter. These included previously described protective (e.g. B*27:05, B*57:01/02/03 and B*58:01) and risk (e.g. B*35:02) alleles, as well as novel ones (e.g. A*03:01, B*15:39 and B*39:02 identified as protective, and A*68:03/05, B*15:30, B*35:12/14, B*39:01/06, B*39:05~C*07:02, and B*40:01~C*03:04 identified as risk). Interestingly, both protective (e.g. B*39:02) and risk (e.g. B*39:01/05/06) subtypes were identified within the common and genetically diverse HLA-B*39 allele group, characteristic to Amerindian populations. While HLA-HIV associations identified in MEX and CAM separately were similar overall (Spearman’s rho = 0.33, p = 0.03), region-specific associations were also noted. The identification of both canonical and novel HLA/HIV associations provides a first step towards improved understanding of HIV immune control among unique and understudied Mestizo populations.
Similar content being viewed by others
Introduction
Polymorphism within the human leukocyte antigen (HLA) class I (HLA-A, -B and -C) loci represents the strongest host genetic modifier of HIV disease progression1,2,3,4. However, while HLA associations with HIV disease outcome have been extensively studied in Caucasian and African populations1,3,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26, Mestizo and other populations exhibiting complex genetic admixture remain understudied in this context. Populations in Mesoamerica (defined here as Mexico and Central America) possess unique immunogenetics as a result of admixture between mainly Amerindian and Caucasian, as well as African HLA haplotypes27,28, making these ideal for the identification of novel HLA correlates of HIV control. Here, we investigate HLA associations with five HIV clinical parameters among HIV-1 subtype B-infected, antiretroviral naive individuals in Mexico (MEX cohort, n = 1679) and 6 Central American countries: Guatemala, Belize, Honduras, El Salvador, Nicaragua and Panama (CAM cohort, n = 1534). We begin by characterizing the unique immunogenetics of this Mestizo population by comparing HLA allele frequencies (AF) and haplotype structures (linkage disequilibrium, LD) to those of a mainly Caucasian cohort from British Columbia, Canada (HOMER, n = 1622)29,30. We then define protective and risk HLA class I alleles in the individual (MEX, CAM) and combined Mesoamerican (MEX/CAM) cohorts using a novel approach that scores HLA alleles based on their associations with five interlinked clinical parameters relevant to HIV disease progression, while adjusting for HLA linkage disequilibrium, co-expression of known protective HLA alleles and other potential confounding factors. Finally, we explore the extent to which HLA protective/risk alleles observed in the MEX and CAM cohorts are universal versus region-specific.
Methods
Ethics statement
Recruitment and study of the Mexican and Central American cohorts was evaluated by the Ethics Committee of the National Institute of Respiratory Diseases (INER) in Mexico City (protocol codes E02–05, E10–10, E06–09). All experiments were performed in accordance to the protocol guidelines and regulations of our Institution, and approved by the Ethics Committee of INER. All participants were adults (over 18 years) and gave written informed consent in accordance with the Declaration of Helsinki before blood sample donation. Analysis of the BC HOMER reference cohort was approved and conducted according to the protocol guidelines and regulations of the Providence Health Care/University of British Columbia Research Ethics Board. All participants gave written informed consent and/or data were anonymized by REB-approved procedures.
Mexican and Central American cohorts
Three thousand two hundred and thirteen HIV-1 clade B-infected, Antiretroviral Treatment (ART)-naïve individuals from Mexico and from 6 out of 7 Central American countries were enrolled by convenience sampling from 2000 to 2016 as part of an international multicenter cross-sectional study to evaluate HIV molecular epidemiology, drug resistance prevalence and HLA adaptation in Mesoamerica. Individuals were enrolled and donated a single blood sample at the time of HIV diagnosis or at follow-up visits prior to starting ART according to national guidelines. Every HIV-infected person naive to ART attending each participating clinic was offered the opportunity to participate during active recruitment periods. No additional exclusion criteria were applied. In Mexico (MEX cohort), 1679 participants were enrolled in a national collaborative network of clinics from 23 out of 32 Mexican states comprising Baja California, Campeche, Chiapas, Chihuahua, Colima, Guerrero, Hidalgo, Jalisco, Mexico City, Michoacan, Morelos, Nuevo Leon, Oaxaca, Puebla, Queretaro, Quintana Roo, Sinaloa, Sonora, State of Mexico, Tabasco, Tlaxcala, Veracruz and Yucatan. From Central America (CAM cohort), 1534 subjects were recruited, including 418 from Guatemala, 102 from Belize, 42 from El Salvador, 402 from Honduras, 254 from Nicaragua and 316 from Panama, using convenience sampling. Participating institutions in Central America included: Guatemala: Roosevelt Hospital, Guatemala City (a national referral center)31; Belize: Ministry of Health, Belmopan; El Salvador: Rosales National Hospital, San Salvador; Honduras: University School Hospital, Tegucigalpa; National Cardio-Pulmonary Institute, Tegucigalpa; Mario Catarino Rivas Hospital, San Pedro Sula; Atlántida Hospital, La Ceiba; South Hospital, Choluteca (five of the largest HIV clinics across the country)32; Nicaragua: Roberto Calderón Hospital, Managua (the largest reference center in the country)33; Panama: Gorgas Memorial Institute for Health Studies, Panama City (a national reference center)34. Demographic data were obtained via questionnaire at the time of sample donation. Blood samples, completed consent forms and demographic questionnaires were shipped to the Centre for Research in Infectious Diseases (CIENI) of the National Institute of Respiratory Diseases (INER) in Mexico City, a WHO-accredited laboratory for HIV genotyping, within 72 hours of collection. HIV clinical information (baseline drug resistance test, plasma viral load [pVL] and CD4 counts) were sent back to the different Mexican states or countries in Central America for clinical follow-up of the participants.
HOMER cohort
The HAART Observational Medical Evaluation and Research (HOMER) cohort35 from British Columbia, Canada, a historic retrospective observational cohort comprising HIV-infected antiretroviral naive individuals initiating their first combination antiretroviral treatment regimen since 1996, was used as a reference for HLA allele frequency and HLA-pVL associations comparison. Clinical measurements from the earliest available time point before initiating ART were used. pVL in the HOMER cohort were performed with either Roche COBAS AmpliPrep/COBAS AMPLICOR HIV-1 MONITOR UltraSensitive Test, version 1.5 (1996–2008) or Roche COBAS AmpliPrep/COBAS TaqMan v1 HIV-1 Test (2008–2010). Median pVL and CD4 counts in the HOMER cohort were 4.90 (IQR 4.33–5.26) log10 copies/mL and 340 (IQR 170–500) cells/µL respectively. The cohort was predominantly male (86.2%), and the median age at recruitment was 37.4 (IQR 32–44) years. HOMER is primarily composed of Caucasian individuals (>60%); a minority of individuals self-identified as Hispanic (~2%) or Indigenous/Amerindian (~13%)35. The present analysis was restricted to subtype B-infected persons (n = 1622; 90% of total cohort35). As described elsewhere29,30, the majority of HLA class I types were defined at subtype level resolution, with the remaining intermediate-resolution data imputed to subtype-level resolution using a machine learning algorithm trained on HLA-A, B and C subtypes from >13,000 individuals with known ethnicity36.
HIV subtyping
HIV subtypes were determined using REGA HIV Subtyping Tool (3.0) (http://dbpartners.stanford.edu:8080/RegaSubtyping/stanford-hiv/typingtool/) and confirmed with the Recombination Identification Program37 (RIP, www.hiv.lanl.gov/content/sequence/RIP/RIP.html) using available plasma HIV pol (protease and reverse transcriptase) sequences, obtained as described elsewhere31. All non-subtype B-infected individuals were removed prior to analysis.
HIV clinical parameters
HIV pVL was determined by automated real-time polymerase chain reaction (PCR) using the m2000 system (Abbott, Abbott Park, IL, USA) with a detection limit of 40 HIV RNA copies/mL. CD45+, CD3+, CD4+ and CD8+ cell counts were obtained by flow cytometry using the Trucount Kit in FACSCanto II instrument (BD Biosciences, San Jose, CA).
HLA class I typing
Peripheral blood mononuclear cells (PBMCs) were isolated by density gradient centrifugation (Ficoll-Paque Pharmacia, Uppsala, SE) from blood samples from MEX cohort, while buffy coats were isolated from CAM cohort blood samples and cryopreserved until DNA extraction. Total genomic DNA was extracted from PBMCs (∼6 million cells) or buffy coats (200 µL) using the QIAmp Blood Mini Kit (QIAGEN, Valencia, CA), according to the manufacturer’s specifications. HLA-A, -B and -C types were resolved at subtype-level (e.g. second field/4-digit) resolution using a previously described protocol with some modifications38. Briefly, a nested polymerase chain reaction (PCR) using universal, locus-specific primers was used to amplify a ∼1000 base pair region spanning exons 2 and 3 (which encode the α1 and α2 domains of the HLA peptide binding groove) of HLA-A, -B and -C loci, using the Expand High Fidelity PCR system (Roche Applied Science, Laval, PQ) (3.5 U/µL). For HLA-A PCR reactions, 5% dimethyl sulfoxide (DMSO, Sigma-Aldrich, St Louis, MO) was added to decrease unspecific amplification and primer dimerization. HLA-B PCR products were cleaned up with ExoSAP-IT (Affimetrix, Cleveland, OH) and HLA-A and -C PCR products were diluted 10-fold and directly sequenced using a set of six sequencing primers per locus as previously described38 on a 3730xl Genetic Analyzer (Applied Biosystems, Foster City, CA), using the BigDye Terminator v3.1 chemistry (Life Technologies, Carlsbad, CA). Sequences were trimmed with Sequencing Analysis software v5.4 (Applied Biosystems) and HLA subtypes were assigned using UType v7.1 RUO (Applied Biosystems) using the up-to-date IPD-IMGT/HLA Database (http://www.ebi.ac.uk/ipd/imgt/hla/). Using this procedure, ambiguous HLA assignments can arise due to the presence of polymorphisms outside of the analyzed region (exons 2 and 3). For these cases, ambiguous HLA pairs were managed as G groups, including: A*74:01:01G (A*74:01 in the text, encompassing A*74:01 and A*74:02; difference in exon 1), C*18:01:01G (C*18:01 in the text, encompassing C*18:01 and C*18:02; difference in exon 5), C*17:01:01G (C*17:01 in the text, encompassing C*17:01, C*17:02 and C*17:03; difference in exons 1 and 5). Ambiguous HLA assignments may also arise due to the lack of phase resolution as a result of bulk (direct) sequencing of PCR amplicons. Considering only HLA ambiguities that affect the first (allele group-level) and second (subtype-level) HLA fields and that included either two common alleles or one common and one well-documented subtype present in the Common and Well-documented catalogue39, a total of 142 HLA-A, 161 HLA-B, and 104 HLA-C pairs were ambiguous due to lack of phase resolution (Supplementary Table S1). These ambiguities were resolved by assigning the most probable combination using HLA allele frequency (AF) and linkage disequilibrium (LD) data from the same cohorts. All HLA haplotypes were confirmed using the same published probabilistic method used for the HOMER cohort36 (HLA Completion web tool, http://boson.research.microsoft.com/hla/). Only 72 (of 6392, 1.12%) missing HLA-A or HLA-C types were imputed due to PCR/sequencing failure at these loci or lack of sample availability, using the same tool. HLA pairs with an unresolved HLA-B allele (23/3213 pairs, 0.71%) were considered missing data. As previously described40, our HLA typing methodology was validated to be 99.9% accurate in Mesoamerican Mestizo populations by comparing assigned HLA types to those obtained via amplification of exons 1 to 8 (for HLA-A and HLA-C) and exons 1 to 7 (for HLA-B) followed by next generation sequencing (TruSight HLA v2 Sequencing Panel, Illumina, San Diego, CA) in a large independent Mexican cohort (n = 323). Raw HLA typing data are available upon direct request to the authors.
HLA linkage disequilibrium analysis
LD between pairs of HLA alleles was assessed using the Los Alamos HIV Molecular Immunology Database HLA Analysis Tools (https://www.hiv.lanl.gov), with multiple comparisons addressed via Bonferroni correction. For the MEX cohort LD analysis, 12,970 two-way comparisons were performed (p-values < 3.9E-06 were considered significant), and for the CAM cohort 14,234 two-way comparisons were performed (p-values < 3.5E-06 were considered significant). The high-dimensional visualization tool Disentangler41 was used to graph HLA haplotype structures.
HLA allele frequency comparison
HLA AF were calculated by direct gene count (denominator 2n). All HLA subtypes with AF > 0.001 in at least one cohort were compared using two-tailed Fisher’s exact test; here, multiple comparisons were addressed using q-values, the p-value analogue of the false discovery rate42. Results with p < 0.05 and q < 0.2 were considered statistically significant.
Univariable and multivariable analyses of HLA-HIV clinical parameter associations
HLA allele associations with five clinical parameters previously described to be predictive of HIV disease progression were investigated. These included pVL43 and absolute CD4 count44, used in routine clinical monitoring of HIV infection, as well as CD4 percentage45, CD4/CD8 ratio45, and a proxy variable called Z-score that combines information from both CD4 count and pVL46, calculated as
(sd = standard deviation), where lower pVL and higher CD4 counts yield higher and more positive Z-scores. MEX and CAM cohorts were analyzed separately and in a combined fashion, where all HLA alleles observed in a minimum of 5 individuals (representing AF ≥ 0.0015 and ≥0.0008 for the individual and combined cohorts, respectively) were evaluated. This encompassed 147 HLA alleles (42 HLA-A, 75 HLA-B and 30 HLA-C) in the pooled MEX/CAM analysis; 119 HLA alleles (33 HLA-A, 59 HLA-B and 27 HLA-C) in the MEX analysis alone and 123 (35 HLA-A, 60 HLA-B and 28 HLA-C) in the CAM cohort alone. The Mann-Whitney U test was used to evaluate associations between each HLA allele (treated as a binary variable; e.g. comparing B*57:01+ vs. B*57:01− individuals) and the parameter of interest. Univariable analysis was also performed using Generalized Linear Models (GLM) to estimate HLA-HIV association effect sizes. Multiple comparisons were addressed using q-values47, where associations with p-values < 0.05 and q-values < 0.2 were considered significant. We additionally instituted a scoring system where, for each HLA allele investigated, we summed its total number of significant protective and risk associations (each assigned + 1 and −1, respectively), such that final scores could range from +5 to −5. Alleles with no significant associations were assigned a score of 0. Two multivariable linear regression models were also constructed to account for potential confounders of HLA-HIV associations, after which the overall 5-parameter scores were adjusted accordingly. First, independent models were constructed relating each HLA allele to each HIV clinical parameter, while adjusting for gender, age, geographical origin (country/region coded as n-1 binary variables) and the effect of the most significant HLA associations for that parameter (defined as the HLA alleles with p < 0.001 in the corresponding Mann-Whitney univariable analysis). Specific HLA alleles included in each model are listed in Supplementary Table S9. Second, we constructed models adjusting for all HLA subtypes in significant (p < 0.05 and q < 0.2) linkage disequilibrium with every HLA allele associated with an HIV clinical parameter in that cohort. Statistical analyses were undertaken using Stata/MP v14.1 (StataCorp, College Station, TX) and R v3.3.348.
Results
MEX and CAM cohort characteristics
Overall, the clinical and demographic characteristics of the Mexico (MEX; n = 1679), Central America (CAM; n = 1534) and combined (n = 3213) cohorts (Table 1) are consistent with the frequent diagnosis of HIV in advanced infection in Latin America and an concentrated epidemic mainly in men who have sex with men49. Median pVL in MEX was significantly higher (p < 0.00001) than that in CAM (4.72 versus 4.57 Log10 HIV RNA copies/mL), though CD4 T cell counts and CD4/CD8 ratios did not differ significantly between the cohorts (median 315 cells/µL and ~0.28, respectively). CD4+ T cell percentages and Z-scores46 also differed marginally between cohorts (p < 0.01).
Unique immunogenetic profiles of HIV-infected individuals in Mesoamerica
HLA class I allele frequencies differed markedly in the combined MEX/CAM versus HOMER cohorts (Fig. 1). The most frequent HLA-A, -B and -C alleles observed in MEX/CAM were A*02:01 (AF = 0.20), A*24:02 (0.15), A*02:06 (0.06), B*35:01 (0.10), B*40:02 (0.06), B*39:05 (0.05), C*04:01 (0.19), C*07:02 (0.18) and C*07:01 (0.07). Of the 153 alleles investigated, 52.9% (81/153) exhibited significantly (p < 0.05, q < 0.2) different frequencies between MEX/CAM and HOMER; these included 25/45 HLA-A, 39/77 HLA-B and 17/31 HLA-C alleles. As expected, Amerindian HLA alleles27,28 were enriched and Caucasian HLA alleles50 were underrepresented in MEX/CAM compared to HOMER. Notably, MEX/CAM featured a diversity of “typical” Amerindian HLA subtypes belonging to the A*02, A*68, B*15, B*35, B*39, and B*40 allele groups.
Comparison of HLA class I allele frequencies between the Mestizo MEX/CAM cohort (n = 3213) and the mainly Caucasian HOMER cohort (n = 1622). Allele frequencies (2n) were calculated using the HLA Analysis tool from Los Alamos HIV Database (https://www.hiv.lanl.gov); all HLA AF > 0.001 in at least one cohort are shown here. AF were compared using Fisher’s exact test, with multiple tests addressed using q-values42. Significant differences (p < 0.05, q < 0.2) are denoted by a star.
Strong HLA LD allowed us to identify common 2 and 3-loci haplotypes in our populations (Fig. 2 and Supplementary Tables S2-S7). A total of 104 distinct 2-loci HLA haplotypes were identified in MEX (Bonferroni, p < 3.9E-06; Supplementary Table S2), of which two had population frequency (PF) ≥ 0.10 (B*39:05~C*07:02 [PF = 0.1351] and B*35:01~C*04:01 [PF = 0.12]); a further 35 had PF between 0.02–0.10 (Fig. 2A). Also, 36 distinct 3-loci HLA haplotypes were identified in MEX (Bonferroni, p < 8.0E-06; Supplementary Table S3). In CAM, 95 distinct 2-loci HLA haplotypes were identified (Bonferroni p < 3.5E-06; Supplementary Table S4), of which one had PF ≥ 0.10 (B*35:01~C*04:01 [PF = 0.13] and 33 had PF between 0.02–0.10 (Fig. 2B). A further 30 distinct 3-loci HLA haplotypes were identified in CAM (Bonferroni, p < 8.2E-06; Supplementary Table S5). Analyzing MEX/CAM as a whole, 154 distinct 2-loci and 87 distinct 3-loci were identified (Bonferroni p < 2.3E-06 and p < 4.2E-06, respectively; Supplementary Tables S6–S7).
HLA class I haplotype structures and linkage disequilibrium in the MEX (panel A) and CAM (panel B) cohorts. HLA loci are stacked vertically, with each orange tile representing a specific HLA subtype, and with segments connecting linked alleles on adjacent loci. The height of each tile and the thickness of each segment correspond to HLA allele and haplotype frequencies, respectively. The most frequent HLA allele pairs (two-loci) found to be in linkage disequilibrium are highlighted in green (PF > 0.10) and blue (PF < 0.10 and > 0.02); less frequent pairs (PF < 0.02) are shown in grey. Frequently linked (PF < 0.10 and > 0.02) HLA-A and HLA-C allele pairs were also found in our cohorts including A*33:01/C*08:02, A*29:02/C*16:01, A*68:01/C*03:04, and A*68:03/C*07:02 in the MEX cohort, and A*02:06/C*07:02, A*24:02/C*01:02, A*24:02/C*03:05, and A*68:03/C*07:02 in the CAM cohort (not shown in the figure; see Supplementary Tables S4 and S6).
Overall, these findings highlight the immunogenetic uniqueness of Mesoamerican Mestizo populations and support them as ideal for identifying novel HLA correlates of HIV control.
HLA associations with HIV pVL in MEX/CAM include both canonical and novel associations
Given the significantly different HLA frequency distributions between MEX/CAM and HOMER, we hypothesized that HLA associations with HIV parameters would also differ markedly. We explored this by identifying HLA allele associations with the well-characterized marker of HIV disease progression pVL43 in both cohorts, in a univariable analysis. At the predefined statistical threshold of q < 0.2, we identified thirteen HLA alleles (5 HLA-A, 5 HLA-B, and 3 HLA-C) significantly associated with lower pVL, and four alleles (2 HLA-A, and 2 HLA-B) associated with higher pVL in HOMER, compared to sixteen HLA alleles (1 HLA-A, 7 HLA-B, and 8 HLA-C) associated with lower pVL, and twelve alleles (4 in HLA-A, 5 in HLA-B, and 3 in HLA-C) associated with higher pVL in MEX/CAM (Fig. 3). A number of these associations were consistent across cohorts: in all cases these were HLA alleles previously reported to be associated with HIV progression, including the canonical protective alleles B*57:01 and B*27:051,9,11,12,16,20,21,23,24,25 which were associated with significantly lower pVL in both cohorts. Additionally, as previously described, A*30:0214,24,25 (LD with B*18:01 and C*05:01 in both cohorts), C*02:021,23 (LD with B*27:05 in both cohorts) and C*14:025,23 (LD with B*51:01 in both cohorts) were associated with lower pVL in both cohorts; while A*68:0117,20 (LD with C*03:04 in MEX/CAM and with C*07:04 in HOMER) was associated with higher pVL in both cohorts. Numerous previously-reported HLA associations were also confirmed in one of the two cohorts. Specifically, A*25:013,11, A*32:011,11, B*14:013,16, and B*13:021,17,24,25 were associated with lower pVL exclusively in HOMER; while B*07:021,3,17,24,25, B*55:0152,53, and A*23:0111,18 were associated with higher pVL in HOMER. Moreover, B*14:023,16, B*57:035,8,15,16,17,19,23, B*58:015,12,14,17,20,22, B*81:015,12,17,20, C*03:0217 (LD with B*58:01), C*03:0524 (LD B*40:02, A*24:02), C*08:0220 (LD B*14:01/02), C*12:0220,22 (LD B*52:013), C*14:0322,23 (no significant LD found), and C*18:0114,17,20,25 (LD B*57:01/02 and B*81:01) were associated with lower pVL only in MEX/CAM; while B*35:015,6,11,13,22,23, B*35:021,6,9,11,13,23 (which exhibited the highest median pVL of all HLA-B alleles), A*24:021,11,18 (LD B*39:06, B*40:02, C*03:05), C*04:016,16,22,23 (LD B*35:01/02/08/12/14/16/17/20, B*07:02, B*53:01, among others), and C*07:021,14 (LD B*39:01/05/06/08/11, B*07:02, among others) were associated with higher pVL in MEX/CAM.
HLA associations with HIV pVL in the MEX/CAM and HOMER cohorts. Associations between HLA subtypes and pVL were investigated for HLA alleles with frequency equal or greater than 5 in HIV-1 clade B-infected ART-naïve individuals from the MEX/CAM (n = 3213) and predominantly Caucasian HOMER (n = 1622) cohorts. Associations between HLA alleles and pVL were evaluated using the Mann-Whitney U test, with multiple tests addressed using q-values. Significant (p < 0.05, q < 0.2) associations are highlighted in blue. Boxes denote median, 25th and 75th percentile, and whiskers represent the 10–90th percentile of pVL distributions of individuals expressing each HLA-B allele (left panels) and HLA-A/C allele (right panels). HLA alleles are ordered by their pVL median and the number (n) of individuals expressing each HLA allele is shown. Red vertical lines denote plasma viral load medians for each cohort.
Numerous novel associations were also identified in MEX/CAM. These included the relatively rare Amerindian allele B*15:39 (n = 8, p = 0.0052, q = 0.04), which was associated with lower pVL, as well as the common Amerindian alleles B*35:12 (n = 273, p = 0.0002, q = 0.003), B*39:05 (n = 360, p = 0.0026, q = 0.0034), B*39:06 (n = 153, p = 0.011, q = 0.07), A*68:03 (n = 370, p = 0.0079, q = 0.061; LD B*39:05, B*35:43, C*07:02), and C*03:04 (n = 427, p = 0.0339, q = 0.15; LD B*40:02/01/05/08/11, A*68:01), which were associated with significantly higher pVL (Fig. 3). Notably, these alleles, though present at relatively high frequency in MEX/CAM, are rare or absent in Caucasian and African populations (e.g. Fig. 1 and ref.50).
Additional novel associations between Amerindian HLA alleles and HIV risk/protection
We next extended our analyses of our Mesoamerican cohorts to identify HLA alleles associated with an expanded panel of five HIV clinical parameters (pVL, CD4 count, Z-score, %CD4, and CD4/CD8 ratio). HLA alleles associated with each of the 5 clinical parameters, stratified by cohort, are listed in Table 2. Consistent with previous reports12, HLA-B alleles featured prominently among these associations, with 43 HLA-B alleles (24 in MEX/CAM, 11 in MEX, and 8 in CAM) associated with at least one clinical parameter (p < 0.05, q < 0.2) compared to 29 HLA-A alleles (13 in MEX/CAM, 5 in MEX, and 11 in CAM) and 29 HLA-C alleles (14 in MEX/CAM, 9 in MEX, and 6 in CAM).
As a novel way to quantify HLA associations with HIV clinical parameters, we instituted a scoring system that summed each HLA allele’s total number of significant protective and risk associations (each assigned +1 and −1, respectively), such that final scores ranged from +5 to −5 (Fig. 4). Alleles with no significant associations were assigned a score of 0. As expected, a given HLA allele’s overall association score correlated significantly in a dose-dependent manner with its associated median value for all 5 clinical parameters in both MEX/CAM and individual cohorts (Supplementary Figure S1); also, HLA subtypes associated with pVL (Fig. 3) generally tended to be associated with other clinical parameters (e.g. 22 of 28, 78.5% in MEX/CAM), though some exceptions were noted (Fig. 4).
HLA-HIV associations in Mesoamerican cohorts using 5 HIV clinical parameters (univariable analysis). Associations between the expression of HLA class I alleles and 5 HIV clinical parameters (pVL, CD4 count, Z-score, CD4% and CD4/CD8 ratio) were investigated for alleles with frequency equal or greater than 5 in HIV-1 clade B-infected ART-naïve individuals from the pooled MEX/CAM cohort (A), only in MEX cohort (B) or only in CAM cohort (C). Associations were evaluated using the Mann-Whitney U test and multiple tests were addressed using q-values. Boxplots of only significant (p < 0.05, q < 0.2) HLA-HIV associations are shown. Alleles are grouped by HLA-HIV score ( + 5 to −5), then ordered by the median of Z-score, pVL, CD4 count, %CD4 and CD4/CD8 ratio. Protective and risk alleles are shaded with progressively deeper green and orange colors, respectively. Boxes denote the median, 25th and 75th percentile of the HIV clinical parameter of interest; whiskers represent the 10–90th percentile. The number (n = ) of individuals expressing each HLA allele is shown. Blue vertical lines denote cohort median values for each parameter.
In the MEX/CAM cohort, 7 HLA alleles (B*57:03, C*18:01 [LD B*57:02/03, B*81:01], B*27:05, B*57:01, B*14:02, C*08:02 [LD B*14:01/02], and C*02:02 [LD B*27:05]) achieved the highest protective score of +5 (Fig. 4A). The next highest-scoring group (+4) included B*42:01, A*02:05 (LD B*50:01, B*58:01), A*03:01 (new association, LD B*07:02, B*39:05), and A*11:01 (LD B*39:05); in all cases these alleles were significantly associated with all clinical variables except pVL. A total of 20 additional HLA alleles scored between +1 and +3; these included Amerindian alleles B*15:39 (LD C*03:03) and B*39:02 (LD C*07:02), both scoring +2, identifying these as novel protective alleles.
Similarly, 3 of the 5 HLA alleles that achieved the most negative score (−5) in MEX/CAM were Amerindian alleles; namely B*39:05 (LD C*07:02, A*68:03, A*02:06, A*23:01), B*35:12 (LD C*04:01, A*02:01) and A*68:03 (LD B*39:05, C*07:02) (Fig. 4A), identifying these as novel risk alleles. A total of 15 additional HLA alleles scored between −4 and −1; among these Amerindian alleles B*15:30 (LD C*01:02, score -3), B*39:06 (LD C*07:02, A*24:02; score -2), A*68:05 (score -2), and B*35:14, C*03:04 (LD B*40:02/01/05/08/11, A*68:01), B*39:01 (LD C*07:02, C*12:03) and A*02:06 (LD B*39:05, C*07:02), (score -1), were also identified as novel risk alleles. Additional risk alleles included B*35:01 (score -4, LD C*04:01, C*07:01, C*08:02, C*01:02), previously described as detrimental in some settings5,6,11,13,22,23 and B*13:02 (score -1, LD C*06:02, A*30:01) previously described as protective1,17,24,25. Analysis of MEX and CAM cohorts separately yielded results that corroborated the pooled analysis, but also revealed a small number of additional associations (Fig. 4B,C). In MEX, the latter included previously reported risk associations between HLA alleles and HIV disease progression including C*16:0120, A*23:0111,18 and B*37:0111,22 (all scoring -1) as well as a novel risk association with A*24:03 (score -1). In CAM, these included a protective association with B*51:0723 (score + 1), and risk associations with C*03:0322 and A*36:015,24 (both scoring −1).
To further quantify effects of HLA alleles on HIV parameters, we repeated univariable analyses using generalized linear models (Supplementary Table S8). HLA-HIV scores derived from the GLM analysis were highly concordant with those obtained using the Mann-Whitney U test (Spearman rho >0.9 in the combined and individual cohorts, in all cases p < 0.0001, Supplementary Figure S2).
We next performed a multivariable analysis to correct for gender, age, geographic location of recruitment, as well as HLA alleles with the most significant associations for each HIV clinical parameter in the univariable analysis in both pooled and individual cohorts. Resulting regression coefficients and 95% confidence intervals are shown in Fig. 5 and Supplementary Table S9. Overall, HLA-HIV scores of univariable and multivariable analyses were highly concordant in all cohorts (Spearman’s rho 0.8277, 0.8566, and 0.8057 in MEX/CAM, MEX and CAM respectively, all p < 0.0001; Fig. 6A–C). Importantly, the majority (10/15, 66.6%) of novel Amerindian HLA-HIV associations identified in the MEX/CAM univariable analyses remained significant following adjustment for these parameters; these included A*03:01, B*15:39, and B*39:02 as protective alleles and A*01:02, A*68:03, A*68:05, B*15:30, B*35:12, B*39:05 and C*03:04 as risk alleles. When cohorts were analyzed separately, A*03:01 (protective), A*68:03, B*35:12, and B*39:05 (risk) remained significant after multivariable adjustment in both cohorts.
Multivariable analysis of HLA-HIV associations in Mesoamerican cohorts using 5 HIV clinical parameters. An independent linear regression model (GLM) was constructed for each HLA allele and clinical parameter, while adjusting for gender, age, geographical origin (country or region) and the presence of HLA alleles with p < 0.001 in the Mann-Whitney univariable analyses for each clinical parameter; see Supplementary Table S9 for specific HLA alleles adjusted for in each model). Coefficients and 95% confidence intervals (CI) of significant (p < 0.05, q < 0.2) associations are shown. Alleles are grouped by HLA-HIV score (+5 to −5), then ordered by the coefficient of Z-score, pVL, CD4 count, %CD4 and CD4/CD8 ratio. Protective and risk alleles are shaded with progressively deeper green and orange colors, respectively. The number (n = ) of individuals expressing each HLA allele is shown. Blue vertical lines denote a coefficient equal to zero.
Correlation of 5-parameter HLA-HIV association scores within and between cohorts Scatter plot of univariable (Mann-Whitney) and multivariable HLA-HIV scores in the combined MEX/CAM (A), MEX (B) and CAM (C) cohorts. Panel (D) shows a scatter plot of univariable HLA-HIV scores between MEX and CAM cohorts. Correlation between HLA-HIV scores were determined using Spearman’s rank test. Random jittering was used to prevent dots being superimposed. For A, B & C panels only HLA alleles with at least one association with a HIV clinical parameter were considered.
A second multivariable analysis was performed to account for HLA LD effects (Supplementary Table 10). The majority of HLA-HIV associations identified in univariable analyses of the pooled MEX/CAM cohort remained significant after multivariable correction: these included the novel protective alleles A*03:01 (LD B*07:02, B*39:05), B*15:39 (LD C*03:03) and B*39:02 (LD C*07:02), and the detrimental A*01:02 (LD B*49:01), A*02:06 (LD C*07:02, B*39:05, B*39:08, C*08:01), A*68:03 (LD B*39:05, C*07:02, B*35:43), B*15:30 (LD C*01:02), B*35:12 (LD C*04:01, A*02:01), and C*03:04 (LD B*40:01/02, B*15:10, B*40:05/08/11, A*68:01) alleles. However, in some cases, the very strong LD between certain allele pairs (notably the protective B*14:02~C*08:02 haplotype, which exhibited the strongest LD in the cohort [p = 8.0E-195], the risk B*39:05~C*07:02 haplotype [LD p = 3.9E-144], and the risk B*40:01~C*03:04 haplotype [LD p = 2.6E-23]) precluded identification of the allele driving the association. Moreover, several HLA-C associations (e.g. C*02:02 as protective and C*04:01 as risk) could be partially or completely explained by their LD with HLA-B alleles: e.g. B*27:05 for C*02:02, and B*35:01/02/12 for C*04:01 (the apparent effect of C*04:01 in strong LD with B*35 alleles has been resolved previously9). Furthermore, some HLA-B alleles did not remain significant after LD correction. Among these were the new Amerindian risk alleles B*39:01 and B*39:06, which are both in strong LD with C*07:02. As the latter allele is also in extremely strong LD with the risk and highly frequent allele B*39:05, it is possible that inclusion of C*07:02 in the model confounds our ability to validate B*39:01 and B*39:06 as independent risk factors.
Secondary analyses using only pVL and CD4 count
Although the use of the 5-parameter scoring system enhanced sensitivity to identify associations (see discussion), HLA/HIV association studies have traditionally used pVL and CD4 count only5,11,17,20. To facilitate direct comparison of our results to previous studies, results based on pVL and CD4 count only are provided in Supplementary Tables S8–S9. Correlation between the results of the 5- and 2-parameter scoring systems were robust for both univariable (all rho >0.93, in all cases p < 0.0001; Supplementary Figure S3, panels A-C) and multivariable analyses (all rho > 0.88, in all cases p < 0.0001; Supplementary Figure S3, panels D-F).
Regional variation in HLA-HIV associations between MEX and CAM cohorts
Finally, we wished to investigate the extent to which HLA-HIV associations in MEX and CAM were universal versus region-specific. Taking the union of all HLA alleles that achieved a non-zero score in univariable analyses in either the MEX and/or CAM cohorts, we assessed the degree to which their scores correlated between cohorts using Spearman’s correlation. Overall, the relationship was statistically significant but the strength of the correlation was rather modest (Spearman rho = 0.334, p = 0.032, Fig. 6D). A total of 8 associations were shared between the MEX and CAM cohorts (protective: B*57:01, B*57:03, and A*03:01; detrimental: B*39:05, A*68:03, C*07:02, B*35:12, and C*04:01), while 16 were exclusive to MEX and 16 to CAM. It is important to note that these differences could be attributable, at least in part, to the significant HLA AF distribution differences between cohorts (Supplementary Figure S4), which in turn influences power to identify individual HLA associations below a predefined significance threshold. Nevertheless, this observation suggests that, while many HLA-HIV associations are common to both regions, others may be region-specific.
Discussion
In the present study, the largest and most comprehensive of its kind undertaken to date in the unique and highly genetically admixed Latin American Mestizo population, we characterized HLA allele frequency distributions, haplotype structures and identified both canonical and novel associations between HLA alleles and HIV control. To accomplish the latter, we implemented a novel scoring system based on five partially interrelated clinical parameters, an approach that allowed more nuanced classification of the consistency of associations observed. Indeed, despite the cross-sectional nature of our study and the fact that we investigated these associations during chronic HIV infection, we readily detected numerous canonical associations with HIV control (e.g. in MEX/CAM analysis: B*57:01/03, B*27:05 and B*14:02; in MEX: B*14:02 and B*27:05; in CAM: B*57:02; all with +5 score), even though the frequencies of some of these alleles are significantly lower in MEX/CAM than in other global populations. The 5-parameter scoring system also allowed us to detect known protective alleles that were not identified using pVL and CD4 only, including B*42:015,17,20, B*57:0217,26 and B*81:015,12,17,20; furthermore it allowed us to distinguish strong association effects from weaker ones (e.g. all canonical protective associations, including B*57:01, B*57:03 and B*27:05 were associated with all 5 parameters). Confirmation of numerous other HLA associations previously reported in other HIV clade B-infected populations (e.g. B*57:011,3,9,11,16,21,23, B*57:0316, B*27:051,3,9,11,23, B*44:0322, B*58:0122, B*14:023,16, A*25:011,11, B*35:013,22, B*35:021,6,9,11,23) supports the accuracy of our analysis, and increases confidence in the novel associations found. Among the latter, HLA alleles enriched in Amerindian populations featured prominently, particularly among the risk alleles. These included A*03:01, B*15:39 and B*39:02 (identified as protective) and A*01:02, A*24:03, A*68:03/05, B*15:30, B*35:12/14, B*39:01/06, B*39:05~C*07:02, and B*40:01~C*03:04 (identified as risk). Importantly, most protective and risk HLA alleles remained significant after accounting for potential confounding factors including age, gender, geographical location, HLA alleles with significant effects for each HIV clinical parameter, and HLA linkage disequilibrium, adding further confidence to our findings.
Of note, many of the novel risk HLA alleles identified in the present study are relatively common in Mesoamerica. Indeed, 35.2% (1133/3213) individuals in the MEX/CAM cohort, 40.2% (676/1679) in the MEX cohort and 29.7% (457/1534) in the CAM cohort expressed at least one risk HLA allele. Prevalence of risk alleles was particularly high in Guatemala (48%), El Salvador (42%) and the Southeastern Mexican states of Campeche, Chiapas, Quintana Roo, Tabasco, Veracruz and Yucatan (overall 48%). These observations underscore the importance of implementing and sustaining culturally and locally-appropriate HIV prevention, diagnostic and early antiretroviral treatment programmes, particularly in regions characterized by populations with high Amerindian admixture.
Interestingly, both protective and risk HLA subtypes were identified within the common and genetically diverse Amerindian HLA-B*39 allele group, including B*39:02 (protective) and B*39:05, B*39:01 and B*39:06 (risk). HIV control by B*39:02 might be explained by the presence of Glutamine in position 63 and Serine in position 67 (HLA-B amino acid positions previously associated with HIV control in genome-wide association studies3,54) within the HLA peptide binding groove, which distinguish this allele from other HLA-B*39 molecules. It is well-established that a difference of only one or two amino acids can influence the risk/protective status of closely-related HLA subtypes (e.g. between B*42:01 and B*42:02; B*57:03 and B*57:02; B*35:01 and B*35:02/03; B*44:02 and B*44:039,55,56,57,58,59,60,61,62,63), suggesting that a similar mechanism could underlie the differences between HLA-B*39 subtypes. Of note, B*39:02 attained a protective score of +2 in univariable (%CD4, CD4/CD8) and +3 in multivariable (pVL, %CD4, CD4/CD8) analyses in the pooled (MEX/CAM) and +2 in univariable (pVL, CD4/CD8) and +3 in multivariable (pVL, %CD4, CD4/CD8) analyses in MEX-only cohort (though it was not identified as significantly protective in CAM only, possibly as a result of significantly lower power on account of its lower frequency in CAM [AF = 0.009] compared to MEX [AF = 0.02]; p = 0.0004, q < 0.001 [Supplementary Figure S4]). Moreover, B*39:02-associated protection was not attributable to co-expression of other advantageous HLA-B alleles; in fact, a number of B*39:02-expressing individuals co-expressed risk alleles including B*35:01/02/03/12 and B*39:06, suggesting that B*39:02-associated protective effects may overcome risks associated with deleterious HLA alleles. Consistent with anti-Gag (particularly p24) CD8+ T-cell responses as key determinants of HIV control61,64,65,66,67, we previously identified two highly significant B*39:02-associated polymorphisms in Gag (at codons 315 and 319, both in p24) and three in Pol (at codons 70 and 79 in protease, and 322 in reverse transcriptase)40, suggesting strong and reproducible targeting of these regions by B*39:02-restricted T-cell responses as possible determinants of B*39:02-mediated HIV control, where such effects remain detectable into chronic HIV-1 infection. Interestingly, B*39:02 has been previously associated with Takayasu’s arteritis, a rare autoimmune disease, in Japanese and Mexican populations68,69,70,71,72,73, and also with spondyloarthropathies in Japan74, observations which mirror established links between HLA alleles canonically associated with HIV protection (e.g. B*27:05) and autoimmune conditions75,76,77.
In contrast, the common Amerindian B*39:05 subtype (the 3rd most frequent HLA-B allele in MEX/CAM [AF = 0.057]) consistently ranked among the strongest risk alleles (score -5 MEX/CAM, -4 MEX and -4 CAM) even after multivariable correction (though it is important to note that this allele is in strong LD with C*07:02). Notably, B*39:05 was not identified as mounting strong immune pressures in Gag p24, but rather in Pol40 (at codons 37, 134 and 296). Similarly, no significant Gag codons have been identified as being under strong immune pressure by the Amerindian B*39:01, B*39:06 and B*35:14 alleles40, which were also identified as risk alleles in univariable analyses in the combined MEX/CAM cohort. The common Amerindian B*35:12 subtype (the 6th most frequent HLA-B allele in MEX/CAM [AF = 0.042]) was also consistently identified as a risk allele (univariable score -5 in MEX/CAM, -3 in MEX and -3 in CAM; multivariable score -1 in MEX/CAM, -1 in MEX and -2 in CAM). Indeed, by virtue of its similarity with other B*35-Px members, notably B*35:02/03, at positions 114 and 116 in the HLA peptide binding groove78, B*35:12 represents a putative new member of the established B*35-Px HIV risk allele group9.
A number of caveats and considerations merit mention. First, it is important to note the interrelated nature of the five clinical parameters evaluated (e.g. in the MEX/CAM cohort, Spearman rho between pVL and CD4, %CD4 and CD4/CD8 was -0.54 in all cases; Spearman rho between pVL and Z-score was −0.87; Spearman rho for CD4 and Z-score, %CD4 and CD4/CD8 was > 0.79 in all cases). Despite these relatively strong relationships however, pVL and CD4 are nevertheless well-established as independent predictors of HIV progression; as such it is customary to include both in HLA association studies (e.g.17,19). Similarly, inclusion of all 5 variables in our scoring system afforded us increased sensitivity to detect both previously-described HLA-HIV associations (e.g. in MEX/CAM univariable: B*57:0217,26 [score + 2, %CD4 and CD4/CD8], B*44:035,17,22 [score + 1, CD4/CD8] and C*08:0120 [score + 1, %CD4]; in MEX/CAM multivariable: B*42:015,17,20 [score + 2, %CD4 and CD4/CD8], B*57:0217,26 [score + 2, %CD4 and CD4/CD8] and C*08:0120 [score + 1, CD4/CD8]) as well as novel ones (e.g. in MEX/CAM univariable: B*39:02 [score + 2, %CD4, CD4/CD8], A*68:05 [score -2, %CD4 and CD4/CD8], B*35:14 [score -1, Z-score] and B*40:01 [score −2, %CD4 and CD4/CD8]). It also allowed us to identify associations with rare HLA alleles (e.g. in MEX/CAM univariable: B*40:0622 [n = 8, score + 2], B*81:015,12,17,20 [n = 16, score + 2], C*14:0322 [n = 6, score + 1]; in MEX/CAM multivariable: B*57:0217,26 [n = 7, score +2] and B*81:015,12,17,20 [n = 16, score -1]). Similarly, the observation that canonical protective associations including B*57:01, B*57:03 and B*27:05 all achieved the highest protective score (+5) also increases confidence in our identification of B*39:05 and B*35:12, which both achieved the lowest scores (−5), as novel risk alleles. Moreover, the observation that some HLA alleles showed associations with different HIV clinical parameters raises the intriguing possibility that certain HLA subtypes influence HIV progression through different mechanisms (though insufficient power to detect relatively weak associations, combined with the use of a predefined significance threshold, cannot be ruled out). As with all HLA association studies, strong linkage disequilibrium, particularly between HLA-B and HLA-C loci located in close proximity within the MHC’s beta block gene group79,80,81 makes it difficult to tease apart individual allele effects in some cases. Associations between certain HLA-C alleles and HIV clinical parameters may thus be partially or completely attributable to LD with HLA-B alleles (e.g. B*27:05 for C*02:02 and B*35:12 for C*04:01). Moreover, for three HLA-B~C combinations (B*14:02~C*08:02, B*39:05~C*07:02 and B*40:01~C*03:04), the allele responsible for the observed association could not be resolved due to strong LD. Similarly, we cannot rule out the possibility of additive and/or synergistic effects between HLA alleles. Finally, the risk of spurious associations is an ever-present concern in HLA association studies, particularly when reporting novel associations with relatively rare alleles (e.g. the protective B*15:39 or the detrimental A*01:02 alleles, both observed in 8 individuals only). Validation of these novel associations in other cohorts, along with elucidation of possible mechanisms, are therefore warranted. In particular, the evaluation of HIV-specific CTL responses in Latin American individuals expressing protective or risk HLA alleles should provide important mechanistic insights. Despite these caveats, our study further confirms that some HLA allele associations (e.g. B*27:05 and B*57:01) transcend the boundaries of race and HIV subtype, whereas others are likely to be particular to the unique immunogenetic background of the population under study.
Results of our study may also be relevant to the ultimate pursuit of effective HIV vaccines, whether these be prophylactic or therapeutic, global or geographically-tailored. In particular, detailed characterization (and continuous monitoring82) of HLA associations with HIV clinical parameters in ethnically diverse human populations hardest hit by the HIV epidemic may help inform the design of CTL epitope-based vaccine constructs83, predict the relative population coverage of such constructs, and ultimately aid in the interpretation of results from future HIV vaccine trials in a population-specific context.
References
Fellay, J. et al. Common genetic variation and the control of HIV-1 in humans. PLoS Genet 5, e1000791, https://doi.org/10.1371/journal.pgen.1000791 (2009).
Fellay, J. et al. A whole-genome association study of major determinants for host control of HIV-1. Science 317, 944–947, https://doi.org/10.1126/science.1143767 (2007).
International_HIV_Controllers_Study et al. The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science 330, 1551–1557, https://doi.org/10.1126/science.1195271 (2010).
Kroner, B. L. et al. Concordance of human leukocyte antigen haplotype-sharing, CD4 decline and AIDS in hemophilic siblings. Multicenter Hemophilia Cohort and Hemophilia Growth and Development Studies. AIDS 9, 275–280 (1995).
Carlson, J. M. et al. Widespread impact of HLA restriction on immune control and escape pathways of HIV-1. Journal of virology 86, 5230–5243, https://doi.org/10.1128/JVI.06728-11 (2012).
Carrington, M. et al. HLA and HIV-1: heterozygote advantage and B*35-Cw*04 disadvantage. Science 283, 1748–1752 (1999).
Carrington, M. & O’Brien, S. J. The influence of HLA genotype on AIDS. Annual review of medicine 54, 535–551, https://doi.org/10.1146/annurev.med.54.101601.152346 (2003).
Costello, C. et al. HLA-B*5703 independently associated with slower HIV-1 disease progression in Rwandan women. AIDS 13, 1990–1991 (1999).
Gao, X. et al. Effect of a single amino acid change in MHC class I molecules on the rate of progression to AIDS. N Engl J Med 344, 1668–1675, https://doi.org/10.1056/NEJM200105313442203 (2001).
Honeyborne, I. et al. Control of human immunodeficiency virus type 1 is associated with HLA-B*13 and targeting of multiple gag-specific CD8+ T-cell epitopes. Journal of virology 81, 3667–3672, https://doi.org/10.1128/JVI.02689-06 (2007).
Kaslow, R. A. et al. Influence of combinations of human major histocompatibility complex genes on the course of HIV-1 infection. Nature medicine 2, 405–411 (1996).
Kiepiela, P. et al. Dominant influence of HLA-B in mediating the potential co-evolution of HIV and HLA. Nature 432, 769–775, https://doi.org/10.1038/nature03113 (2004).
Klein, M. R. et al. Associations between HLA frequencies and pathogenic features of human immunodeficiency virus type 1 infection in seroconverters from the Amsterdam cohort of homosexual men. The Journal of infectious diseases 169, 1244–1249 (1994).
Koehler, R. N. et al. Class I HLA-A*7401 is associated with protection from HIV-1 acquisition and disease progression in Mbeya, Tanzania. The Journal of infectious diseases 202, 1562–1566, https://doi.org/10.1086/656913 (2010).
Lazaryan, A. et al. Human leukocyte antigen B58 supertype and human immunodeficiency virus type 1 infection in native Africans. Journal of virology 80, 6056–6060, https://doi.org/10.1128/JVI.02119-05 (2006).
Lazaryan, A. et al. The influence of human leukocyte antigen class I alleles and their population frequencies on human immunodeficiency virus type 1 control among African Americans. Human immunology 72, 312–318, https://doi.org/10.1016/j.humimm.2011.01.003 (2011).
Leslie, A. et al. Additive contribution of HLA class I alleles in the immune control of HIV-1 infection. Journal of virology 84, 9879–9888, https://doi.org/10.1128/JVI.00320-10 (2010).
MacDonald, K. S. et al. Influence of HLA supertypes on susceptibility and resistance to human immunodeficiency virus type 1 infection. The Journal of infectious diseases 181, 1581–1589, https://doi.org/10.1086/315472 (2000).
Matthews, P. C. et al. HLA-A*7401-mediated control of HIV viremia is independent of its linkage disequilibrium with HLA-B*5703. J Immunol 186, 5675–5686, https://doi.org/10.4049/jimmunol.1003711 (2011).
Matthews, P. C. et al. Co-operative additive effects between HLA alleles in control of HIV-1. PLoS One 7, e47799, https://doi.org/10.1371/journal.pone.0047799 (2012).
Migueles, S. A. et al. HLA B*5701 is highly associated with restriction of virus replication in a subgroup of HIV-infected long term nonprogressors. Proceedings of the National Academy of Sciences of the United States of America 97, 2709–2714, https://doi.org/10.1073/pnas.050567397 (2000).
Naruto, T. et al. HLA class I-mediated control of HIV-1 in the Japanese population, in which the protective HLA-B*57 and HLA-B*27 alleles are absent. Journal of virology 86, 10870–10872, https://doi.org/10.1128/JVI.00689-12 (2012).
O’Brien, S. J., Gao, X. & Carrington, M. HLA and AIDS: a cautionary tale. Trends in molecular medicine 7, 379–381 (2001).
Tang, J. et al. Human leukocyte antigens and HIV type 1 viral load in early and chronic infection: predominance of evolving relationships. PLoS One 5, e9629, https://doi.org/10.1371/journal.pone.0009629 (2010).
Tang, J. et al. Favorable and unfavorable HLA class I alleles and haplotypes in Zambians predominantly infected with clade C human immunodeficiency virus type 1. Journal of virology 76, 8276–8284 (2002).
Kloverpris, H. N. et al. HLA-B*57 Micropolymorphism shapes HLA allele-specific epitope immunogenicity, selection pressure, and HIV immune control. Journal of virology 86, 919–929, https://doi.org/10.1128/JVI.06150-11 (2012).
Barquera, R. et al. HLA class I and class II haplotypes in admixed families from several regions of Mexico. Molecular immunology 45, 1171–1178, https://doi.org/10.1016/j.molimm.2007.07.042 (2008).
Zuniga, J. et al. HLA class I and class II conserved extended haplotypes and their fragments or blocks in Mexicans: implications for the study of genetic diversity in admixed populations. PLoS One 8, e74442, https://doi.org/10.1371/journal.pone.0074442 (2013).
Brumme, Z. L. et al. Evidence of differential HLA class I-mediated viral evolution in functional and accessory/regulatory genes of HIV-1. PLoS Pathog 3, e94, https://doi.org/10.1371/journal.ppat.0030094 (2007).
Brumme, Z. L. et al. HLA-associated immune escape pathways in HIV-1 subtype B Gag, Pol and Nef proteins. PLoS One 4, e6687, https://doi.org/10.1371/journal.pone.0006687 (2009).
Avila-Rios, S. et al. HIV-1 drug resistance surveillance in antiretroviral treatment-naive individuals from a reference hospital in Guatemala, 2010-2013. AIDS Res Hum Retroviruses 31, 401–411, https://doi.org/10.1089/aid.2014.0057 (2015).
Avila-Rios, S. et al. HIV Drug Resistance Surveillance in Honduras after a Decade of Widespread Antiretroviral Therapy. PLoS One 10, e0142604, https://doi.org/10.1371/journal.pone.0142604 (2015).
Avila-Rios, S. et al. HIV Drug Resistance in Antiretroviral Treatment-Naive Individuals in the Largest Public Hospital in Nicaragua, 2011-2015. PLoS One 11, e0164156, https://doi.org/10.1371/journal.pone.0164156 (2016).
Mendoza, Y. et al. Molecular epidemiology of HIV-1 in Panama: origin of non-B subtypes in samples collected from 2007 to 2013. PLoS One 9, e85153, https://doi.org/10.1371/journal.pone.0085153 (2014).
Patterson, S. et al. Cohort Profile: HAART Observational Medical Evaluation and Research (HOMER) cohort. Int J Epidemiol 44, 58–67, https://doi.org/10.1093/ije/dyu046 (2015).
Listgarten, J. et al. Statistical resolution of ambiguous HLA typing data. PLoS computational biology 4, e1000016, https://doi.org/10.1371/journal.pcbi.1000016 (2008).
Siepel, A. C., Halpern, A. L., Macken, C. & Korber, B. T. A computer program designed to screen rapidly for HIV type 1 intersubtype recombinant sequences. AIDS Res Hum Retroviruses 11, 1413–1416, https://doi.org/10.1089/aid.1995.11.1413 (1995).
Cotton, L. A. et al. HLA class I sequence-based typing using DNA recovered from frozen plasma. J Immunol Methods 382, 40–47, https://doi.org/10.1016/j.jim.2012.05.003 (2012).
Mack, S. J. et al. Common and well-documented HLA alleles: 2012 update to the CWD catalogue. Tissue antigens 81, 194–203, https://doi.org/10.1111/tan.12093 (2013).
Soto-Nava, M. et al. Weaker HLA Footprints on HIV in the Unique and Highly Genetically Admixed Host Population of Mexico. Journal of virology 92, https://doi.org/10.1128/JVI.01128-17 (2018).
Kumasaka, N., Nakamura, Y. & Kamatani, N. The textile plot: a new linkage disequilibrium display of multiple-single nucleotide polymorphism genotype data. PLoS One 5, e10207, https://doi.org/10.1371/journal.pone.0010207 (2010).
Carlson, J., Heckerman, D. & G, S. False Discovery Rate Calculator for 2x2 Contingency Tables. (https://mscompbio.codeplex.com/wikipage?title=False%20Discovery%20Rate&referringTitle=Home, 2009).
Mellors, J. W. et al. Prognosis in HIV-1 infection predicted by the quantity of virus in plasma. Science 272, 1167–1170 (1996).
Mellors, J. W. et al. Plasma viral load and CD4+ lymphocytes as prognostic markers of HIV-1 infection. Ann Intern Med 126, 946–954 (1997).
Fahey, J. L. et al. The prognostic value of cellular and serologic markers in infection with human immunodeficiency virus type 1. N Engl J Med 322, 166–172, https://doi.org/10.1056/NEJM199001183220305 (1990).
Garrido-Rodriguez, D. et al. Killer cell immunoglobulin-like receptor and human leukocyte antigen gene profiles in a cohort of HIV-infected Mexican Mestizos. Immunogenetics 68, 703–717, https://doi.org/10.1007/s00251-016-0920-3 (2016).
Storey, J. D. A direct approach to false discovery rates. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 64, 479–498 (2002).
R., Core Team. R: A Language and Enviroment for Statistical Computing. R foundation for Statistical Computing. (http://www.r-project.org/) v. 3.3.3 (Vienna, Austria, 2013).
De Boni, R., Veloso, V. G. & Grinsztejn, B. Epidemiology of HIV in Latin America and the Caribbean. Curr Opin HIV AIDS 9, 192–198, https://doi.org/10.1097/COH.0000000000000031 (2014).
Gragert, L., Madbouly, A., Freeman, J. & Maiers, M. Six-locus high resolution HLA haplotype frequencies derived from mixed-resolution DNA typing for the entire US donor registry. Human immunology 74, 1313–1320, https://doi.org/10.1016/j.humimm.2013.06.025 (2013).
Avila-Rios, S. et al. Unique features of HLA-mediated HIV evolution in a Mexican cohort: a comparative study. Retrovirology 6, 72, https://doi.org/10.1186/1742-4690-6-72 (2009).
Dorak, M. T. et al. Influence of human leukocyte antigen-B22 alleles on the course of human immunodeficiency virus type 1 infection in 3 cohorts of white men. The Journal of infectious diseases 188, 856–863, https://doi.org/10.1086/378071 (2003).
Hendel, H. et al. New class I and II HLA alleles strongly associated with opposite patterns of progression to AIDS. J Immunol 162, 6942–6946 (1999).
McLaren, P. J. et al. Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load. Proceedings of the National Academy of Sciences of the United States of America 112, 14658–14663, https://doi.org/10.1073/pnas.1514867112 (2015).
Kloverpris, H. N. et al. HIV control through a single nucleotide on the HLA-B locus. Journal of virology 86, 11493–11500, https://doi.org/10.1128/JVI.01020-12 (2012).
Archbold, J. K. et al. Natural micropolymorphism in human leukocyte antigens provides a basis for genetic control of antigen recognition. The Journal of experimental medicine 206, 209–219, https://doi.org/10.1084/jem.20082136 (2009).
Bade-Doding, C. et al. The impact of human leukocyte antigen (HLA) micropolymorphism on ligand specificity within the HLA-B*41 allotypic family. Haematologica 96, 110–118, https://doi.org/10.3324/haematol.2010.030924 (2011).
Burrows, J. M. et al. The impact of HLA-B micropolymorphism outside primary peptide anchor pockets on the CTL response to CMV. Eur J Immunol 37, 946–953, https://doi.org/10.1002/eji.200636588 (2007).
Green, K. J. et al. Potent T cell response to a class I-binding 13-mer viral epitope and the influence of HLA micropolymorphism in controlling epitope length. Eur J Immunol 34, 2510–2519, https://doi.org/10.1002/eji.200425193 (2004).
Huang, K. H. et al. Progression to AIDS in South Africa is associated with both reverting and compensatory viral mutations. PLoS One 6, e19018, https://doi.org/10.1371/journal.pone.0019018 (2011).
Kiepiela, P. et al. CD8+ T-cell responses to different HIV proteins have discordant associations with viral load. Nature medicine 13, 46–53, https://doi.org/10.1038/nm1520 (2007).
Macdonald, W. A. et al. A naturally selected dimorphism within the HLA-B44 supertype alters class I structure, peptide repertoire, and T cell recognition. The Journal of experimental medicine 198, 679–691, https://doi.org/10.1084/jem.20030066 (2003).
Zernich, D. et al. Natural HLA class I polymorphism controls the pathway of antigen presentation and susceptibility to viral evasion. The Journal of experimental medicine 200, 13–24, https://doi.org/10.1084/jem.20031680 (2004).
Edwards, B. H. et al. Magnitude of functional CD8+ T-cell responses to the gag protein of human immunodeficiency virus type 1 correlates inversely with viral load in plasma. Journal of virology 76, 2298–2305 (2002).
Klein, M. R. et al. Kinetics of Gag-specific cytotoxic T lymphocyte responses during the clinical course of HIV-1 infection: a longitudinal analysis of rapid progressors and long-term asymptomatics. The Journal of experimental medicine 181, 1365–1372 (1995).
Riviere, Y. et al. Gag-specific cytotoxic responses to HIV type 1 are associated with a decreased risk of progression to AIDS-related complex or AIDS. AIDS Res Hum Retroviruses 11, 903–907 (1995).
Zuniga, R. et al. Relative dominance of Gag p24-specific cytotoxic T lymphocytes is associated with human immunodeficiency virus control. Journal of virology 80, 3122–3125, https://doi.org/10.1128/JVI.80.6.3122-3125.2006 (2006).
Kimura, A., Kitamura, H., Date, Y. & Numano, F. Comprehensive analysis of HLA genes in Takayasu arteritis in Japan. Int J Cardiol 54, Suppl, S61–69 (1996).
Kimura, A. et al. Mapping of the HLA-linked genes controlling the susceptibility to Takayasu’s arteritis. Int J Cardiol 75 Suppl 1, S105–110; discussionS111–102 (2000).
Kitamura, H., Kobayashi, Y., Kimura, A. & Numano, F. Association of clinical manifestations with HLA-B alleles in Takayasu arteritis. Int J Cardiol 66(Suppl 1), S121–126 (1998).
Renauer, P. & Sawalha, A. H. The genetics of Takayasu arteritis. Presse Med 46, e179–e187, https://doi.org/10.1016/j.lpm.2016.11.031 (2017).
Vargas-Alarcon, G. et al. Comparative study of the residues 63 and 67 on the HLA-B molecule in patients with Takayasu’s Arteritis. Immunol Lett 96, 225–229, https://doi.org/10.1016/j.imlet.2004.08.009 (2005).
Yoshida, M., Kimura, A., Katsuragi, K., Numano, F. & Sasazuki, T. DNA typing of HLA-B gene in Takayasu’s arteritis. Tissue antigens 42, 87–90 (1993).
Yamaguchi, A. et al. Association of HLA-B39 with HLA-B27-negative ankylosing spondylitis and pauciarticular juvenile rheumatoid arthritis in Japanese patients. Evidence for a role of the peptide-anchoring B pocket. Arthritis Rheum 38, 1672–1677 (1995).
Cortes, A. et al. Major histocompatibility complex associations of ankylosing spondylitis are complex and involve further epistasis with ERAP1. Nat Commun 6, 7146, https://doi.org/10.1038/ncomms8146 (2015).
Chen, H. et al. Psoriasis patients are enriched for genetic variants that protect against HIV-1 disease. PLoS Genet 8, e1002514, https://doi.org/10.1371/journal.pgen.1002514 (2012).
Matzaraki, V., Kumar, V., Wijmenga, C. & Zhernakova, A. The MHC locus and genetic susceptibility to autoimmune and infectious diseases. Genome Biol 18, 76, https://doi.org/10.1186/s13059-017-1207-1 (2017).
Juarez-Molina, C. I. et al. Impact of HLA-B*35 subtype differences on HIV disease outcome in Mexico. AIDS 28, 1687–1690, https://doi.org/10.1097/QAD.0000000000000322 (2014).
Dawkins, R. et al. Genomics of the major histocompatibility complex: haplotypes, duplication, retroviruses and disease. Immunol Rev 167, 275–304 (1999).
Yunis, E. J. et al. Inheritable variable sizes of DNA stretches in the human MHC: conserved extended haplotypes and their fragments or blocks. Tissue antigens 62, 1–20 (2003).
Romero, V. et al. Genetic fixity in the human major histocompatibility complex and block size diversity in the class I region including HLA-E. BMC Genet 8, 14, https://doi.org/10.1186/1471-2156-8-14 (2007).
Kawashima, Y. et al. Adaptation of HIV-1 to human leukocyte antigen class I. Nature 458, 641–645, https://doi.org/10.1038/nature07746 (2009).
Novitsky, V. et al. Identification of human immunodeficiency virus type 1 subtype C Gag-, Tat-, Rev-, and Nef-specific elispot-based cytotoxic T-lymphocyte responses for AIDS vaccine design. Journal of virology 75, 9210–9228, https://doi.org/10.1128/JVI.75.19.9210-9228.2001 (2001).
Acknowledgements
This work was funded by the Mexican Government (Comisión de Equidad y Género de las Legislaturas LX-LXI, Comisión de Igualdad de Género de la Legislatura LXII de la H. Cámara de Diputados) and Consejo Nacional de Ciencia y Tecnología (CONACyT, SALUD-2013–01–202475). This work was also supported in part by a grant from the Canadian Institutes for Health Research (CIHR) to ZLB, SAR, GRT and MAB (PJT-148621). HVP is a doctoral student from Programa de Doctorado en Ciencias Biomédicas, Universidad Nacional Autónoma de México (UNAM), and received a scholarship (registry 245172) from the Consejo Nacional de Ciencia y Tecnología (CONACYT). MAB is a Tier II Canada Research Chair in Viral Pathogenesis and Immunity. ZLB is supported by a Scholar Award from the Michael Smith Foundation for Health Research. CJB is supported by a Postgraduate Fellowship from CIHR (MFE-146750). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We would like to thank CIENI’s Virological Diagnostic Laboratory staff (Carolina Demenegui, Mario Preciado), administrative staff (Zeidy Arenas, Guadalupe Hernández, María de Jesús Espinosa, Berenice Cancino, and Eduardo López), Molecular Biology Laboratory staff, and informatics staff (Pablo César, Héctor Esteban Paz). We would like to give a special thank you to all individuals who took part in this study.
Author information
Authors and Affiliations
Consortia
Contributions
Conceived and designed the experiments: H.V.P., S.A.R., C.J.B., Z.L.B., G.R.T. Recruited participant subjects in the Mexican states and Central American countries, applied demographic and clinical questionnaires, and managed the shipment of blood samples: C.G.M., D.T.T., A.M.O., C.M.V., I.Y.E.U., R.P.M., J.M.P., Y.Z., G.P.C., C.Q.D., I.L., R.I.M., E.Y.P., M.M., R.A.C. HOMER reference cohort: P.R.H., Z.L.B., M.A.B., C.J.B., HLA typing: H.V.P., S.A.H., D.G.R., M.S.N., T.G.T., T.E.G., V.S.Q.M., S.A.S., C.A. Plasma H.I.V. viral load and flow cytometry counts: E.R.A., R.H.J. Managed cohort clinical, demographic and HLA data: C.G.M., D.T.T., H.V.P. Analyzed the data: H.V.P., S.A.H., F.J.P.G., C.J.B. Prepared all figures: H.V.P., M.S.N. Wrote the paper: H.V.P., S.A.R., C.J.B., Z.L.B., G.R.T. All authors contributed to the critical revision of the manuscript and saw and approved the final version.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A comprehensive list of consortium members appears at the end of the paper
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Valenzuela-Ponce, H., Alva-Hernández, S., Garrido-Rodríguez, D. et al. Novel HLA class I associations with HIV-1 control in a unique genetically admixed population. Sci Rep 8, 6111 (2018). https://doi.org/10.1038/s41598-018-23849-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-23849-7
- Springer Nature Limited
This article is cited by
-
The immunogenetics of COVID-19
Immunogenetics (2023)