HLA Typing, pp 235-247

Accurate Assembly and Typing of HLA using a Graph-Guided Assembler Kourami

  • Heewook Lee
  • Carl Kingsford
Part of the Methods in Molecular Biology book series (MIMB, volume 1802)


Accurate typing of human leukocyte antigen (HLA) is essential for successful organ transplantation, and HLA genes are strongly associated with various diseases. Widely used typing assays often involve a set of specially designed primers or probes, requiring additional experiments. With the maturing of high-throughput sequencing (HTS) technologies, whole genome sequencing (WGS) and other HTS assays are becoming more accessible, even in clinical settings. We describe various computational methods capable of typing HLA genes directly from HTS data, including Kourami, our HLA assembler. Kourami is the first HLA assembler capable of discovering novel alleles. It assembles full-length sequences across the peptide-binding regions of HLA genes. Here, we focus on how a user would run Kourami on a new sample. We demonstrate the application by using Kourami to type HLA alleles from a recently published WGS dataset with validated HLA types.


Whole genome sequencing; WGS; HLA; Assembly; High-throughput; Bioinformatics; in silico



This research was funded in part by the Gordon and Betty Moore Foundation’s Data-Driven Discovery Initiative through Grant GBMF4554 to C.K., by the US National Science Foundation (CCF-1256087, CCF-1319998), and by the US National Institutes of Health (R01HG007104, R01GM122935).



Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
