A new set-valued system identification approach to identifying rare genetic variants for ordered categorical phenotype

Bi, Wenjian; Kang, Guolian; Cui, Yuehua; Li, Yun; Hartford, Christine M; Leung, Wing; Zhang, Ji-Feng

doi:10.1186/1471-2105-15-S10-P29

Poster presentation
Open access
Published: 29 September 2014

Volume 15, article number P29, (2014)
BMC Bioinformatics

Wenjian Bi¹, Guolian Kang², Yuehua Cui³, Yun Li⁴, Christine M Hartford⁵, Wing Leung⁵,⁶ & Ji-Feng Zhang¹

Background

In phenotype-genotype association studies of a phenotype with multiple ordered response categories, the usual practice is either to regroup the categories into two groups, “cases” and “controls”, and then apply the standard logistic regression (LG) model [1, 2], or to apply the non-parametric Spearman rank correlation [3] or the parametric ordered logistic regression (orderLG) model [4], which accounts for the ordinal nature of the phenotype. However, these approaches may lose statistical power if the phenotype is obtained by categorizing an underlying continuous phenotype, whether observed or complicated and unmeasured or immeasurable, or if the underlying genetic variants are rare.

Materials and methods

We therefore propose a set-valued (SV) system model, which assumes that the underlying continuous phenotype follows a normal distribution, to identify genetic variants associated with an ordered categorical phenotype. We couple this model with a set-valued system identification method to identify all underlying key system parameters.
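The modelling assumption above can be illustrated with a minimal Python sketch. It is not the authors' SV identification algorithm, which the abstract does not describe; it only shows how a normally distributed latent phenotype, cut at fixed thresholds, yields an ordered categorical phenotype. The linear additive genotype effect, the function names, and all numeric values (intercept, effect size, standard deviation, thresholds) are assumptions made for illustration.

```python
import random
from statistics import NormalDist


def category_probs(genotype, intercept, beta, sigma, thresholds):
    """P(category | genotype) when the latent phenotype is
    N(intercept + beta * genotype, sigma^2) and is cut at the sorted thresholds."""
    latent = NormalDist(mu=intercept + beta * genotype, sigma=sigma)
    cdf = [0.0] + [latent.cdf(c) for c in thresholds] + [1.0]
    # Probability mass between consecutive cut points.
    return [hi - lo for lo, hi in zip(cdf, cdf[1:])]


def simulate_category(genotype, intercept, beta, sigma, thresholds, rng):
    """Draw one latent value and return its 0-based category index."""
    y = rng.gauss(intercept + beta * genotype, sigma)
    return sum(y > c for c in thresholds)


# Hypothetical parameter values, chosen only for illustration.
INTERCEPT, BETA, SIGMA = 0.0, 0.8, 1.0
THRESHOLDS = [-0.5, 0.5]  # two cut points give three ordered categories

# Genotype is coded as the minor-allele count (0, 1 or 2).
probs_by_genotype = {
    g: category_probs(g, INTERCEPT, BETA, SIGMA, THRESHOLDS) for g in (0, 1, 2)
}

if __name__ == "__main__":
    for g, probs in probs_by_genotype.items():
        print(g, [round(p, 3) for p in probs])
```

In this sketch only the category is observed, never the latent value itself, which is the setting in which the parameters have to be identified from set-valued (interval) information.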

Results

Simulation studies show that SV controlled the Type I error rate well. In the comparison among the LG, SV and orderLG methods, LG had significantly lower power than both SV and orderLG because it disregards the ordinal nature of the phenotype, and SV had similar or higher power than orderLG. Additionally, the SV association parameter estimate was 2.7- to 28.7-fold less variable than the orderLG association parameter estimate. Less variability in the association parameter estimate translates to greater power and robustness across the spectrum of minor allele frequencies. These advantages are most pronounced for rare variants, or for common variants when the sample size is small. For instance, in a simulation with data generated from an additive orderLG model with an odds ratio of 7.4 for a phenotype with three categories, a single nucleotide polymorphism with a minor allele frequency of 0.75%, and a sample size of 999 (333 per category), the power of the SV, orderLG and LG models was 70%, 40% and <1%, respectively, at a significance level of 10^-6. When applied to a real data set, the set of variants identified by LG and orderLG was a subset of those identified by SV. Thus, SV can be a competitive alternative to LG or orderLG in genetic association studies of ordered categorical phenotypes, such as candidate gene studies, genome-wide association studies or next generation sequencing studies.
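To make the data-generating model in the example concrete, the following Python sketch computes category probabilities under an additive ordered logistic (proportional odds) model with an odds ratio of 7.4 and three categories. The cut points, the sign convention (each minor allele raises the odds of a higher category), and the function names are assumptions; the abstract does not report them, and the sketch does not reproduce the simulation design of 333 subjects per category or the power figures.

```python
import math


def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))


def ordered_logit_probs(genotype, cutpoints, odds_ratio):
    """Category probabilities under P(Y <= k | g) = logistic(cutpoint_k - beta * g),
    with beta = log(odds_ratio) per minor allele (additive coding)."""
    beta = math.log(odds_ratio)
    cumulative = [logistic(c - beta * genotype) for c in cutpoints]
    cdf = [0.0] + cumulative + [1.0]
    return [hi - lo for lo, hi in zip(cdf, cdf[1:])]


def odds_of_exceeding(probs, k):
    """Odds of falling in a category above 0-based index k."""
    upper = sum(probs[k + 1:])
    return upper / (1.0 - upper)


ODDS_RATIO = 7.4          # value quoted in the text
CUTPOINTS = [-0.7, 0.7]   # hypothetical cut points for three categories

probs = {g: ordered_logit_probs(g, CUTPOINTS, ODDS_RATIO) for g in (0, 1, 2)}

# Under proportional odds, one extra minor allele multiplies the odds of
# exceeding every cut point by the same factor.
ratio_low = odds_of_exceeding(probs[1], 0) / odds_of_exceeding(probs[0], 0)
ratio_high = odds_of_exceeding(probs[1], 1) / odds_of_exceeding(probs[0], 1)

if __name__ == "__main__":
    for g, p in probs.items():
        print(g, [round(x, 3) for x in p])
    print(round(ratio_low, 6), round(ratio_high, 6))
```

The two ratios agree with the specified odds ratio at both cut points, which is the defining property of the proportional odds model that orderLG fits.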

References

1. Treviño LR, Shimasaki N, Yang W, Panetta JC, Cheng C, Pei D, Chan D, Sparreboom A, Giacomini KM, Pui CH, Evans WE, Relling MV: Germline genetic variation in an organic anion transporter polypeptide associated with methotrexate pharmacokinetics and clinical effects. J Clin Oncol. 2009, 27 (35): 5972-5978.
2. Ingle JN, Schaid DJ, Goss PE, Liu M, Mushiroda T, Chapman JA, Kubo M, Jenkins GD, Batzler A, Shepherd L, Pater J, Wang L, Ellis MJ, Stearns V, Rohrer DC, Goetz MP, Pritchard KI, Flockhart DA, Nakamura Y, Weinshilboum RM: Genome-wide associations and functional genomic studies of musculoskeletal adverse events in women receiving aromatase inhibitors. J Clin Oncol. 2010, 28 (31): 4674-4682.
3. Png E, Thalamuthu A, Ong RT, Snippe H, Boland GJ, Seielstad M: A genome-wide association study of hepatitis B vaccine response in an Indonesian population reveals multiple independent risk variants in the HLA region. Hum Mol Genet. 2011, 20 (19): 3893-3898.
4. Yang JJ, Cheng C, Yang W, Pei D, Cao X, Fan Y, Pounds S, Treviño LR, French D, Campana D, Downing JR, Evans WE, Pui C, Devidas M, Bowman WP, Camitta BM, Willman C, Davies SM, Borowitz MJ, Carroll WL, Hunger SP, Relling MV: Genome-wide interrogation of germline genetic variation associated with treatment response in childhood acute lymphoblastic leukemia. JAMA. 2009, 301 (4): 393-403.

Author information

Authors and Affiliations

Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, PRC
Wenjian Bi & Ji-Feng Zhang
Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Guolian Kang
Department of Statistics and Probability, Michigan State University, East Lansing, MI, 48824, USA
Yuehua Cui
Department of Biostatistics, University of North Carolina, 3101 McGavran-Greenberg Hall, Chapel Hill, NC, 27599, USA
Yun Li
Department of Bone Marrow Transplantation and Cellular Therapy, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Christine M Hartford & Wing Leung
Department of Pediatrics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA
Wing Leung

Corresponding author

Correspondence to Guolian Kang.

Additional information

Wenjian Bi and Guolian Kang contributed equally to this work.

About this article

Cite this article

Bi, W., Kang, G., Cui, Y. et al. A new set-valued system identification approach to identifying rare genetic variants for ordered categorical phenotype. BMC Bioinformatics 15 (Suppl 10), P29 (2014). https://doi.org/10.1186/1471-2105-15-S10-P29
