Abstract
Graves’ disease, the production of thyroid-stimulating hormone receptor-stimulating antibodies leading to hyperthyroidism, is one of the most common forms of human autoimmune disease. It is widely agreed that complex diseases are not controlled simply by an individual gene or DNA variation but by their combination. Single nucleotide polymorphisms (SNPs), which are the most common form of DNA variation, have great potential as a medical diagnostic tool. In this paper, the P-value is used as a SNP pre-selection criterion, and a wrapper algorithm with binary particle swarm optimization is used to find the rule for discriminating between affected and control subjects. We analyzed the association between combinations of SNPs and Graves’ disease by investigating 108 SNPs in 384 cases and 652 controls. We evaluated our method by differentiating between cases and controls in a five-fold cross validation test, and it achieved a 72.9% prediction accuracy with a combination of 17 SNPs. The experimental results showed that SNPs, even those with a high P-value, have a greater effect on Graves’ disease when acting in a combination.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Hollowell J G, Staehling N W, Flanders W D, et al. Serum TSH, T-4, and thyroid antibodies in the United States population (1988 to 1994): National Health and Nutrition Examination Survey (NHANES III) J Clin Endocrinol Metab, 2002, 87: 489–4
Chen X, Wu W S, Chen G L, et al. The effect of salt iodization for 10 years on the prevalences of endemic goiter and hyperthyroidism. J Chin Endocrinol Metab, 2000, 18: 342–344
Wang P W, Liu R T, Juo S H H, et al. Cytotoxic T lymphocyte-associated molecule-4 polymorphism and relapse of Graves’ hyperthyroidism after antithyroid withdrawal. J Clin Endocrinol Metab, 2004, 89: 169–173, 1:CAS:528:DC%2BD2cXmvFeqsQ%3D%3D, 14715845, 10.1210/jc.2003-030854
Gibbs R A, Belmont J W, Hardenbol P, et al. The International HapMap Project. Nature, 2003, 426: 789–796, 1:CAS:528:DC%2BD3sXpvVGmt7s%3D, 10.1038/nature02168
Wang T H, Wang H S. A genome-wide association study primer for clinicians. Taiwan J Obstet Gynecol, 2009, 48: 89–95, 19574166, 10.1016/S1028-4559(09)60265-5
Xavier R J, Rioux J D. Genome-wide association studies: A new window into immune-mediated diseases. Nature Rev Immunol, 2008, 8: 631–643, 1:CAS:528:DC%2BD1cXovV2gtrY%3D, 10.1038/nri2361
Kraft P, Hunter D J. Genetic risk prediction—are we there yet? N Engl J Med, 2009, 360: 1701–1703, 1:CAS:528:DC%2BD1MXkvVOnu7Y%3D, 19369656, 10.1056/NEJMp0810107
Moffatt M F, Kabesch M, Liang L M, et al. Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma. Nature, 2007, 448: 470–473, 1:CAS:528:DC%2BD2sXotFajurY%3D, 17611496, 10.1038/nature06014
Wan X, Yang C, Yang Q, et al. MegaSNPHunter: A learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study. BMC Bioinformatics, 2009, 10: 13, 19134182, 10.1186/1471-2105-10-13
Hampe J, Schreiber S, Krawczak M. Entropy-based SNP selection for genetic association studies. Hum Genet, 2003, 114: 36–43, 1:CAS:528:DC%2BD3sXovFOisb4%3D, 14505034, 10.1007/s00439-003-1017-2
Carlson C S, Eberle M A, Rieder M J, et al. Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet, 2004, 74: 106–120, 1:CAS:528:DC%2BD2cXlvVOrsA%3D%3D, 14681826, 10.1086/381000
Kusiak A, Shah S. Cancer gene search with data-mining and genetic algorithms. Comput Biol Med, 2007, 37: 251–261, 16616736, 10.1016/j.compbiomed.2006.01.007
Xie Q, Ratnasinghe L D, Hong H, et al. Decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method. BMC Bioinformatics, 2005, 6: S4, 16026601, 10.1186/1471-2105-6-S2-S4
Tomida S, Hanai T, Koma N, et al. Artificial neural network predictive model for allergic disease using single nucleotide polymorphisms data. J Biosci Bioeng, 2002, 93: 470–478, 1:CAS:528:DC%2BD38XlsFSks7w%3D, 16233234
Tomita Y, Tomida S, Hasegawa Y, et al. Artificial neural network approach for selection of susceptible single nucleotide polymorphisms and construction of prediction model on childhood allergic asthma. BMC Bioinformatics, 2004, 5: 120, 15339344, 10.1186/1471-2105-5-120
Dumitru B, Alexander Z. Design and validation of methods searching for risk factor in genotype case-control studies. J Comput Biol, 2008, 15: 81–90, 10.1089/cmb.2007.0081
Ueda H, Howson J M M, Esposito L, et al. Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature, 2003, 423: 506–511, 1:CAS:528:DC%2BD3sXktVygtLk%3D, 12724780, 10.1038/nature01621
Kennedy J, Eberhart R. Particle swarm optimization. In: Proceedings of 1995 IEEE International Conference on Neural Networks, Perth, Australia, 1995. 4: 1942–1948
Kennedy J, Eberhart R C. A discrete binary version of the particle swarm algorithm. In: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Piscataway, New Jersey, 1997. 4104–4108
Salzberg S L. On comparing classifiers: Pitfalls to avoid and a recommended approach. Data Min Knowl Discov, 1997, 1: 317–328, 10.1023/A:1009752403260
Yeh C L, Chang H W, Tu C J, et al. Improved binary PSO for feature selection using gene expression data. Comput Biol Chem, 2008, 32: 29–37, 10.1016/j.compbiolchem.2007.09.005
Li S T, Wu X X, Tan M K. Gene selection using hybrid particle swarm optimization and genetic algorithm. Soft Comput, 2008: 1039–1048
Listgarten J, Damaraju S, Poulin B, et al. Predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms. Clin Cancer Res, 2004, 10: 2725–2737, 1:CAS:528:DC%2BD2cXjt1CltLg%3D, 15102677, 10.1158/1078-0432.CCR-1115-03
Uhmn S, Kim D H, Ko Y W, et al. A study on application of single nucleotide polymorphism and machine learning techniques to diagnosis of chronic hepatitis. Expert Syst, 2009, 26: 60–69, 10.1111/j.1468-0394.2008.00491.x
Author information
Authors and Affiliations
Corresponding author
Additional information
This article is published with open access at Springerlink.com
Rights and permissions
Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Wei, B., Peng, Q., Zhang, Q. et al. Identification of a combination of SNPs associated with Graves’ disease using swarm intelligence. Sci. China Life Sci. 54, 139–145 (2011). https://doi.org/10.1007/s11427-010-4117-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11427-010-4117-y