Abstract
Medium-resolution genotyping has the goal of distinguishing different subgroups instead of each element in a group. An oligonucleotide array provides an inexpensive, high-throughput method to identify differences in DNA sequence among individuals, which is fundamental for genotyping. As the cost and difficulty of designing and fabricating the oligonucleotide array dramatically increase with the number of probes used, it is therefore important to have a design with a minimum number of probes meeting the requirement of medium-resolution genotyping. The first algorithm for designing and selecting probes for oligonucleotide array-based medium-resolution typing is reported. The goal in deriving the algorithm was to select a minimum number of probes from a large probe set on the premise of minimum loss of resolution. The algorithm, which was based on entropy, conditional entropy and mutual information theory, was used to select the minimum number of probes from a large probe set. The algorithm was tested on a human leukocyte antigen (HLA) sequence data set. Thirty probes were selected from 390 probes for HLA-A, and 60 probes were selected from 767 probes for HLA-B. Although the number of probes was reduced by almost ten times, the distinguishability was reduced only a little, by 0.45% (from 99.90% to 99.45%) for HLA-A and 0.27% (from 99.84% to 99.57%) for HLA-B, respectively. This is a satisfactory and practical result.
Similar content being viewed by others
References
Balmain, A. (2001): ‘Cancer genetics: from Boveri and Mendel to microarrays’,Nat. Rev. Cancer,1, pp. 77–82
Borneman, J., Chrobak, M., Della, V. G., Figueroa, A., andJiang, T. (2001): ‘Probe selection algorithms with applications in the analysis of microbial communities’,Bioinformatics,17, pp. S39-S48
Gerlach, J. A. (2002): ‘Human lymphocyte antigen molecular typing: how to identify the 1250+ alleles out there’,Arch. Pathol. Lab Med.,126, pp. 281–284
Gulledge, J., Ahmad, A., Steudler, P. A., Pomerantz, W. J., andCavanaugh, C. M. (2001): ‘Family- and genus-level 16S rRNA-targeted oligonucleotide probes for ecological studies of methanotrophic bacteria’,Appl. Environ. Microbiol.,67, pp. 4726–4733
Hacia, J. G. (1999): ‘Resequencing and mutational analysis using oligonucleotide microarrays’,Nat. Genet.,21, pp. 42–47
Herwig, R., Schmitt, A. O., Steinfath, M., O'Brien, J., Seidel, H., Meier-Ewert, S., Lehrach, H., andRadelof, U. (2000): ‘Information theoretical probe selection for hybridisation experiments’,Bioinformatics,16, pp. 890–898
‘International Histocompatability Working Group’: http//www.ihwg.org
Li, F., andStormo, G. D. (2001): ‘Selection of optimal DNA oligos for gene expression arrays’,Bioinformatics,17, pp. 1067–1076
Liu, W. T., Mirzabekov, A. D., andStahl, D. A. (2001): ‘Optimization of an oligonucleotide microchip for microbial identification studies: a non-equilibrium dissociation approach’,Environ. Microbiol.,3, pp. 619–629
Mrowka, R., Schuchhardt, J., andGille, C. (2002): ‘Oligodbinteractive design of oligo DNA for transcription profiling of human genes’,Bioinformatics,18, pp. 1686–1687
Shannon, C. E. (1948): ‘A mathematical theory of communication’,Bell Syst. Tech. J.,27, pp. 379–423 and 623–656
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhou, Y., Peng, S., Gao, H. et al. Probe selection algorithm for oligonucleotide array-based medium-resolution genotyping. Med. Biol. Eng. Comput. 42, 812–816 (2004). https://doi.org/10.1007/BF02345215
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF02345215