Abstract:
We experiment with two types of clustering, K-medians and a dimensionreduction technique known as approximate distance clustering (ADC) [Cowen and Priebe 1997], for classifying lung adenocarcinomas into high-risk and low-risk groups according to gene expression values from microarray data. The microarrays were Affymetrix oligonucleotide arrays used in studies at Michigan and Harvard, with 12,600 and 7129 probesets respectively. We show that we can obtain accurate classification based on a reduced set of genes obtained by nearest shrunken mean (NSM) [Tibshirani et al. 2002] or a combination of a variance-based approach with hierarchical clustering. The quality of the clustering is measured by using the p-values from log-rank tests, and the results are confirmed using cross-validation and by using the reduced set of genes obtained from one dataset to cluster the other.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
5. REFERENCES
Beer, DG, Kardia SL, Huang CC, Giordano TJ, Levin AM, Misek DE, Lin L, Chen G, Gharib TG, Thomas DG, Lizyness ML, Kuick R, Hayasaka S, Taylor JM, Iannettoni MD, Orringer MB, Hanash S. Gene-expression profiles predict survival of patients with lung adenocarcinoma, 2002, Nature Medicine 18(8):816–824.
Bhattacharjee, A., Richards, WG, Staunton J, Li C, Monti S, Vasa P, Ladd C, Beheshti J, Bueno R, Gillette M, Loda M, Weber G, Mark EJ, Lander ES, Wong W, Johnson BE, Golub TR, Sugarbaker DJ, and Meyerson M. 2001, Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses, PNAS 98(24):13790–13795.
Cowen, L. J. and Priebe, C.E., 1997, Randomized non-linear projections uncover highdimensional structure. Adv. Appl. Math., 19:319–331.
Tibshirani, R., Hastie, T., Narasimhan, B., and Chu, G., 2002, Diagnosis of multiple cancer types by shrunken centroids of gene expression., PNAS 99(10):6567–657.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science + Business Media, Inc. Boston
About this chapter
Cite this chapter
Zhou, W. et al. (2005). Microarray Data Analysis of Survival Times of Patients with Lung Adenocarcinomas Using ADC and K-Medians Clustering. In: Shoemaker, J.S., Lin, S.M. (eds) Methods of Microarray Data Analysis. Springer, Boston, MA. https://doi.org/10.1007/0-387-23077-7_14
Download citation
DOI: https://doi.org/10.1007/0-387-23077-7_14
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-23074-0
Online ISBN: 978-0-387-23077-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)