Cluster Analysis of Genomic Data

Pollard, K. S.; van der Laan, M. J.

doi:10.1007/0-387-29362-0_13

K. S. Pollard &
M. J. van der Laan

Part of the book series: Statistics for Biology and Health ((SBH))

9264 Accesses
12 Citations

Abstract

We provide an overview of existing partitioning and hierarchical clustering algorithms in R. We discuss statistical issues and methods in choosing the number of clusters, the choice of clustering algorithm, and the choice of dissimilarity matrix. We also show how to visualize a clustering result by plotting ordered dissimilarity matrices in R. A new R package hopach, which implements the Hierarchical Ordered Partitioning And Collapsing Hybrid (HOPACH) algorithm, is presented (van der Laan and Pollard, 2003). The methodology is applied to a renal cell cancer gene expression data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Authors

K. S. Pollard
View author publications
You can also search for this author in PubMed Google Scholar
M. J. van der Laan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Program in Computational Biology Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N, M2-B876, PO Box 19024, Seattle, Washington, 98109-1024, USA
Robert Gentleman
Channing Laboratory Brigham and Women’s Hospital, Harvard Medical School, 181 Longwood Ave, Boston, MA, 02115, USA
Vincent J. Carey
European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, CB10 1SD, UK
Wolfgang Huber
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 North Wolfe Street, Baltimore, MD, 21205, USA
Rafael A. Irizarry
Division of Biostatistics School of Public Health, University of California Berkeley, 140 Earl Warren Hall, #7360, Berkeley, CA, 94720-7360, USA
Sandrine Dudoit

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Pollard, K.S., van der Laan, M.J. (2005). Cluster Analysis of Genomic Data. In: Gentleman, R., Carey, V.J., Huber, W., Irizarry, R.A., Dudoit, S. (eds) Bioinformatics and Computational Biology Solutions Using R and Bioconductor. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-29362-0_13

Download citation

DOI: https://doi.org/10.1007/0-387-29362-0_13
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-25146-2
Online ISBN: 978-0-387-29362-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics