Abstract
We provide an overview of existing partitioning and hierarchical clustering algorithms in R. We discuss statistical issues and methods in choosing the number of clusters, the choice of clustering algorithm, and the choice of dissimilarity matrix. We also show how to visualize a clustering result by plotting ordered dissimilarity matrices in R. A new R package hopach, which implements the Hierarchical Ordered Partitioning And Collapsing Hybrid (HOPACH) algorithm, is presented (van der Laan and Pollard, 2003). The methodology is applied to a renal cell cancer gene expression data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Pollard, K.S., van der Laan, M.J. (2005). Cluster Analysis of Genomic Data. In: Gentleman, R., Carey, V.J., Huber, W., Irizarry, R.A., Dudoit, S. (eds) Bioinformatics and Computational Biology Solutions Using R and Bioconductor. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-29362-0_13
Download citation
DOI: https://doi.org/10.1007/0-387-29362-0_13
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-25146-2
Online ISBN: 978-0-387-29362-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)