Abstract
A survey is given of tasks related to the construction and evaluation of classifiers applied to a renal cell cancer data set. Balanced sample splitting, non-specific filtering, linear discriminant analysis, nearest-neighbor prediction, and support vector machines are all concretely illustrated using the MLInterfaces package. Evaluations based on single and multiple random splits of data are compared. The entire presentation is given in a very generic programming format, to facilitate the adaptation and variation, by other investigators, of the techniques used here.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Dettling, M. (2005). Classification with Gene Expression Data. In: Gentleman, R., Carey, V.J., Huber, W., Irizarry, R.A., Dudoit, S. (eds) Bioinformatics and Computational Biology Solutions Using R and Bioconductor. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-29362-0_24
Download citation
DOI: https://doi.org/10.1007/0-387-29362-0_24
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-25146-2
Online ISBN: 978-0-387-29362-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)