Abstract
The purpose of normalization in microarray data analysis is to minimize systematic variations in the measured gene expression levels of two co-hybridized mRNA samples so that biological differences can be more easily distinguished. The most commonly and widely used normalization procedure for spotted arrays is probably the intensity dependent and print-tip LOWESS normalization. It is well known that the choices of different parameter values greatly affect the quality of the normalization results, and thus poor quality of the normalization results could be due to the arbitrary choice of the smoothing parameters for LOWESS normalization. In many normalization studies, however, LOWESS has been simply used without rigorous consideration of the parameters. In this article, we propose a bootstrap method to find the optimal window width in print-tip normalization by applying the cross validation technique. We also compare through simulation studies the normalization results by using the proposed method with those by fixing the window width.
Similar content being viewed by others
References
Berger JA, Hautaniem S, Jarvinen A, Edgren H, Mitra SK, Astola J (2004) Optimized LOWESS normalization parameter selection for DNA microarray data. BMC-Bioinform 5:1–13
Callow MJ, Dudoit S, Gong EL, Speed TP, Rubin EM (2000) Microarray expression profiling identifies genes with altered expression in hdl deficient mice. Genome Res 10:2022–2029
Cleveland WS (1978) Robust locally weighted regression and smoothing scatter plots. J Am Stat Assoc 74:829–836
Cleveland WS, Devlin SJ (1988) Locally-weighted regression: An approach to regression analysis by local fitting. J Am Stat Associ 83:590–610
Cui X, Kerr MK, Churchill GA (2003) Transformations for cDNA Microarray Data. Stat Appl Genet Mole Biol. http://www.bepress.com/sagmb/vol2/iss1/art4/
Dudoit S, Yang YH, Speed TP, Callow MJ (2002) Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Stat Sin 12:111–139
Faller D, Voss HU, Timmer J, Hobohm U (2003) Normalization of DNA-microarray data by nonlinear correlation maximization. J Comput Biol 10:751–762
Faraway J, Jhun M (1990) Bootstrap choice of bandwidth for Density estimation. J Am Stat Assoc 85:1119–1122
Futschik M, Crompton T (2004) Model selection and efficiency testing for normalization of cDNA microarray data. Genome Biol 5:R60–R79
Hastie T (2001) The Elements of Statistical Learning. Springer, NewYork
Huang J, Wang DL, Zhang CH (2005) A two-way semilinear model for normalization and significant analysis of microarray data. J Am Stat Assoc 100:814–829
Huber W, Von Heydebreck A, Sultmann H, Poustka A, Vingron M, (2002) Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 18:1–9
Kepler TB, Crosby L, Morgan KT (2002) Normalization and analysis of DNA microarray data by self-consistency and local regression. Genome Biol 3:1–12
Kim SY, Lee JW, Sohn IS (2006) Comparison of various statistical methods for identifying differentially expressed genes in replicated microarray data. Stat Meth Med Res 15(1):3–20
Lee JW, IMT-2000 Statistics Group (2004) Statistical Software System for Analyzing DNA Chip Expression Data. In: Proceedings of international symposium on bioinformatics for agricultural biotechnology. Seoul. Korea, pp 21–42
Ma S, Kosorok M, Huang J, Xie H, Manzella L, Soares MB (2006) Robust semiparametric microarray normalization and significance analysis. Biometrics 62:555–561
Tusher VG, Tibshirani R, Chu G (2001) Significace analysis of microarrays applied to the ionizing radiation response. In: Proceedings of the National Academy of Sciences 98:5116–5121
Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP (2002) Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res 30:e15
Yang YH, Dudoit S, Luu P, Speed TP (2001) Normalization for cDNA microarray data. In Bittner ML, Chen Y, Dorsel AN, Dougherty ER (eds), Microarrays: optical technologies and informatics. In: Proceedings of SPIE, vol 4266.
Zien A, Aigner T, Zimmer R, Lengauer T (2001) Centralization: a new method for the normalization of gene expression data. Bioinformatics. 17: S323-S331
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, J.W., Jhun, M., Kim, J.Y. et al. An optimal choice of window width for LOWESS normalization of microarray data. OR Spectrum 30, 235–248 (2008). https://doi.org/10.1007/s00291-007-0092-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00291-007-0092-5