Skip to main content

Bioinformatics Analysis of Microarray Data

  • Protocol
  • First Online:
Cardiovascular Genomics

Part of the book series: Methods in Molecular Biology™ ((MIMB,volume 573))

Abstract

Gene expression profiling provides unprecedented opportunities to study patterns of gene expression regulation, for example, in diseases or developmental processes. Bioinformatics analysis plays an important part of processing the information embedded in large-scale expression profiling studies and for laying the foundation for biological interpretation.

Over the past years, numerous tools have emerged for microarray data analysis. One of the most popular platforms is Bioconductor, an open source and open development software project for the analysis and comprehension of genomic data, based on the R programming language.

In this chapter, we use Bioconductor analysis packages on a heart development dataset to demonstrate the workflow of microarray data analysis from annotation, normalization, expression index calculation, and diagnostic plots to pathway analysis, leading to a meaningful visualization and interpretation of the data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Reimers, M, Carey, VJ. (2006). Bioconductor: an open source framework for bioinformatics and computational biology. Method Enzymol 411, 119–134.

    Article  CAS  Google Scholar 

  2. Team, RDC. (2007). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.

    Google Scholar 

  3. Dai, M, Wang, P, Boyd, AD, et al. (2005). Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res 33, e175.

    Article  PubMed  Google Scholar 

  4. Liu, H, Zeeberg, BR, Qu, G, et al. (2007). AffyProbeMiner: a web resource for computing or retrieving accurately redefined Affymetrix probe sets. Bioinformatics 23, 2385–2390.

    Article  PubMed  CAS  Google Scholar 

  5. Hubbell, E, Liu, WM, Mei, R. Guide to Probe Logarithmic Intensity Error (PLIER) Estimation. http://www.affymetrix.com/support/technical/technotes/plier_technote.pdf

  6. Choe, SE, Boutros, M, Michelson, AM. (2005). Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset. Genome Biol 6, R16.

    Article  PubMed  Google Scholar 

  7. Seo, J, Hoffman, EP. (2006). Probe set algorithms: is there a rational best bet? BMC Bioinformatics 7, 395.

    Article  PubMed  Google Scholar 

  8. McClintick, JN, Edenberg, HJ. (2006). Effects of filtering by Present call on analysis of microarray experiments. BMC Bioinformatics 7, 49.

    Article  PubMed  Google Scholar 

  9. Pepper, SD, Saunders, EK, Edwards, LE, et al. (2007). The utility of MAS5 expression summary and detection call algorithms. BMC Bioinformatics 8, 273.

    Article  PubMed  Google Scholar 

  10. Benjamini, Y, Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series 57, 289–300.

    Google Scholar 

  11. Dennis, G, Jr., Sherman, BT, Hosack, J, et al. (2003). DAVID: database for Annotation, Visualization, and Integrated Discovery. Genome Biol 4, P3.

    Article  Google Scholar 

  12. Tian, L, Greenberg, SA, Kong, SW, et al. (2005). Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci USA 102, 13544–13549.

    Article  PubMed  CAS  Google Scholar 

  13. Nam, D, Kim, SY. (2008). Gene-set approach for expression pattern analysis. Brief Bioinform 9, 189–197.

    Article  PubMed  Google Scholar 

  14. Raghavan, N, De Bondt, AM, Talloen, W, et al. (2007). The high-level similarity of some disparate gene expression measures. Bioinformatics 23, 3032–3038.

    Article  PubMed  CAS  Google Scholar 

  15. Mootha, VK, Handschin, C, Arlow, D, et al. (2004). Erralpha and Gabpa/b specify PGC-1alpha-dependent oxidative phosphorylation gene expression that is altered in diabetic muscle. Proc Natl Acad Sci USA 101, 6570–6575.

    Article  PubMed  CAS  Google Scholar 

  16. Baitaluk, M, Qian, X, Godbole, S, et al. (2006). PathSys: integrating molecular interaction graphs for systems biology. BMC Bioinformatics 7, 55.

    Article  PubMed  Google Scholar 

  17. Draghici, S, Khatri, P, Tarca, AL, et al. (2007). A systems biology approach for pathway level analysis. Genome Res 17, 1537–1545.

    Article  PubMed  CAS  Google Scholar 

  18. Irizarry, RA, Bolstad, BM, Collin, F, et al. (2003). Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 31, e15.

    Article  PubMed  Google Scholar 

  19. Wu, Z, Irizarry, RA. (2005). Stochastic models inspired by hybridization theory for short oligonucleotide arrays. J Comput Biol 12, 882–893.

    Article  PubMed  CAS  Google Scholar 

  20. van der Laan, M, Dudoit, S, Pollard, K. (2003). Hybrid clustering of gene expression data with visualization and bootstrap. J Stat Plan Inference 117,275–303.

    Article  Google Scholar 

  21. Reich, M, Liefeld, T, Gould, J, et al. (2006). GenePattern 2.0. Nat Genet 38, 500–501.

    Article  PubMed  CAS  Google Scholar 

  22. Saeed, AI, Sharov, V, White, J, et al. (2003). TM4: a free, open-source system for microarray data management and analysis. Biotechniques 34,374–378.

    PubMed  CAS  Google Scholar 

  23. Li, C, Wong, WH. (2001). Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application. Genome Biol 2(8), 0032.1–0032.11.

    Google Scholar 

  24. Li, C, Wong, WH. (2001). Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc Natl Acad Sci USA 98, 31–36.

    Article  PubMed  CAS  Google Scholar 

  25. Tusher, VG, Tibshirani, R, Chu, G. (2001). Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 98, 5116–5121.

    Article  PubMed  CAS  Google Scholar 

  26. Manduchi, E, Grant, GR, McKenzie, SE, et al. (2000). Generation of patterns from gene expression data by assigning confidence to differentially expressed genes. Bioinformatics 16, 685–698.

    Article  PubMed  CAS  Google Scholar 

  27. Mootha, VK, Lindgren, CM, Eriksson, KF, et al. (2003). PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet 34, 267–273.

    Article  PubMed  CAS  Google Scholar 

  28. Subramanian, A, Kuehn, H, Gould, J, et al. (2007). GSEA-P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics 23, 3251–3253.

    Article  PubMed  CAS  Google Scholar 

  29. Backes, C, Keller, A, Kuentzer, J, et al. (2007). GeneTrail–advanced gene set enrichment analysis. Nucleic Acids Res 35, W186–192.

    Article  Google Scholar 

  30. Dahlquist, KD, Salomonis, N, Vranizan, K, et al. (2002). GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat Genet 31, 19–20.

    Article  PubMed  CAS  Google Scholar 

  31. Gautier, L, Cope, L, Bolstad, BM, et al. (2004). Affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315.

    Article  PubMed  CAS  Google Scholar 

  32. Smyth, GK (2004). Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3(1), Article 3.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Humana Press, a part of Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Zhang, Y., Szustakowski, J., Schinke, M. (2009). Bioinformatics Analysis of Microarray Data. In: DiPetrillo, K. (eds) Cardiovascular Genomics. Methods in Molecular Biology™, vol 573. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-60761-247-6_15

Download citation

  • DOI: https://doi.org/10.1007/978-1-60761-247-6_15

  • Published:

  • Publisher Name: Humana Press, Totowa, NJ

  • Print ISBN: 978-1-60761-246-9

  • Online ISBN: 978-1-60761-247-6

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics