RefSeq Refinements of UniGene-Based Gene Matching Improve the Correlation of Expression Measurements Between Two Microarray Platforms

Matching genes across microarray platforms is a critical step in meta-analysis. Standard practice uses UniGene to match genes, yet numerous studies have found poor correlations between platforms under UniGene matching.

We profiled samples from 33 breast cancer patients on two different microarray platforms (Affymetrix and cDNA) and compared gene-matching strategies between them. Our results confirmed that UniGene-based matching led to poor correlations of gene expression between platforms. Using RefSeq, a reference sequence database maintained by the National Center for Biotechnology Information (NCBI), we developed and implemented a new method to refine gene matching. Correlations between gene expression measurements were substantially higher after RefSeq-based matching. Our approach differs from previously reported sequence-matching approaches and retains useful expression measurements, making it a practical way to match probes across platforms.
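The comparison described above can be illustrated with a minimal sketch. The abstract does not spell out the refinement procedure, so the code assumes one plausible reading: probes are first paired by shared UniGene cluster, and a pair is kept in the refined set only when both probes also map to the same RefSeq accession. The probe records, field names (`id`, `unigene`, `refseq`, `expr`), identifiers, and expression values are all hypothetical toy data, not results from the study.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences; None if undefined."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None  # a constant vector has no defined correlation
    return sxy / sqrt(sxx * syy)


def match_probes(affy, cdna, require_refseq=False):
    """Pair probes across platforms by UniGene cluster.

    With require_refseq=True, a pair is kept only if both probes carry the
    same RefSeq accession (an assumed form of the refinement).
    """
    pairs = []
    for a in affy:
        for c in cdna:
            if a["unigene"] != c["unigene"]:
                continue
            if require_refseq and (a["refseq"] is None or a["refseq"] != c["refseq"]):
                continue
            pairs.append((a, c))
    return pairs


def pair_correlations(pairs):
    """Across-sample correlation for each matched probe pair."""
    return {(a["id"], c["id"]): pearson(a["expr"], c["expr"]) for a, c in pairs}


# Toy data: identifiers and values are made up for illustration only.
affy = [
    {"id": "A1", "unigene": "Hs.TOY1", "refseq": "NM_TOY1", "expr": [1.0, 2.0, 3.0, 4.0]},
    {"id": "A2", "unigene": "Hs.TOY2", "refseq": "NM_TOY2", "expr": [2.0, 1.0, 4.0, 3.0]},
]
cdna = [
    {"id": "C1", "unigene": "Hs.TOY1", "refseq": "NM_TOY1", "expr": [1.1, 2.1, 2.9, 4.2]},
    {"id": "C2", "unigene": "Hs.TOY1", "refseq": "NM_TOY9", "expr": [4.0, 1.0, 3.0, 2.0]},
    {"id": "C3", "unigene": "Hs.TOY2", "refseq": "NM_TOY2", "expr": [2.2, 0.9, 4.1, 3.0]},
]

unigene_pairs = match_probes(affy, cdna)
refined_pairs = match_probes(affy, cdna, require_refseq=True)

unigene_corr = pair_correlations(unigene_pairs)
refined_corr = pair_correlations(refined_pairs)

print(len(unigene_pairs), "pairs by UniGene;", len(refined_pairs), "after RefSeq refinement")
```

In this toy setup the refinement discards the pair whose probes share a UniGene cluster but map to different RefSeq accessions, which is the kind of mismatch that could depress cross-platform correlations under UniGene matching alone.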

We conclude that UniGene alone is insufficient to match genes across platforms. Refined matching based on RefSeq significantly improves the quality of matches.

We thank Stephen Tirrell, James Stec, Mark Ayers and Jeffrey S Ross from Millennium Pharmaceuticals (Cambridge, MA, USA) for performing the microarray hybridisation. Millennium Pharmaceuticals also provided research funding to Dr Pusztai to conduct the clinical trial.

This research was supported in part by the University of Texas SPORE in Lung Cancer grant CA070907 and Prostate Cancer grant CA90270.

The authors have no conflicts of interest that are directly relevant to the content of this article.

