Skip to main content

Extent of differential allelic expression of candidate breast cancer genes is similar in blood and breast



Normal gene expression variation is thought to play a central role in inter-individual variation and susceptibility to disease. Regulatory polymorphisms in cis-acting elements result in the unequal expression of alleles. Differential allelic expression (DAE) in heterozygote individuals could be used to develop a new approach to discover regulatory breast cancer susceptibility loci. As access to large numbers of fresh breast tissue to perform such studies is difficult, a suitable surrogate test tissue must be identified for future studies.


We measured differential allelic expression of 12 candidate genes possibly related to breast cancer susceptibility (BRCA1, BRCA2, C1qA, CCND3, EMSY, GPX1, GPX4, MLH3, MTHFR, NBS1, TP53 and TRXR2) in breast tissue (n = 40) and fresh blood (n = 170) of healthy individuals and EBV-transformed lymphoblastoid cells (n = 19). Differential allelic expression ratios were determined by Taqman assay. Ratio distributions were compared using t-test and Wilcoxon rank sum test, for mean ratios and variances respectively.


We show that differential allelic expression is common among these 12 candidate genes and is comparable between breast and blood (fresh and transformed lymphoblasts) in a significant proportion of them. We found that eight out of nine genes with DAE in breast and fresh blood were comparable, as were 10 out of 11 genes between breast and transformed lymphoblasts.


Our findings support the use of differential allelic expression in blood as a surrogate for breast tissue in future studies on predisposition to breast cancer.


Approximately 70% of the genetic risk associated with breast cancer is still unaccounted for and it is predicted that the remainder of susceptibility loci will include common, low-effect variants that most likely have regulatory effects. Recent genome-wide association studies (GWAS) have identified variants that account for an additional 5.9% of the genetic risk [15]. These variants are mostly associated with intronic and intergenic regions, with the most significant variant regulating the level of gene expression of FGFR2 [6]. However, as most of the identified risk loci have small effects, very large numbers of patients will have to be examined to identify further risk variants. An alternative approach for the identification of regulatory risk variants could be to use differences in allelic gene expression in heterozygotes as a quantitative phenotype [79].

Preferential expression from one allele is a common feature of the human genome (up to 60% of genes) and has a genetic basis [6, 1024]. Polymorphic variants at regulatory elements can cause differential allelic expression (DAE), thus using DAE as a quantitative trait could help identify such variation. The samples of choice for association studies are usually blood and saliva, however, relatively little is known about how DAE compares in multiple human tissues and it is questionable whether studying DAE in blood would be a proper surrogate for what happens in the disease target tissue. To date most DAE studies have been performed on EBV transformed lymphoblastoid cell lines (LCLs). Studies in fresh blood, liver and kidney have been reported in a small set of individuals [14, 16], and one recent study looking at the expression of one gene reported that there were large tissue differences in allelic expression ratios within the same individual [25]. An analogous study has been reported in mice [26].

We aimed to perform a more extensive evaluation of differential allelic expression between blood and breast in order to assess the potential usefulness of LCL and fresh blood in association studies, to identify regulatory polymorphisms related to susceptibility to breast cancer. Here we present an analysis of DAE in 12 candidate genes (BRCA1, BRCA2, C1qA, CCND3, EMSY, GPX1, GPX4, MLH3, MTHFR, NBS1, TP53 and TRXR2) likely to be involved in breast cancer, in a large set of individuals. We compared the distribution of allelic ratios of gene expression in fresh blood (B cells and total mononuclear cells), transformed lymphoblasts, and breast tissue from unmatched healthy individuals.

Materials and methods


A total of 170 white cell-reduction filters from anonymous blood donors were collected from the Blood Centre at Addenbrooke's Hospital. Mononuclear cells were separated by density gradient centrifugation using Lymphoprep (Sigma, St. Louis, MO, USA), according to the manufacturer's instructions. B cells were further isolated from these samples by magnetic sorting using CD19 labelled magnetic check beads (Milteny Biotech, Bergisch Gladbach, Germany).

Normal breast tissue was collected at Addenbroke's Hospital, from 40 women undergoing aesthetic surgery, for reasons not related to cancer. All samples were analysed by a histopathologist to ensure that they were free of dysplasia. Ethical approval was obtained for the collection and research use of all blood and breast samples used in this study.

Nineteen lymphoblastoid cell lines derived from unrelated CEPH individuals were obtained from the Coriell Cell Repository. Cell lines were grown in RPMI 1640 with 10% FCS, supplemented with penicillin, streptomycin and L-glutamine, at 37°C and 5% CO2 (Invitrogen, Carlsbad, CA, USA).

All research was carried out in compliance with ethics guidelines and regulations. Human B cells (purified from waste products of blood donations) and normal breast samples were collected with approval from the Addenbrooke's Hospital Local Research Ethics Committee (REC reference 04/Q0108/21 and 06/Q0108/221, respectively).

RNA, DNA and cDNA preparation

DNA was extracted from total mononuclear cells, B-lymphocytes, normal breast and lymphoblastoid cell lines by a conventional SDS/proteinase K/phenol method. Total RNA was extracted from all samples using Qiazol (Invitrogen, Carlsbad, CA, USA) following manufacturer's instructions. The RNA was subsequently treated with DNaseI and repurified using acidic phenol-chlorophorm, and ethanol precipitation.

For normal breast tissue RNA extraction, samples were soaked overnight in RNAlater-Ice® (Ambion, Austin, TX, USA), homogenised in Qiazol using the Precellys®24 bead mechanism (Bertin Technologies, Montigny-le-Bretonneux, France), followed by an additional centrifugation step prior to addition of chlorophorm to the lysate, to eliminate excessive fat.

cDNA was prepared from 1 μg of total RNA per 20 μl reaction using random hexamers and the Reverse Transcription kit (Applied Biosystems, Foster City, CA, USA), according to the manufacturer's instructions, and was diluted in a final volume of 100 μl.


All samples were genotyped using 5' exonuclease Taqman® technology (Applied Biosystems, Foster City, CA, USA). Approximately 20 ng of genomic DNA was used in a 5 μl PCR reaction constituted by Taqman® master mix (Applied Biosystems, Foster City, CA, USA), the two primers, and FAM- and VIC- labelled probes, each designed to anneal specifically to either of the alleles of each single nucleotide polymorphism (SNP). After completion of the PCR, plates were analysed using the Allelic Discrimination analysis method in an ABI PRISM 7900 Sequence Detector (Applied Biosystems, Foster City, CA, USA). Genotyping was carried out in 384-well plates, with random replicates included, as well as no template controls (NTC), to ensure good quality of genotyping.

Quantification of differential allelic gene expression

Allele specific levels of gene expression were determined in heterozygous samples using Taqman® technology (Applied Biosystems, Foster City, CA, USA). Each PCR reaction contains a primer pair targeting the region surrounding the transcribed SNP (tSNP), and two probes that differ by a single nucleotide and are complementary to each of the SNP alleles. The probes are labelled with different fluorochromes (VIC and FAM), generating two signals for each sample during the real-time PCR. A standard curve was generated using a dilution series of heterozygote blood DNA, serving as a reference for the 50:50 allelic ratio. In this way, once the quantity of each allele in the different samples is extrapolated from the linear regression equation, a correction for the different background fluorescence and annealing characteristics of each probe is made automatically. We avoided using cDNA as control as we would be biasing our results towards the DAE ratio of the reference sample. In contrast, there is a perfect 50:50 ratio of the two alleles in a reference DNA sample from a heterozygote with normal chromosomal copy number.

All experiments contained replicates for each sample, and were repeated at least twice. Reactions were prepared as described for genotyping and run on an ABI PRISM 7900 Sequence Detector using the Absolute Quantification method. Ct values were obtained from ABI SDS 2.3 software (Applied Biosystems, Foster City, CA, USA) and quantities of allelic expression were extrapolated from the appropriate linear regression. We defined differential allelic expression as the log2 of the allelic-expression ratio calculated as log2 [(VIC- allele)/(FAM - allele)]. A gene was considered expressed if the PCR yielded Ct values lower than 35 cycles.

Quantification of total gene expression

Total gene expression levels were determined in B cell cDNA samples using Taqman® Gene Expression Assay pre-designed by Applied Biosystems. Results were normalized with the total levels of expression of Actin-β, GAPDH, 18S and β2M.

Statistical Analysis

Real-time PCR quantification statistics were carried out on Microsoft® Excel® 2004 software. Percentage of variation between replicates was calculated as % var = (SD/Mean). Linear regression for Taqman® standard curves was performed using the function linest.

All other statistical analysis was performed using the R statistical programming language [27]. For analysis of DAE in B cells a One Sample t-Test was performed to test for deviations from null hypothesis that the mean is smaller than log2(1.20). However, genes that presented trans-effects were analysed in absolute values with the highest expressing allele divided by the lowest one. For these genes the mean could not be used as the two sides of the distribution would cancel each other out, as explained in Results and Discussion. Furthermore, we performed variance analysis for these genes using F tests for variance. Using MLH3 as a reference gene with a distribution identical to DNA (no DAE), we compared all genes with trans-effect DAE.

To compare DAE across the three different tissue types, both two-sample t-test and Wilcoxon rank sum test with continuity correction were carried out, for comparing mean ratios and variances respectively.

Correlation analysis of the total level of expression vs genotype at the tSNP for these genes was performed using the Jonckheere-Terpstra test, a non-parametric test for trend among classes.


Analysis of differential allelic expression of candidate breast cancer genes in blood cells

We studied 12 candidate genes that are implicated in the aetiology of breast cancer (Table 1): BRCA1, BRCA2, C1qA, CCND3, EMSY, GPX1, GPX4, MLH3, MTHFR, NBS1, TP53 and TRXR2 [2831]. Functionally, these genes are in different pathways including: DNA-damage repair, complement and coagulation cascades, cell cycle and apoptosis. For each gene, we selected single nucleotide polymorphisms (SNPs) markers in both the coding and untranslated regions (transcribed SNP or tSNP), with high heterozygosity frequency. This increased the number of informative individuals in our sample sets. To ascertain differential allelic expression we measured allele-specific transcript levels using real-time PCR Taqman® technology in heterozygote samples for the selected tSNPs, and calculated the ratio of one allele versus the other (plotted as Log2 ratios in Figure 1a).

Figure 1

Comparison of allelic expression in blood vs LCL vs breast tissue. (a) Heterozygote individuals are represented as dots and are coloured blue for blood, black for LCL and green for breast tissue. The mean value for each distribution is shown as a red dot, and whiskers represent the 95% confidence interval of the mean. Dotted lines delimit the cut-off of 1.2 preferential allelic expression ratio [log2(1.2) = 0.263]. (b) Pairwise correlation analysis of the mean log2 allelic expression ratios for the three sample sets. Genes are identically colour coded in all three graphs. Dotted lines represent the linear regression applied to each tissue pair, and the respective equations and R2 values are indicated on each graph.

Table 1 List of coding polymorphisms investigated for differential allelic expression, with the respective possible functional effects

Initial experiments on technical and biological replicates (different cDNA preparations) revealed a very good correlation between replicates and low noise/variation intrinsic to the technique ([mean allelic ratio: standard deviation] <20% and 5% for biological and technical replicates, respectively). Based on this we defined the cut-off allelic expression ratio of 1.2 for DAE presence in a heterozygote sample (indicated on Figure 1a by the dotted lines).

We started by analysing allelic expression in primary B-lymphocytes (magnetically sorted CD19+ cells) from 170 unrelated healthy individuals. The aim was to first identify the genes that displayed preferential allelic expression in a homogeneous population of cells, without the possible interference of multiple cell types. We found that heterozygotes in 11 out of 12 genes (92%) showed allelic imbalances in gene expression (Table 2 and blue data points in Figure 1a). As in previous reports, we identified two patterns of differential expression. In BRCA2, C1qA, GPX1, GPX4, MTHFR and TP53, the same allele was consistently expressed at a higher level in all heterozygotes with allelic imbalance, indicating that for each of these genes the regulatory variant is in strong linkage disequilibrium (LD) with the assayed tSNP. On the other hand, in BRCA1, EMSY, CCND3, NBS1 and TRXR2 different heterozygotes preferentially expressed different alleles. In this case, expression is likely to be controlled by cis-acting elements that are not in strong linkage disequilibrium with the tSNP.

Table 2 Differential allelic expression ratios in fresh B cells

We found considerable variation in the magnitude of DAE across genes, with the largest seen in GPX4 (approximately six-fold difference between the levels of expression of the two alleles). For the genes which show DAE in at least one heterozygote, the proportion of heterozygotes with unequal expression ranged from 10% to 100% (Table 2). Genes with cis-acting elements in LD with the tSNP showed a direct correlation between mean allelic ratio and number of heterozygotes with variation (that is, the greater the mean allelic ratio, the higher the number of heterozygotes with DAE) [see Additional file 1]. Genes with cis- regulation not in LD with the tSNP had a distribution of ratios that was commonly centred on the 50:50 ratio (log2 = 0 in Figure 1a). This reflects the fact that a proportion of the heterozygotes for the tSNP will be homozygote for either of the regulatory polymorphic alleles, consequently generating an equimolar transcription level of the tSNP alleles.

Since peripheral blood is a heterogeneous tissue, composed of mononuclear cells (including B lymphocytes), polymorphonuclear cells and red blood cells, we compared the allelic expression ratios in cDNA extracted from total mononuclear cells from 59 healthy unrelated donors, with those obtained for sorted B cells. We found no significant differences in terms of pattern (cis- regulation in LD with tSNP or not) or mean ratio of DAE (data not shown).

Concerns have also been raised about the effect that transformation of lymphoblasts by Epstein-Barr Virus (EBV) may have on their expression profile [3234]. As most previous studies of differential allelic expression have been performed on lymphoblastoid cell lines (LCL) and future case-control studies using DAE could be performed on this type of sample, we next compared transformed and non-transformed lymphoblasts from 19 unrelated CEPH (Centre D'Étude du Polymorphism Humaine) individuals, who were heterozygous for most of the genes included in this study (black data points in Figure 1a). Eight out of 12 genes showed DAE in both transformed and fresh lymphocytes (BRCA2, CCND3 and GPX1 did not show DAE in LCL samples, in contrast to that observed in untransformed blood, whilst MLH3 showed the opposite). Of the eight genes which showed DAE, five presented mean ratios and patterns of allelic preferential expression that were comparable between the two samples sets (Figure 1a and Tables 3 and 4). BRCA1, EMSY and NBS1 showed significantly different results from those obtained for fresh blood, in terms of the mean fold difference between alleles and/or patterns of DAE (cis- regulation in tight LD or not).

Table 3 Comparison of differential allelic expression between breast and blood (fresh and transformed)
Table 4 Differential allelic expression concordance between breast and blood (fresh and transformed).

Comparison of DAE between breast tissue and blood cells

Next, we analysed 40 normal breast tissue samples (green data points in Figure 1a). Like blood, breast is a complex organ, comprising breast epithelium, stroma, and adipocytes. The comparison between fresh blood and breast tissue showed that DAE distributions were similar for eight out of nine genes (89%) that showed DAE in both tissues. BRCA1, BRCA2, CCND3, EMSY, GPX4, and TRXR2 had similar mean ratios (based on Wilcox rank sum test) and/or patterns. In breast samples, the same alleles of MTHFR and TP53 were preferentially expressed as in the fresh blood samples, although with significantly different mean ratios (Tables 2 and 3). GPX1 showed no DAE in breast and MLH3 showed no DAE in blood, whilst NBS1 showed discordant patterns and mean allelic ratio.

Comparing the results obtained for transformed lymphoblasts with those obtained for breast we found that 10 out of 11 genes showed preferential allelic expression in both types of sample. Of these 10, five genes were comparable in terms of pattern and mean allelic ratio (CCND3, GPX4, MLH3, MTHFR and NBS1), and four were comparable only on pattern (EMSY, TP53 and TRXR2). Only BRCA1 was significantly different between the two sample sets for both mean allelic ratio and pattern of preferential expression.

Pairwise correlation analysis with the mean allelic ratios obtained for the genes that showed evidence of DAE in each two sample types showed high correlation across types of tissue (blood vs LCL R2 = 0.88, blood vs breast R2 = 0.80 and breast vs LCL R2 = 0.87) (Figure 1b).

Comparison between DAE analysis and linkage mapping of expression phenotypes

For the genes that in blood showed evidence for regulation from within the same linkage disequilibrium block (that is, genes for which all heterozygotes with imbalances showed preferential expression of the same allele), we determined total levels of expression using Taqman technology, for individuals of all genotypes. After, we performed a correlation analysis of the total level of expression vs genotype at the tSNP for these genes. We found that only MTHFR showed a significant correlation (P < 0.005) (Figure 2). For other genes, for example TP53, we found that total expression did not vary with genotype, even though we found evidence for differential allelic expression in our initial analysis [see Additional file 2].

Figure 2

Comparison between identifying cis-regulatory elements by the total expression and the allelic expression ratio method in blood samples. For both genes, the graph on the left represents the correlation between the total level of expression and the genotype at the specified tSNP (P-values were calculated using the Jonckheere-Terpstra test). The graphs on the right-hand side represent the log2 ratios of allelic expression in heterozygotes only, for the corresponding tSNPs.


Here we report an extensive analysis of differential allelic expression in breast and blood (fresh and EBV-transformed) in a set of candidate breast cancer genes using a large set of unrelated individuals of European origin. We demonstrate the feasibility of using DAE in fresh blood or transformed lymphoblasts as a quantitative trait in future association studies for susceptibility to breast cancer, as well as an approach to select genes/loci from the lists produced by the genome-wide association studies for further functional investigation and validation. We found that the magnitude (fold difference) or pattern (direction) of differential expression was concordant in eight out of nine genes which showed DAE in breast and fresh blood. The results were similar between in fresh and transformed lymphoblasts.

As reviewed by Williams et al. [35], the percentage of genes reported to be affected by genetic variation at cis-acting regulatory elements differs greatly between approaches. The most common approach to studying variation in gene expression has been linkage analysis of total gene expression [13, 15, 17], which in general reports 1 to 20% cis-linkages. When using imbalances of allelic expression in heterozygotes, previous reports point to a much greater proportion of genes (30 to 60%) with cis-acting regulation [14, 16, 18, 3639]. The discrepancy of proportion of genes showing cis-regulation reported by the different methods, that we also observe in our study (for TP53 for example), is in our view possibly the effect of a feedback control loop that maintains the total level of expression at a constant level inside the cell, irrespective of the genotype at the regulatory element. A major advantage of studying DAE is that as allelic transcript levels are compared within the same cellular and haplotype context, environmental factors, including the level and availability of transcription factors, and genetic biases are eliminated increasing the ability to detect the cis-effects (reviewed in [40, 41]). However, the high percentage of DAE that we observe in out study is likely to be biased by our list of candidate genes, and will not necessarily correspond to the percentage of DAE genome-wide for any of the tissues we studied.

The previous studies that have looked at DAE in multiple tissues have reported significant differences for one gene examined in 12 individuals [25], and for 11% of 92 studied genes in six mice [26]. Our findings differ from these for two possible reasons: we have increased statistical power due to the larger number of samples analysed compared with the Wilkins et al study [25], but also because we analysed a smaller number of genes than Campbell et al [26].

We show that the difference between allelic expression levels can vary greatly (up to six-fold) across genes and based on previous reports [14, 15] we assume that the distribution pattern of DAE can shed light on the nature of the regulatory cis-element causing DAE [7, 16]. In addition, we note that the proportion of heterozygotes displaying DAE can differ greatly between genes (11% to 100% of heterozygotes). In only a small number of genes did all heterozygotes show preferential expression of one allele (two genes in all three sample sets, and two others in transformed lymphoblasts alone). In general, high mean allelic ratios correlated with a high proportion of samples with DAE. This suggests that regulatory variants have in fact non-binary, stochastic effects on the binding of transcription factors. If in some cases the effect is a very strong one, consequently more heterozygotes will present it. For example, in the case of a polymorphism that alters the affinity of binding of a transcription factor [6, 20], the extent of the effect we detect is probably a reflection of where on the binding site sequence the nucleotide change occurs, and how specific the binding of the transcription factor is to a certain sequence. All of these considerations become important when carrying out haplotype analysis to map which regulatory variants are mechanistically responsible for DAE.

Overall, our results suggest that although the total level of expression of a gene is under tissue-specific regulation (mainly due to the availability of the necessary transcription factors), DAE is mostly tissue-independent -exerting a similar effect in most tissues where the gene is expressed - and individual specific - regulated by the genetic variation make-up of each individual (even in the same cellular/tissue context). However, it is likely that tissue-specific levels of transcription factors might also influence the magnitude of DAE, as we noted in genes that show evidence of being regulated differently in the studied tissues (BRCA1 in B cells and breast, for example). Ideally, for validation, this study should be followed-up with another on matched blood and breast samples.


In conclusion, we show that differential allelic expression is common in candidate breast cancer genes and is comparable between tissues to some extent. Our findings support the further exploration of DAE in blood and breast as a quantitative phenotype to reveal regulatory genetic variation that predisposes to breast cancer (as in recent reports for breast and colorectal cancers [9, 42]), as well as a mean to prioritise the candidate susceptibility hits from the GWAS for follow-up functional studies and confirmation.



Centre D'Étude du Polymorphism Humaine


differential allelic expression


Epstein-Barr virus


genome-wide association studies


lymphoblastoid cell line


linkage disequilibrium


single nucleotide polymorphism


transcribed/transcript single nucleotide polymorphism.


  1. 1.

    Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, Ballinger DG, Struewing JP, Morrison J, Field H, Luben R, Wareham N, Ahmed S, Healey CS, Bowman R, collaborators S, Meyer KB, Haiman CA, Kolonel LK, Henderson BE, Le Marchand L, Brennan P, Sangrajrang S, Gaborieau V, Odefrey F, Shen CY, Wu PE, Wang HC, Eccles D, Evans DG, Peto J, et al: Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007, 447: 1087-1093. 10.1038/nature05887.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, Wacholder S, Wang Z, Welch R, Hutchinson A, Wang J, Yu K, Chatterjee N, Orr N, Willett WC, Colditz GA, Ziegler RG, Berg CD, Buys SS, McCarty CA, Feigelson HS, Calle EE, Thun MJ, Hayes RB, Tucker M, Gerhard DS, Fraumeni JF, Hoover RN, Thomas G, Chanock SJ: A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet. 2007, 39: 870-874. 10.1038/ng2075.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  3. 3.

    Stacey SN, Manolescu A, Sulem P, Rafnar T, Gudmundsson J, Gudjonsson SA, Masson G, Jakobsdottir M, Thorlacius S, Helgason A, Aben KK, Strobbe LJ, Albers-Akkers MT, Swinkels DW, Henderson BE, Kolonel LN, Le Marchand L, Millastre E, Andres R, Godino J, Garcia-Prats MD, Polo E, Tres A, Mouy M, Saemundsdottir J, Backman VM, Gudmundsson L, Kristjansson K, Bergthorsson JT, Kostic J, et al: Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer. Nat Genet. 2007, 39: 865-869. 10.1038/ng2064.

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Stacey SN, Manolescu A, Sulem P, Thorlacius S, Gudjonsson SA, Jonsson GF, Jakobsdottir M, Bergthorsson JT, Gudmundsson J, Aben KK, Strobbe LJ, Swinkels DW, van Engelenburg KCA, Henderson BE, Kolonel LN, Le Marchand L, Millastre E, Andres R, Saez B, Lambea J, Godino J, Polo E, Tres A, Picelli S, Rantala J, Margolin S, Jonsson T, Sigurdsson H, Jonsdottir T, Hrafnkelsson J, et al: Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer. Nat Genet. 2008, 40: 703-706. 10.1038/ng.131.

    CAS  Article  PubMed  Google Scholar 

  5. 5.

    Ahmed S, Thomas G, Ghoussaini M, Healey CS, Humphreys MK, Platte R, Morrison J, Maranian M, Pooley KA, Luben R, Eccles D, Evans DG, Fletcher O, Johnson N, dos Santos Silva I, Peto J, Stratton MR, Rahman N, Jacobs K, Prentice R, Anderson GL, Rajkovic A, Curb JD, Ziegler RG, Berg CD, Buys SS, McCarty CA, Feigelson HS, Calle EE, Thun MJ, et al: Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2. Nat Genet. 2009, 41: 585-590. 10.1038/ng.354.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Meyer KB, Maia AT, O'Reilly M, Teschendorff AE, Chin SF, Caldas C, Ponder BA: Allele-Specific Up-Regulation of FGFR2 Increases Susceptibility to Breast Cancer. PLoS Biol. 2008, 6: e108-10.1371/journal.pbio.0060108.

    Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Dermitzakis E, Stranger B: Genetic variation in human gene expression. Mamm Genome. 2006, 17: 503-508. 10.1007/s00335-006-0005-y.

    Article  PubMed  Google Scholar 

  8. 8.

    Yan H, Zhou W: Allelic variations in gene expression. Current opinion in oncology. 2004, 16: 39-43. 10.1097/00001622-200401000-00008.

    Article  PubMed  Google Scholar 

  9. 9.

    Chen X, Weaver J, Bove BA, Vanderveer LA, Weil SC, Miron A, Daly MB, Godwin AK: Allelic Imbalance in BRCA1 and BRCA2 Gene Expression Is Associated with an Increased Breast Cancer Risk. Hum Mol Genet. 2008, 17: 1336-1348. 10.1093/hmg/ddn022.

    CAS  Article  PubMed  Google Scholar 

  10. 10.

    Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, Linsley PS, Mao M, Stoughton RB, Friend SH: Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003, 422: 297-302. 10.1038/nature01434.

    CAS  Article  PubMed  Google Scholar 

  11. 11.

    Cheung V, Conlin L, Weber T, Arcaro M, Jen K, Morley M, Spielman R: Natural variation in human gene expression assessed in lymphoblastoid cells. Nat Genet. 2003, 33: 422-425. 10.1038/ng1094.

    CAS  Article  PubMed  Google Scholar 

  12. 12.

    Bray N, Buckland P, Owen M, O'Donovan M: Cis-acting variation in the expression of a high proportion of genes in human brain. Hum Genet. 2003, 113: 149-153.

    PubMed  Google Scholar 

  13. 13.

    Cheung V, Spielman R, Ewens K, Weber T, Morley M, Burdick J: Mapping determinants of human gene expression by regional and genome-wide association. Nature. 2005, 437: 1365-1369. 10.1038/nature04244.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Lo H, Wang Z, Hu Y, Yang H, Gere S, Buetow K, Lee M: Allelic variation in gene expression is common in the human genome. Genome Res. 2003, 13: 1855-1862. 10.1101/gr.885403.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Morley M, Molony C, Weber T, Devlin J, Ewens K, Spielman R, Cheung V: Genetic analysis of genome-wide variation in human gene expression. Nature. 2004, 430: 743-747. 10.1038/nature02797.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Pant P, Tao H, Beilharz E, Ballinger D, Cox D, Frazer K: Analysis of allelic differential expression in human white blood cells. Genome Res. 2006, 16: 331-339. 10.1101/gr.4559106.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Stranger B, Forrest M, Clark A, Minichiello M, Deutsch S, Lyle R, Hunt S, Kahl B, Antonarakis S, Tavaré S, Deloukas P, Dermitzakis E: Genome-wide associations of gene expression variation in humans. PLoS Genet. 2005, 1: e78-10.1371/journal.pgen.0010078.

    Article  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Yan H, Yuan W, Velculescu V, Vogelstein B, Kinzler K: Allelic variation in human gene expression. Science. 2002, 297: 1143-10.1126/science.1072545.

    CAS  Article  PubMed  Google Scholar 

  19. 19.

    Lee PD, Ge B, Greenwood CM, Sinnett D, Fortin Y, Brunet S, Fortin A, Takane M, Skamene E, Pastinen T, Hallett M, Hudson TJ, Sladek R: Mapping cis-acting regulatory variation in recombinant congenic strains. Physiol Genomics. 2006, 25: 294-302. 10.1152/physiolgenomics.00168.2005.

    CAS  Article  PubMed  Google Scholar 

  20. 20.

    Tao H, Cox DR, Frazer KA: Allele-specific KRT1 expression is a complex trait. PLoS Genet. 2006, 2: e93-10.1371/journal.pgen.0020093.

    Article  PubMed  PubMed Central  Google Scholar 

  21. 21.

    GuhaThakurta D, Xie T, Anand M, Edwards SW, Li G, Wang SS, Schadt EE: Cis-regulatory variations: a study of SNPs around genes showing cis-linkage in segregating mouse populations. BMC Genomics. 2006, 7: 235-10.1186/1471-2164-7-235.

    Article  PubMed  PubMed Central  Google Scholar 

  22. 22.

    Pastinen T, Ge B, Gurd S, Gaudin T, Dore C, Lemire M, Lepage P, Harmsen E, Hudson TJ: Mapping common regulatory variants to human haplotypes. Hum Mol Genet. 2005, 14: 3963-3971. 10.1093/hmg/ddi420.

    CAS  Article  PubMed  Google Scholar 

  23. 23.

    Spielman RS, Bastone LA, Burdick JT, Morley M, Ewens WJ, Cheung VG: Common genetic variants account for differences in gene expression among ethnic groups. Nat Genet. 2007, 39: 226-231. 10.1038/ng1955.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, Ingle CE, Dunning M, Flicek P, Koller D, Montgomery S, Tavare S, Deloukas P, Dermitzakis ET: Population genomics of human gene expression. Nat Genet. 2007, 39: 1217-1224. 10.1038/ng2142.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  25. 25.

    Wilkins JM, Southam L, Price AJ, Mustafa Z, Carr A, Loughlin J: Extreme context specificity in differential allelic expression. Hum Mol Genet. 2007, 16: 537-546. 10.1093/hmg/ddl488.

    CAS  Article  PubMed  Google Scholar 

  26. 26.

    Campbell CD, Kirby A, Nemesh J, Daly MB, Hirschhorn JN: A survey of allelic imbalance in F1 mice. Genome Res. 2008, 18: 555-563. 10.1101/gr.068692.107.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Team RDC: R: A Language and Environment for Statistical Computing. []

  28. 28.

    Pharoah PD, Tyrer J, Dunning AM, Easton DF, Ponder BA, SEARCH Investigators: Association between common variation in 120 candidate genes and breast cancer risk. PLoS Genet. 2007, 3: e42-10.1371/journal.pgen.0030042.

    Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Racila E, Racila DM, Ritchie JM, Taylor C, Dahle C, Weiner GJ: The pattern of clinical breast cancer metastasis correlates with a single nucleotide polymorphism in the C1qA component of complement. Immunogenetics. 2006, 58: 1-8. 10.1007/s00251-005-0077-y.

    CAS  Article  PubMed  Google Scholar 

  30. 30.

    Song H, Ramus SJ, Quaye L, DiCioccio RA, Tyrer J, Lomas E, Shadforth D, Hogdall E, Hogdall C, McGuire V, Whittemore AS, Easton DF, Ponder BA, Kjaer SK, Pharoah PD, Gayther SA: Common variants in mismatch repair genes and risk of invasive ovarian cancer. Carcinogenesis. 2006, 27: 2235-2242. 10.1093/carcin/bgl089.

    CAS  Article  PubMed  Google Scholar 

  31. 31.

    Udler M, Maia AT, Cebrian A, Brown C, Greenberg D, Shah M, Caldas C, Dunning A, Easton D, Ponder B, Pharoah P: Common germline genetic variation in antioxidant defense genes and survival after diagnosis of breast cancer. J Clin Oncol. 2007, 25: 3015-3023. 10.1200/JCO.2006.10.0099.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Ryan JL, Kaufmann WK, Raab-Traub N, Oglesbee SE, Carey LA, Gulley ML: Clonal evolution of lymphoblastoid cell lines. Lab Invest. 2006, 86: 1193-1200. 10.1038/labinvest.3700472.

    CAS  Article  PubMed  Google Scholar 

  33. 33.

    Attanasio C, Reymond A, Humbert R, Lyle R, Kuehn MS, Neph S, Sabo PJ, Goldy J, Weaver M, Lee K, Haydock A, Dermitzakis ET, Dorschner MO, Antonarakis SE, Stamatoyannopoulos JA: Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells. Genome Biol. 2008, 9: R168-10.1186/gb-2008-9-12-r168.

    Article  PubMed  PubMed Central  Google Scholar 

  34. 34.

    Choy E, Yelensky R, Bonakdar S, Plenge RM, Saxena R, De Jager PL, Shaw SY, Wolfish CS, Slavik JM, Cotsapas C, Rivas M, Dermitzakis ET, Cahir-McFarland E, Kieff E, Hafler D, Daly MJ, Altshuler D: Genetic analysis of human traits in vitro: drug response and gene expression in lymphoblastoid cell lines. PLoS Genet. 2008, 4: e1000287-10.1371/journal.pgen.1000287.

    Article  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Williams RB, Chan EK, Cowley MJ, Little PF: The influence of genetic variation on gene expression. Genome Res. 2007, 17: 1707-1716. 10.1101/gr.6981507.

    CAS  Article  PubMed  Google Scholar 

  36. 36.

    Bjornsson HT, Albert TJ, Ladd-Acosta CM, Green RD, Rongione MA, Middle CM, Irizarry RA, Broman KW, Feinberg AP: SNP-specific array-based allele-specific expression analysis. Genome Res. 2008, 18: 771-779. 10.1101/gr.073254.107.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Cheung VG, Bruzel A, Burdick JT, Morley M, Devlin JL, Spielman RS: Monozygotic Twins Reveal Germline Contribution to Allelic Expression Differences. Am J Hum Genet. 2008, 82: 1357-1360. 10.1016/j.ajhg.2008.05.003.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  38. 38.

    Verlaan DJ, Ge B, Grundberg E, Hoberman R, Lam K, Koka V, Dias J, Gurd S, Martin N, Mallmin H, Nilsson O, Harmsen E, Kwan T, Pastinen TM: Targeted screening of cis-regulatory variation in human haplotypes. Genome Res. 2008, 19: 118-127. 10.1101/gr.084798.108.

    Article  PubMed  Google Scholar 

  39. 39.

    Serre D, Gurd S, Ge B, Sladek R, Sinnett D, Harmsen E, Bibikova M, Chudin E, Barker DL, Dickinson T, Fan JB, Hudson TJ: Differential allelic expression in the human genome: a robust approach to identify genetic and epigenetic cis-acting mechanisms regulating gene expression. PLoS Genet. 2008, 4: e1000006-10.1371/journal.pgen.1000006.

    Article  PubMed  PubMed Central  Google Scholar 

  40. 40.

    Pastinen T, Ge B, Hudson TJ: Influence of human genome polymorphism on gene expression. Hum Mol Genet. 2006, 15 (Spec No 1): R9-16. 10.1093/hmg/ddl044.

    CAS  Article  PubMed  Google Scholar 

  41. 41.

    Pastinen T, Hudson T: Cis-acting regulatory variation in the human genome. Science. 2004, 306: 647-650. 10.1126/science.1101659.

    CAS  Article  PubMed  Google Scholar 

  42. 42.

    Valle L, Serena-Acedo T, Liyanarachchi S, Hampel H, Comeras I, Li Z, Zeng Q, Zhang HT, Pennison MJ, Sadim M, Pasche B, Tanner SM, de la Chapelle A: Germline allele-specific expression of TGFBR1 confers an increased risk of colorectal cancer. Science. 2008, 321: 1361-1365. 10.1126/science.1159397.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

Download references


The authors would like to thank Dr Paul Pharoah, Dr Cherie Blenkiron, Dr John Marioni, Professor Simon Tavaré and staff at Strangeways Laboratories for help with technical aspects of the work, and Dr Kerstin Meyer and Dr Laura Blackburn for help with preparation of the manuscript.

The work from the authors' laboratory was supported by grants from Cancer Research UK and (ATM, IS, AJXL, LJ, MOR, CC and BAJP). BAJP is the Li Ka Shing Professor of Oncology at the University of Cambridge. We would also like to acknowledge the support of The University of Cambridge, Cancer Research UK, Hutchison Whampoa Limited and NIHR Cambridge Biomedical Research Centre.

Acknowledged contributions were funded by grants from Cancer Research UK (PP, CB, JM, ST and KM).

Author information



Corresponding author

Correspondence to Ana-Teresa Maia.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

ATM conceived and designed the study, prepared samples, carried out experiments, prepared and edited the manuscript. IS carried out sample preparation and performed experiments. AJXL performed experiments. LJ carried out sample collection and elaborated ethics applications. MOR contributed to sample preparation. CC and BAJP helped conceiving the study and editing the manuscript. All authors have read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Adobe Acrobat document containing a graph of the correlation between mean ratio of DAE and percentage of heterozygotes with DAE in B cells. (PDF 194 KB)


Additional file 2: Adobe Acrobat document containing the graphs for all extra genes in the total level of expression vs genotype correlation analysis. P values correspond to the Jonckheere-Terpstra test, like for Figure 2. (PDF 540 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Maia, AT., Spiteri, I., Lee, A.J. et al. Extent of differential allelic expression of candidate breast cancer genes is similar in blood and breast. Breast Cancer Res 11, R88 (2009).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI:


  • Lymphoblastoid Cell Line
  • Fresh Blood
  • Allelic Imbalance
  • Preferential Expression
  • Allelic Expression