A signature-based method for indexing cell cycle phase distribution from microarray profiles

Mizuno, Hideaki; Nakanishi, Yoshito; Ishii, Nobuya; Sarai, Akinori; Kitada, Kunio

doi:10.1186/1471-2164-10-137

A signature-based method for indexing cell cycle phase distribution from microarray profiles

Methodology article
Open access
Published: 30 March 2009

Volume 10, article number 137, (2009)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

A signature-based method for indexing cell cycle phase distribution from microarray profiles

Download PDF

Hideaki Mizuno^1,2,
Yoshito Nakanishi¹,
Nobuya Ishii¹,
Akinori Sarai² &
…
Kunio Kitada¹

6361 Accesses
27 Citations
Explore all metrics

Abstract

Background

The cell cycle machinery interprets oncogenic signals and reflects the biology of cancers. To date, various methods for cell cycle phase estimation such as mitotic index, S phase fraction, and immunohistochemistry have provided valuable information on cancers (e.g. proliferation rate). However, those methods rely on one or few measurements and the scope of the information is limited. There is a need for more systematic cell cycle analysis methods.

Results

We developed a signature-based method for indexing cell cycle phase distribution from microarray profiles under consideration of cycling and non-cycling cells. A cell cycle signature masterset, composed of genes which express preferentially in cycling cells and in a cell cycle-regulated manner, was created to index the proportion of cycling cells in the sample. Cell cycle signature subsets, composed of genes whose expressions peak at specific stages of the cell cycle, were also created to index the proportion of cells in the corresponding stages. The method was validated using cell cycle datasets and quiescence-induced cell datasets. Analyses of a mouse tumor model dataset and human breast cancer datasets revealed variations in the proportion of cycling cells. When the influence of non-cycling cells was taken into account, "buried" cell cycle phase distributions were depicted that were oncogenic-event specific in the mouse tumor model dataset and were associated with patients' prognosis in the human breast cancer datasets.

Conclusion

The signature-based cell cycle analysis method presented in this report, would potentially be of value for cancer characterization and diagnostics.

Perseus: A Bioinformatics Platform for Integrative Analysis of Proteomics Data in Cancer Research

Cell Cycle Regulation by Checkpoints

Profiling Tumor Infiltrating Immune Cells with CIBERSORT

Background

A fundamental characteristic of all cancers is cell cycle deregulation [1]. Although diverse factors such as point mutation, gene amplification, activation of oncogenes, inactivation of tumor suppressors, and hypermethylation are involved in cancer development, their influence ultimately is on the cell cycle machinery. Therefore, various methods of cell cycle phase estimation have been developed. The M phase indicator mitotic index, the number of mitotic bodies in a microscopic field, and the S-phase fraction, a DNA flow cytometry determination, are used to measure the tumor proliferation rate and are predictive for breast cancer prognosis [2–4]. Immunohistochemistry (IHC) against cell cycle markers is another tool. For example, the expression of G1-S transition marker cyclin E, S-G2 marker cyclin A, or S-G2-M marker geminin are predictive of poor prognosis of breast cancers [2–5]. However, these methods rely on one or few measurements and consequently provide a limited scope of information. There is a need for more systematic methods of cell cycle phase analysis, such as microarray-based techniques [3, 4].

Gene expression signatures, which are capable of predicting the state of a sample from a given microarray dataset, are the emerging technology for developing cancer therapeutics. The "70-gene signature" from a breast cancer dataset has shown predictive power for the risk of recurrence [6]. The "pathway deregulation signature" has shown the ability to predict pathway status and to characterize breast, lung and ovarian cancers [7]. The "chemotherapy response signature" has accurately predicted clinical response to cytotoxic drugs for breast and ovarian cancers [8]. Here, we report the development of the "cell cycle signature (CCS)" which indexes the cell cycle phase distribution from microarray profiles considering both cycling and non-cycling cells. The CCS method depicted "buried" cell cycle phase distributions that were oncogenic-event specific in a mouse tumor model dataset and were associated with patients' prognosis in human breast cancer datasets. The method has a potential to be of value in the characterization and diagnosis of cancers.

Results

Algorithm

To analyze cell cycle phase distribution, a series of CCSs were created as described in Methods (Fig. 1A, Additional file 1). The CCS masterset, 252 genes that express preferentially in cycling cells and in a cell cycle-regulated manner, represents the entire cell cycle and is henceforth denoted as CCS_cycling. Eighteen CCS subsets, each composed of genes whose expressions peak at a specific stage of the cell cycle, represent the phases of the cell cycle and are denoted using the subscript naming convention of CCS_phase. For example, the CCS subsets for the G1 phase are expressed as CCS_G1, for the G2-M phase as CCS_G2-M, and so on.

Solid tumors are composed of various proportions of cycling and non-cycling cells [9], and cell cycle phase distributions can be assessed as per total cells or as per cycling cells. Since microarray measurements are the net expression of all cells in the sample, the data is generally per total cells. To obtain data per cycling cells from a given microarray dataset (Fig. 1B, total gene dataset), a subdataset is created by extracting the expression values of CCS_cycling genes (Fig. 1B, cycling gene dataset). Then, both the total and the cycling gene datasets undergo quantile normalization which gives the same expression value distribution for each sample [10]. In the total gene dataset, normalization is done on all genes. On the other hand, in the cycling gene dataset, normalization is done only on the cycling genes. Because genes in the CCS_cycling preferentially express in cycling cells, the influence of non-cycling cells would be limited for the cycling gene dataset. Scores for each CCS are calculated for both datasets. CCS_cycling and CCS_phase scores for the total gene dataset could index the proportion of cycling cells and of cells at the designated cell cycle phase per total cells, respectively. Similarly, CCS_phase scores for the cycling gene dataset could index the proportion of cells at the cell cycle phase per cycling cells. CCS_cycling scores for the cycling gene dataset could index the proportion of cycling cells per cycling cells and thus would show constant values.

Validation

In the preliminary analysis of the Whitfiled et al. cell cycle dataset [11], CCS indexed cell cycle phase distribution as expected (Additional file 2). To confirm that the CCS method is valid for independent datasets, a cell cycle dataset of synchronized HCT116 cells was prepared and analyzed. As shown in Fig. 2A, similar heat map patterns were observed for the total and the cycling gene datasets. Differences in the CCS_cycling scores for both the total and the cycling gene datasets were slight in the situation where most cells were expected to be in the cell cycle. Peaks in the CCS_phase scores shifted according to cell cycle progression (Fig. 2A, DMSO 0–10 h), and peaks ceased around the M phase in cells treated with the mitosis inhibitor nocodazole (Fig. 2A, Ncz 7–10 h), consistent with DNA flow cytometry measurements (Fig. 2B). The CCS method was able to index cell cycle phase distribution even for an independent cell cycle dataset derived from a different cell line and a different platform.

Solid tumors are not solely composed of cycling cells but contain various numbers of non-cycling cells [9]. Theoretically, changes in the proportion of cycling cells in the sample are expected to evenly change the proportion of cells in all cell cycle phases. To examine the influence of changes in the proportion of cycling cells on CCS scores, analysis was conducted on the Fournier et al. dataset [12] of profiles of human mammary epithelial cells (HMECs) cultured in leucine-rich extra cellular matrix. In this system, HMECs grow exponentially and then enter a quiescent state [12, 13]. As shown in Fig. 2C, CCS_cycling and CCS_phase scores for the total gene dataset uniformly decreased as the HMECs transitioned from cycling (day 3) to non-cycling state (day 7) (Fig. 2C, upper panel). According to the DNA flow cytometry estimation in the original report, the S phase and G2+M phase fraction size decreased from 15% ± 5.1 (day 5) to 5.5% ± 0.5 (day 7), and from 12% ± 1.1 (day 5) to 7% ± 2.5 (day 7), respectively (day 3 data was not available) [12]. On the other hand, the G0+G1 phase fraction size increased from 73% ± 6.3 (day 5) to 86% ± 4.6 (day 7). Due to the inability of DNA flow cytometry to distinguish cells in G0 from cells in G1, decisive conclusions cannot be made. However, from two situations in which 1) 3D cultured HMECs gradually underwent growth arrest and 2) CCS_G1 scores decreased at day 7, this increase can be regarded as an increase in the number of cells at the G0 phase as well as a decrease in the number of cells at the G1 phase. To our surprise, the heat map for the cycling gene dataset showed increasing CCS_G1 scores towards day 7 (Fig. 2C, lower panel). This increase in CCS_G1 scores could be due to the G1 phase prolongation which is known to occur under G0-inducing conditions, such as serum starvation and development [14, 15]. For further confirmation, we analyzed the Cam et al. dataset [16] of profiles of growing and serum starved T98 breast cancer cells. Similar to the results for HMECs, a uniform decrease in CCS_cycling and CCS_phase scores for the total gene dataset was observed in serum-starved cells (Fig. 2D, upper panel). In addition, an increase in CCS_G1 scores for the cycling gene dataset was observed (Fig. 2D, lower panel), indicating prolongation of the G1 phase. Taken together, these results suggested that changes in the proportion of cycling cells in the sample can be presented as uniform changes in CCS_cycling and CCS_phase scores for the total gene dataset.

The mammalian cell cycle is a highly regulated and conserved process [17]. To investigate whether CCS derived from human datasets can be used to closely related species, the Yamamoto et al. dataset [18], cell cycle profiles (G0 to S) of NIH3T3 mouse fibroblasts, was analyzed. The heat map showed changes in the proportion of cycling cells (Additional file 3: upper panel) as well as cell cycle progression from G1 to S phase (Additional file 3: lower panel), as quiescent cells (FGF 0 h) re-enter the cell cycle, progress through G1 phase and enter S phase (FGF 12 h). These results showed that the human CCS created in this study can be applied for the analysis of mouse datasets.

Analysis on mouse tumor model dataset

The CCS method was applied to the Herschkowitz et al. dataset [19] which contains 122 profiles of 13 different mouse mammary carcinoma models and normal samples. The authors reported that some models developed similar tumors (homogeneous models) of gene expression and histological phenotype while other models showed heterogeneity (heterogeneous models) and gave "randomness of the molecular basis of tumor initiation" as the reason for the heterogeneity. As shown in Fig. 3A, CCS_cycling and CCS_phase scores for the total gene dataset for the normal samples were consistently very low, while scores for tumors were varying degrees higher, indicating variation in the proportion of cycling cells. It is reasonable that heterogeneous models show variation in CCS_cycling and CCS_phase scores. However, variation was also seen in each homogeneous model, although Tag models had a tendency towards higher scores and the Neu model had a tendency towards lower scores. In contrast, CCS_phase scores for the cycling gene dataset were similar within the same homogeneous models, except in the Myc model (Fig. 3A, lower panel). To illustrate this in detail, CCS_phase scores of several models for both datasets were plotted as shown in Fig. 3B. It can be seen that each model has a specific cell cycle phase distribution. High CCS_G1 and low CCS_S-G2-M scores were seen in the Neu model. The opposite pattern was seen in one of the Tag models. The Myc model showed two different cell cycle phase distributions (Additional file 4) and the reason is not clear. However, because Myc has been reported to induce genomic instability and to contribute to tumorigenesis through a dominant mutator effect [20], additional oncogenic events may have been induced. In all cases, plots for the total gene dataset were vertically shifted in varying degrees which would be due to the influence of non-cycling cells, as presented in HMECs and T98 cells. On the other hand, plots for the cycling gene dataset showed minimal variation in alignment. These results indicated two findings: (i) the cell cycle phase distribution reflects the oncogenic events in tumors, and (ii) the cell cycle phase distribution can be better indexed when the influence of non-cycling cells is taken into account. The advantage of the CCS method can be underscored considering that the current cell cycle phase estimation methods relying on one or few measurements are not sufficient to depict cell cycle phase distribution or to distinguish non-cycling cells.

Analysis on human breast cancer datasets

The CCS method was applied to the Ivshina et al. dataset [21] from a panel of 249 human breast cancers. The heat map for the total gene dataset showed various CCS_cycling scores, indicative of variations in the proportion of cycling cells in the sample (Fig. 4A, upper panel). The CCS_phase scores were not uniformly changed in some patients, suggesting that cell cycle phase distributions were also altered. The heat map for the cycling gene dataset displayed a rolling wave pattern (Fig. 4A, lower panel). Patients with high CCS_cycling scores for the total gene dataset had high CCS_S-G2-M and low CCS_G1 scores for the cycling gene dataset, but several exceptions existed (Fig. 4A), reminding the influence of non-cycling cells found in the analysis of mouse tumor models. Clinical annotations were available for this dataset and so the relevance between CCS scores and patient prognosis were tested. Patients were dichotomized by the median of each CCS score and then the risk differences between the two groups for disease free survival (DFS) were assessed using log-rank test and Cox univariate analysis (Fig. 4B). The CCS_cycling score for the total gene dataset was significantly predictive of poor prognosis (Hazard ratio [HR] = 1.98, p = 0.00134) (Fig. 4B and Fig. 4C, CCS_cycling), consistent with the common view that a larger number of cycling cells correlates with worse clinical outcome. The CCS_S-G2-M and several CCS_G1 scores for the total gene dataset were also predictive of poor prognosis. On the other hand, CCS_G1 scores for the cycling gene dataset had an adverse prognostic power and gave the highest prognostic value among the tests (HR = 0.41, p = 0.0000367) (Fig. 4B and Fig. 4C, CCS_G1).

To exclude the possibility of dataset specificity, the CCS method was also applied to the Langerød et al. dataset [22] from a panel of 80 breast cancers. Similar results were obtained (Additional file 5). For the total gene dataset, variations in CCS_cycling scores and non-uniform changes in CCS_phase scores in some patients were observed. Patients with high CCS_cycling scores for the total gene dataset had high CCS_S-G2-M and low CCS_G1 scores for the cycling gene dataset with some exceptions. CCS_G1 scores for the cycling gene dataset were predictive for DFS as with the Ivshina et al. dataset and gave the highest prognostic value (HR = 0.41, p = 0.00553) (Additional file 5). Taken together, these results indicated that: (i) variations in the proportion of cycling cells exist among tumors, (ii) the proportion of cycling cells correlated to the cell cycle phase distribution per cycling cells with several exceptions, and (iii) the cell cycle phase distribution per cycling cells better associated with patients' prognosis.

Discussion and conclusion

In this study, we developed a signature-based method to index cell cycle phase distribution from microarray profiles under consideration of cycling and non-cycling cells, providing two sources of valuable information on cancers.

One source of information is the proportion of cycling cells in the sample. The rationale of most current cell cycle phase estimation methods, including mitotic index, S phase fraction and IHC against cell cycle markers, is that the high proliferative tumors leading to poor prognosis contain more cycling cells. In the analysis of the human breast cancer datasets, higher CCS_cycling scores for the total gene dataset, indicative of a larger number of cycling cells in the sample, did associate with poor prognosis. Naturally, it can be thought that an increase in the number of cycling cells leads to a uniform increase in the number of cells at all cell cycle phases. However, some patients showed non-uniform changes in CCS_phase scores for the total gene dataset (Fig. 4A, upper panel), suggesting that each cell cycle phase was not evenly changed. Similarly, Whitfield et al. observed that some cell cycle-regulated genes did not express in correlation with proliferation status in some breast cancers [11]. Furthermore, although the G1 phase is a part of the cell cycle, G1 phase marker cyclin D1 often negatively correlates with poor prognosis of breast cancers [2–4, 23]. Therefore, considering only the proportion of cycling cells seems insufficient.

The other source of information is cell cycle phase distribution. A number of oncogenic events are known to perturb the duration of cell cycle phases. For example, activation of oncogenes such as v-H-ras, v-Src, v-Raf, cyclin D1, cyclin E, and c-myc shortens the G1 phase [24–26]. Loss of tumor suppressor Pten shortens the G1 phase [27] and loss of Lzts1 and Lats2 shortens the M phase [28, 29]. Viral infections such as SV40-Tag and HTLV-1 Tax also shorten the G1 phase [30, 31]. Such perturbations in the cell cycle phase duration subsequently alter the cell cycle phase distribution. Thus, the cell cycle phase distribution per cycling cells would reflect the biology of cancers. Actually, in the analysis of mouse tumor models, oncogenic-event specific cell cycle phase distributions were observed. This suggests that the cell cycle phase distribution under consideration of both cycling and non-cycling cells has a potential for cancer characterization.

A model of tumors with different cell cycle phase distributions is proposed in Fig. 5. Oncogenic events perturb the cell cycle each in a unique way which in turn alters the cell cycle phase distribution as well as the proliferation rate. High proliferative tumors grow rapidly and thereby produce a large number of cycling cells. The opposite is the true for low proliferative tumors. However, high proliferative tumors with a small number of cycling cells or low proliferative tumors with a large number of cycling cells would exist at a low probability. This model would account for non-uniform changes in CCS_phase scores for the total gene dataset found in some breast cancer patients, the Whitfield et al.'s observation, and the adverse prognostic value of cyclin D1. Current cell cycle phase estimation methods are insufficient for detecting such cancers. Mitotic index and S-phase fraction do not recognize non-cycling cells. Combinatorial IHC [32] still needs improvement and validation. Shetty et al. reported a relationship between breast cancer grade and G1 phase length estimated from the ratio of geminin and Ki67 IHC measurements; however, it was not significant [33]. The CCS method, on the other hand, indexed the cell cycle phase distribution under consideration of cycling and non-cycling cells, and showed a potential for characterizing cancers.

Previously, as an alternative microarray-based cell cycle analysis technique, Lu et al. introduced the "expression deconvolution" method [34]. To predict the cell cycle phase distribution of yeast, they prepared about 700 equations with 5 variables representing 5 cell cycle phases and searched for the optimal solution. The method has comparable or even better potential to improve cancer characterization than the CCS method. However, it requires a tremendous amount of computational resources to find the optimal solution and avoid the local minimum, especially as the number of variables increases (18 + 1 phases were analyzed in our study). There are some hurdles that need to be overcome before high resolution cell cycle phase analysis is practical and we are currently tackling some of them.

Methods

Cell Culture and Synchronization

The HCT116 colorectal cancer cell line (ATCC) was grown in McCoy's 5A medium modified (Sigma-Aldrich) with 10% FBS (JBS) and maintained at 37°C and 5% CO₂. Synchronous culture was obtained by incubating cells for 19 h in 2 mM of thymidine, followed by a 9-h incubation in normal medium and a second 16-h incubation in thymidine (2 mM). Cells were washed with normal medium followed by treatment with DMSO for 0, 2, 4, 6, 7, 8, 9, and 10 h as a control or 0.1 mg/ml nocodazole (Sigma-Aldrich) for 7, 8, 9, and 10 h. Cells were stained with propidium iodide and analyzed with DNA flow cytometry.

Microarray

Total RNA was reverse transcribed, labeled, and hybridized to Human Genome U133 Plus 2.0 arrays (Affymetrix) according to the manufacturer's instructions. The expression value for each probe was calculated using the GC-RMA algorithm. The microarray data were deposited in the GEO database (GEO number: GSE14103).

Signature development

Two datasets were used to create the CCS. First, the Whitfield et al. dataset [11] of 47 profiles of synchronized Hela S3 cells for 0–46 h time points (1-h intervals) after release of double thymidine block was analyzed to identify genes which express in a cell cycle-regulated manner. Raw signal intensities from the Cy5 and Cy3 channels were quantile normalized for each sample. Cy5/Cy3 ratios were log-transformed and quantile normalized across the arrays. Resulting values were smoothened using a moving average with a window size of 3 and were standardized by Z-transformation. Then, Fourier transformations were applied to each probe for 1-40-h periods in 15-min increments to identify periodicity and phase offset. Fourier transformation magnitudes for the known 51 cell cycle-regulated genes (listed in Whitfield et al. [11]) demonstrated a peak at the 14.75-h periodicity (Additional file 6). Thus, probes were selected using the criterion of

Z-score(P_i) > 1.96

where P_iis the Fourier transformation magnitude of the 14.75-h periodicity for probe i, i = 1,..., 44,160. The analysis yielded a list of 1,633 periodically expressed probes representing 976 genes. Second, the Bar-Joseph et al. dataset [35] of 17 profiles of synchronized primary human foreskin fibroblasts (FFs) for 0–32 h time points (2-h intervals) after release of double thymidine block and 2 profiles of serum starved FFs was investigated to identify genes which preferentially express in cycling cells. Serum starved cells are known to exit the cell cycle phase and to enter the non-cycling G0 phase [14], thus probes, whose expression is constantly higher throughout the cell cycle compared with non-cycling cells, were selected by the criterion

max(e_ij) < min(e_ik)

where e_ijis the expression value for probe i of serum-starved FFs sample j, j = 1, 2, and e_ikis the expression value for probe i of the synchronized FFs sample k, k = 1,..., 17. This yielded 2,304 out of 22,277 probes representing 1,779 genes. Then, from the intersection, a list of 335 probes representing 252 genes was obtained. These genes which preferentially express in cycling cells and in a cell cycle-regulated manner compose the CCS masterset (CCS_cycling). A number of well-known proliferation markers such as Ki67, geminin, TOP2A, aurora A, and PCNA [1–5, 32] were included in this signature, while some cell cycle-regulated genes such as p21 and cyclin G1 whose expression can be up-regulated in non-cycling cells [36, 37] were not. Lastly, according to their phase offsets, probes for CCS_cycling were assigned to 18 CCS subsets (CCS_phase) which correspond to a 360° cell cycle evenly divided into 20° increments, so that each CCS subset contains at least 3 genes. Because some genes were represented by multiple probes, the same genes may appear in different CCS subsets. The CCS gene list is shown in Additional file 1.

Signature scoring and data visualization

The given microarray dataset was used as the total gene dataset. The cycling gene dataset was created by extracting the expression values for CCS_cycling constituents from the total gene dataset. Both total and cycling gene datasets then underwent the following steps independently to give CCS scores. Expression values were log-transformed, quantile normalized to achieve the same expression value distribution for each sample, and standardized with Z-transformation across the samples. The Z-scores of the probes for each CCS genes were averaged for each sample and used as the CCS scores. To obtain robust scores, each CCS_phase score was adjusted by averaging with the neighboring CCS scores twice for a total of two cell cycle rounds. Heat maps were created by "Java Treeview" [38]. In the analysis of the mouse tumor model dataset, gene ID mapping was done using human-mouse orthology information from HomoloGene [39]. In the analysis of human breast cancer datasets, patients were ordered by peak in CCS_phase scores for the cycling gene dataset.

Survival analysis

Patients were dichotomized by the median of each CCS score. To assess the risk difference between two groups for DFS, Kaplan-Meier survival analysis, log-rank test and Cox univariate analysis were conducted using R "survival" package.

References

Whitfield ML, George LK, Grant GD, Perou CM: Common markers of proliferation. Nat Rev Cancer. 2006, 6: 99-106. 10.1038/nrc1802.
Article CAS PubMed Google Scholar
Landberg G, Roos G: The cell cycle in breast cancer. APMIS. 1997, 105: 575-89.
Article CAS PubMed Google Scholar
Beresford MJ, Wilson GD, Makris A: Measuring proliferation in breast cancer: practicalities and applications. Breast Cancer Res. 2006, 8: 216-10.1186/bcr1618.
Article PubMed Central PubMed Google Scholar
Colozza M, Azambuja E, Cardoso F, Sotiriou C, Larsimont D, Piccart MJ: Proliferative markers as prognostic and predictive tools in early breast cancer: where are we now?. Ann Oncol. 2005, 16: 1723-39. 10.1093/annonc/mdi352.
Article CAS PubMed Google Scholar
Gonzalez MA, Tachibana KE, Chin SF, Callagy G, Madine MA, Vowler SL, Pinder SE, Laskey RA, Coleman N: Geminin predicts adverse clinical outcome in breast cancer by reflecting cell-cycle progression. J Pathol. 2004, 204: 121-30. 10.1002/path.1625.
Article CAS PubMed Google Scholar
van 't Veer LJ, Dai H, Vijver van de MJ, He YD, Hart AA, Mao M, Peterse HL, Kooy van der K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415: 530-6. 10.1038/415530a.
Article PubMed Google Scholar
Bild AH, Yao G, Chang JT, Wang Q, Potti A, Chasse D, Joshi MB, Harpole D, Lancaster JM, Berchuck A, Olson JA, Marks JR, Dressman HK, West M, Nevins JR: Oncogenic pathway signatures in human cancers as a guide to targeted therapies. Nature. 2006, 439: 353-7. 10.1038/nature04296.
Article CAS PubMed Google Scholar
Potti A, Dressman HK, Bild A, Riedel RF, Chan G, Sayer R, Cragun J, Cottrill H, Kelley MJ, Petersen R, Harpole D, Marks J, Berchuck A, Ginsburg GS, Febbo P, Lancaster J, Nevins JR: Genomic signatures to guide the use of chemotherapeutics. Nat Med. 2006, 12: 1294-300. 10.1038/nm1491.
Article CAS PubMed Google Scholar
Baker FL, Sanger LJ, Rodgers RW, Jabboury K, Mangini OR: Cell proliferation kinetics of normal and tumour tissue in vitro: quiescent reproductive cells and the cycling reproductive fraction. Cell Prolif. 1995, 28: 1-15. 10.1111/j.1365-2184.1995.tb00035.x.
Article CAS PubMed Google Scholar
Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003, 19: 185-93. 10.1093/bioinformatics/19.2.185.
Article CAS PubMed Google Scholar
Whitfield ML, Sherlock G, Saldanha AJ, Murray JI, Ball CA, Alexander KE, Matese JC, Perou CM, Hurt MM, Brown PO, Botstein D: Identification of genes periodically expressed in the human cell cycle and their expression in tumors. Mol Biol Cell. 2002, 13: 1977-2000. 10.1091/mbc.02-02-0030..
Article PubMed Central CAS PubMed Google Scholar
Fournier MV, Martin KJ, Kenny PA, Xhaja K, Bosch I, Yaswen P, Bissell MJ: Gene expression signature in organized and growth-arrested mammary acini predicts good outcome in breast cancer. Cancer Res. 2006, 66: 7095-102. 10.1158/0008-5472.CAN-06-0515.
Article PubMed Central CAS PubMed Google Scholar
Petersen OW, Ronnov-Jessen L, Howlett AR, Bissell MJ: Interaction with basement membrane serves to rapidly distinguish growth and differentiation pattern of normal and malignant human breast epithelial cells. Proc Natl Acad Sci USA. 1992, 89: 9064-8. 10.1073/pnas.89.19.9064.
Article PubMed Central CAS PubMed Google Scholar
Prather RS, Boquest AC, Day BN: Cell cycle analysis of cultured porcine mammary cells. Cloning. 1999, 1: 17-24. 10.1089/15204559950020067.
Article CAS PubMed Google Scholar
Nygren JM, Bryder D, Jacobsen SE: Prolonged cell cycle transit is a defining and developmentally conserved hemopoietic stem cell property. J Immunol. 2006, 177: 201-8.
Article CAS PubMed Google Scholar
Cam H, Balciunaite E, Blais A, Spektor A, Scarpulla RC, Young R, Kluger Y, Dynlacht BD: A common set of gene regulatory networks links metabolism and growth inhibition. Mol Cell. 2004, 16: 399-411. 10.1016/j.molcel.2004.09.037.
Article CAS PubMed Google Scholar
Harper JV, Brooks G: The mammalian cell cycle: an overview. Methods Mol Biol. 2005, 296: 113-53.
CAS PubMed Google Scholar
Yamamoto T, Ebisuya M, Ashida F, Okamoto K, Yonehara S, Nishida E: Continuous ERK activation downregulates antiproliferative genes throughout G1 phase to allow cell-cycle progression. Curr Biol. 2006, 16: 1171-82. 10.1016/j.cub.2006.04.044.
Article CAS PubMed Google Scholar
Herschkowitz JI, Simin K, Weigman VJ, Mikaelian I, Usary J, Hu Z, Rasmussen KE, Jones LP, Assefnia S, Chandrasekharan S, Backlund MG, Yin Y, Khramtsov AI, Bastein R, Quackenbush J, Glazer RI, Brown PH, Green JE, Kopelovich L, Furth PA, Palazzo JP, Olopade OI, Bernard PS, Churchill GA, Van Dyke T, Perou CM: Identification of conserved gene expression features between murine mammary carcinoma models and human breast tumors. Genome Biol. 2007, 8: R76-10.1186/gb-2007-8-5-r76.
Article PubMed Central PubMed Google Scholar
Felsher DW, Bishop JM: Transient excess of MYC activity can elicit genomic instability and tumorigenesis. Proc Natl Acad Sci USA. 1999, 96: 3940-4. 10.1073/pnas.96.7.3940.
Article PubMed Central CAS PubMed Google Scholar
Ivshina AV, George J, Senko O, Mow B, Putti TC, Smeds J, Lindahl T, Pawitan Y, Hall P, Nordgren H, Wong JE, Liu ET, Bergh J, Kuznetsov VA, Miller LD: Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer. Cancer Res. 2006, 66: 10292-301. 10.1158/0008-5472.CAN-05-4414.
Article CAS PubMed Google Scholar
Langerød A, Zhao H, Borgan O, Nesland JM, Bukholm IR, Ikdahl T, Karesen R, Borresen-Dale AL, Jeffrey SS: TP53 mutation status and gene expression profiles are powerful prognostic markers of breast cancer. Breast Cancer Res. 2007, 9: R30-10.1186/bcr1675.
Article PubMed Central PubMed Google Scholar
Barnes DM, Gillett CE: Cyclin D1 in breast cancer. Breast Cancer Res Treat. 1998, 52: 1-15. 10.1023/A:1006103831990.
Article CAS PubMed Google Scholar
Liu JJ, Chao JR, Jiang MC, Ng SY, Yen JJ, Yang-Yen HF: Ras transformation results in an elevated level of cyclin D1 and acceleration of G1 progression in NIH 3T3 cells. Mol Cell Biol. 1995, 15: 3654-63.
Article PubMed Central CAS PubMed Google Scholar
Wimmel A, Lucibello FC, Sewing A, Adolph S, Muller R: Inducible acceleration of G1 progression through tetracycline-regulated expression of human cyclin E. Oncogene. 1994, 9: 995-7.
CAS PubMed Google Scholar
Karn J, Watson JV, Lowe AD, Green SM, Vedeckis W: Regulation of cell cycle duration by c-myc levels. Oncogene. 1989, 4: 773-87.
CAS PubMed Google Scholar
Sun H, Lesche R, Li DM, Liliental J, Zhang H, Gao J, Gavrilova N, Mueller B, Liu X, Wu H: PTEN modulates cell cycle progression and cell survival by regulating phosphatidylinositol 3,4,5,-trisphosphate and Akt/protein kinase B signaling pathway. Proc Natl Acad Sci USA. 1999, 96: 6199-204. 10.1073/pnas.96.11.6199.
Article PubMed Central CAS PubMed Google Scholar
Vecchione A, Croce CM, Baldassarre G: Fez1/Lzts1 a new mitotic regulator implicated in cancer development. Cell Div. 2007, 2: 24-10.1186/1747-1028-2-24.
Article PubMed Central PubMed Google Scholar
Yabuta N, Okada N, Ito A, Hosomi T, Nishihara S, Sasayama Y, Fujimori A, Okuzaki D, Zhao H, Ikawa M, Okabe M, Nojima H: Lats2 is an essential mitotic regulator required for the coordination of cell division. J Biol Chem. 2007, 282: 19259-71. 10.1074/jbc.M608562200.
Article CAS PubMed Google Scholar
Sladek TL, Jacobberger JW: Simian virus 40 large T-antigen expression decreases the G1 and increases the G2 + M cell cycle phase durations in exponentially growing cells. J Virol. 1992, 66: 1059-65.
PubMed Central CAS PubMed Google Scholar
Lemoine FJ, Marriott SJ: Accelerated G(1) phase progression induced by the human T cell leukemia virus type I (HTLV-I) Tax oncoprotein. J Biol Chem. 2001, 276: 31851-7. 10.1074/jbc.M105195200.
Article CAS PubMed Google Scholar
Williams GH, Stoeber K: Cell cycle markers in clinical oncology. Curr Opin Cell Biol. 2007, 19: 672-9. 10.1016/j.ceb.2007.10.005.
Article CAS PubMed Google Scholar
Shetty A, Loddo M, Fanshawe T, Prevost AT, Sainsbury R, Williams GH, Stoeber K: DNA replication licensing and cell cycle kinetics of normal and neoplastic breast. Br J Cancer. 2005, 93: 1295-300. 10.1038/sj.bjc.6602829.
Article PubMed Central CAS PubMed Google Scholar
Lu P, Nakorchevskiy A, Marcotte EM: Expression deconvolution: a reinterpretation of DNA microarray data reveals dynamic changes in cell populations. Proc Natl Acad Sci USA. 2003, 100: 10370-5. 10.1073/pnas.1832361100.
Article PubMed Central CAS PubMed Google Scholar
Bar-Joseph Z, Siegfried Z, Brandeis M, Brors B, Lu Y, Eils R, Dynlacht BD, Simon I: Genome-wide transcriptional analysis of the human cell cycle identifies genes differentially regulated in normal and cancer cells. Proc Natl Acad Sci USA. 2008, 105: 955-60. 10.1073/pnas.0704723105.
Article PubMed Central CAS PubMed Google Scholar
Ezoe S, Matsumura I, Satoh Y, Tanaka H, Kanakura Y: Cell cycle regulation in hematopoietic stem/progenitor cells. Cell Cycle. 2004, 3: 314-8.
Article CAS PubMed Google Scholar
Zhou T, Chou JW, Simpson DA, Zhou Y, Mullen TE, Medeiros M, Bushel PR, Paules RS, Yang X, Hurban P, Lobenhofer EK, Kaufmann WK: Profiles of global gene expression in ionizing-radiation-damaged human diploid fibroblasts reveal synchronization behind the G1 checkpoint in a G0-like state of quiescence. Environ Health Perspect. 2006, 114: 553-9.
Article PubMed Central CAS PubMed Google Scholar
Saldanha AJ: Java Treeview – extensible visualization of microarray data. Bioinformatics. 2004, 20: 3246-8. 10.1093/bioinformatics/bth349.
Article CAS PubMed Google Scholar
Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, Dicuccio M, Edgar R, Federhen S, Feolo M, Geer LY, Helmberg W, Kapustin Y, Khovayko O, Landsman D, Lipman DJ, Madden TL, Maglott DR, Miller V, Ostell J, Pruitt KD, Schuler GD, Shumway M, Sequeira E, Sherry ST, Sirotkin K, Souvorov A, Starchenko G, Tatusov RL, Tatusova TA, Wagner L, Yaschenko E: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2008, 36: D13-21. 10.1093/nar/gkm1000.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

We thank D. Schmitt, F. Ford, K. Takahashi, H. Ohmori, M. Haramura, M. Ashihara and M. Aoki of Chugai Pharmaceuticals for their helpful discussions and checking of the manuscript.

Author information

Authors and Affiliations

Kamakura Research Laboratories, Chugai Pharmaceutical Co Ltd, Kamakura, Kanagawa, Japan
Hideaki Mizuno, Yoshito Nakanishi, Nobuya Ishii & Kunio Kitada
Department of Biosciences and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka, Japan
Hideaki Mizuno & Akinori Sarai

Authors

Hideaki Mizuno
View author publications
You can also search for this author in PubMed Google Scholar
Yoshito Nakanishi
View author publications
You can also search for this author in PubMed Google Scholar
Nobuya Ishii
View author publications
You can also search for this author in PubMed Google Scholar
Akinori Sarai
View author publications
You can also search for this author in PubMed Google Scholar
Kunio Kitada
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hideaki Mizuno.

Additional information

Authors' contributions

HM and KK designed the research. HM and YN performed the research. HM, NI, AS and KK participated in writing the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

12864_2008_2021_MOESM1_ESM.xls

Additional file 1: The gene list for cell cycle signatures. The CCS genes and assigned CCS subset IDs are listed. (XLS 50 KB)

12864_2008_2021_MOESM2_ESM.ppt

Additional file 2: Validation of CCS method in the Whitfiled et al . cell cycle dataset. CCS scores were calculated for the total (upper panel) and the cycling (lower panel) gene dataset. The purple bars above the columns indicate Whitfield et al.'s estimations of the S phase. (PPT 75 KB)

12864_2008_2021_MOESM3_ESM.ppt

Additional file 3: Analysis of the Yamamoto et al. dataset. Serum starved NIH3T3 cells were stimulated with FGF to re-enter the cell cycle. Profiles of unstimulated cells (FGF 0 h) and FGF-stimulated cells (FGF 3–12 h) were analyzed. (PPT 56 KB)

12864_2008_2021_MOESM4_ESM.ppt

Additional file 4: CCS score plots for the WAP-Myc model. Same as for Fig. 3B. (PPT 59 KB)

12864_2008_2021_MOESM5_ESM.ppt

Additional file 5: Analysis of the Langerød et al . breast cancer dataset. (A), (B) and (C) are the same as in Fig. 4. (PPT 107 KB)

12864_2008_2021_MOESM6_ESM.ppt

Additional file 6: Power spectrum of the 51 cell cycle-regulated genes. The Hela S3 cell cycle dataset was processed as described in Methods. Fourier transformation magnitudes for the known 51 cell cycle-regulated genes for each periodicity were averaged and plotted. (PPT 48 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Mizuno, H., Nakanishi, Y., Ishii, N. et al. A signature-based method for indexing cell cycle phase distribution from microarray profiles. BMC Genomics 10, 137 (2009). https://doi.org/10.1186/1471-2164-10-137

Download citation

Received: 21 October 2008
Accepted: 30 March 2009
Published: 30 March 2009
DOI: https://doi.org/10.1186/1471-2164-10-137

A signature-based method for indexing cell cycle phase distribution from microarray profiles

Abstract

Background

Results

Conclusion

Similar content being viewed by others

Background

Results

Algorithm

Validation

Analysis on mouse tumor model dataset

Analysis on human breast cancer datasets

Discussion and conclusion

Methods

Cell Culture and Synchronization

Microarray

Signature development

Signature scoring and data visualization

Survival analysis

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation