Introduction

Osteosarcoma is a high-grade primary bone malignancy most commonly seen in children and young adults [18, 19]. The survival of patients with osteosarcoma has not improved during the past three decades since the advent of adjuvant chemotherapy, despite multiple clinical trials of intensified regimens or newer agents. Better understanding of the disease is needed. Although the etiology of osteosarcoma is not well understood [8], several factors suggest a correlation between skeletal development and initiation of osteosarcoma. First, the peak incidence of osteosarcoma coincides with a period of rapid bone growth. In addition, an earlier peak age in girls corresponds to the earlier age of their growth spurt, also suggesting a relationship with skeletal development. Moreover, most osteosarcomas occur near the major growing joints such as the distal femur, proximal tibia, and proximal humerus, which contribute to the majority of longitudinal bone growth. Other aspects of pathogenesis may be inferred from the genetic predisposition syndromes such as Li-Fraumeni syndrome (TP53 mutation; patients with this condition are at increased risk multiple cancers) and hereditary bilateral retinoblastoma (Rb mutation) [10, 11]. It appears that loss of function of tumor suppressor genes played a role in the tumorigenesis in osteosarcoma.

Bone growth is spatially and temporally controlled by systemic endocrine and local growth factors [38]. In response to growth hormone, the multipotent mesenchymal stem cells (MSCs) are recruited to the growth plate and differentiate into proliferating chondroblasts. Chondroblasts undergo hypertrophy and produce cartilage matrix underneath the growth plate. Bone marrow-associated MSCs differentiate into osteoblasts and replace cartilage with osteoid in concert with hematopoietic-derived osteoclasts. Osteoid then is mineralized with calcium and phosphate to form bone [4]. The majority of terminal osteoblasts undergo programmed cell death (apoptosis) with a few becoming osteocytes incorporated in the Haversian system. The transforming growth factor-β (TGF-β) superfamily members, especially the bone morphogenetic proteins (BMPs), are important regulators in this process, as is well documented in animal models and in vitro studies [4, 5]. The biologic action mediated by TGF-β signaling is tightly regulated at multiple levels [29]. However, the molecular basis for the relationship between bone growth and occurrence of osteosarcoma is not well understood.

Bone growth is promoted by insulin-like growth factors (IGF) systematically and locally [33]. IGF-1, the most abundant growth factor in bone, is the mediator of anabolic effects of growth hormone. It mostly is synthesized in the liver and also is secreted by MSCs and osteoblasts in an autocrine fashion. It also stimulates proliferation. In animal models, IGF level has been correlated with size and weight at birth [2, 17] and bone growth [36]. The IGF-signaling axis also includes six high-affinity IGF binding proteins (IGFBPs) and five low-affinity IGFBP-related proteins (IGFBPrPs). IGF signaling is tempered by its binding proteins in a negative feedback loop. It is among the few peptide hormones known to have binding proteins, which is usually characteristic of lipid-soluble hormones. Moreover, IGF-1, along with IGFBPs, reaches a peak level at puberty in response to growth hormone and decreases with age, in parallel with the incidence of osteosarcoma [28]. The mitogenic IGF-1 was reported to be involved in tumorigenesis in several tumor models [16]. In osteosarcoma, IGF-1 levels are associated with metastatic behavior of tumors in animal models [25]. The relationship between IGF signaling and osteosarcoma pathogenesis, however, is not entirely clear.

To better understand the molecular basis of osteosarcoma, we used a comprehensive approach to compare the transcriptional profile of primary tumor cells and cell lines with their normal cellular counterparts. In this study, serial analysis of gene expression (SAGE) was used to directly measure the transcriptome without any prior selection of genes for inclusion [34, 39]. Moreover, the absolute quantity of each transcript obtained for each sample with this approach allows a straightforward comparison to public databases to facilitate the exchange of information regarding rare diseases such as osteosarcoma [13]. We asked the following questions: (1) Can we identify gene expression in osteosarcoma specimens that differs from normal osteoblasts and MSCs; and (2) will this provide clues to the molecular pathogenesis of osteosarcoma?

Materials and Methods

Experimental Design

To compare the gene expression profile of osteosarcoma and its normal counterparts, we constructed SAGE libraries from two osteosarcoma biopsy specimens, two cell lines, and two xenografts derived from patient specimens–one from normal osteoblasts and one from MSCs (Fig. 1).

Fig. 1
figure 1

A schematic for SAGE library construction is shown.

Osteosarcoma Specimens

The osteosarcoma standard cell line SaOS-2 was purchased from American Type Culture Collection (ATCC, Rockville, MD, USA). In SaOS-2, both alleles of p53 are deleted. All other specimens were derived from patients with osteosarcoma who were treated at Memorial Sloan-Kettering Cancer Center. Specimen OS256 was obtained by biopsy from a 14-year-old female patient with osteosarcoma of the proximal tibia. Specimen OS259 was obtained by biopsy from a 13-year-old male patient with osteosarcoma of the distal femur. The osteosarcoma cell line, OS160CL, was derived from a lung metastatic lesion resected from an 11-year-old female patient with osteosarcoma of the proximal femur using described methods [27]. The OS63CL was derived from the primary surgical specimen resected from a 31-year-old male patient with osteosarcoma of the proximal humerus. Histologic diagnosis was confirmed by a pathologist (HD, AH). Cells were maintained as a monolayer in modified Eagle’s-α media (α –MEM) supplemented with 10% fetal calf serum (Life Technologies, Bethesda, MD, USA), 100 units/mL penicillin, and 3 mg/mL streptomycin at 37°C in a humidified atmosphere with 5% CO2. Human MSCs (provided by Pamela Robey PhD, National Institutes of Health, Bethesda, MD, USA) were further induced toward osteoblastic differentiation with 50 μmol/L ascorbic acid and 10 mmol/L of β-glycerolphosphate for 2 weeks [27] before total RNA was extracted. The osteoblastic differentiation was confirmed by detection of osteoblast-specific markers including alkaline phosphatase, osteocalcin, and collagen type Iα by reverse transcription–polymerase chain reaction (RT-PCR) as described previously [26]. All the cultured cells were checked monthly and shown to be free of mycoplasma contamination. The MSCs, MSC89R, were selected for SAGE analysis based on their high colony-forming efficiency and multilineage differentiation potential [7]. Briefly, the mononuclear cell fraction was recovered from a 10-mL aspirate of the iliac crest obtained from a healthy donor using a Ficoll (Ficoll-PaqueTM; Pharmacia, Piscataway, NJ, USA) gradient, and then cultured in α-MEM supplemented with 100 μg/mL penicillin, 100 μg/mL streptomycin, 2 mmol/L L-glutamine, and 20% fetal calf serum (Atlanta Biologicals, Lawrenceville, GA, USA) lot selected for rapid growth of the cells. After 24 hours the nonadherent cells were removed and the adherent layer cultured until it reached 50% to 70% confluence. Cells subsequently were passed twice and then collected for SAGE analysis or alternatively plated in 100-mm dishes at 10 cell/cm2, cultured for 10 to 14 days, and a single cell-derived colony (clone forming-unit fibroblast 35 [CFUF35]) was isolated using cloning cylinders for analysis by MicroSAGE [32].

Osteosarcoma Xenografts

Tumor cells from patient-derived cell lines OS160CL and OS63CL were used to establish xenografts in mice as previously described according to a protocol approved by the Memorial Hospital Institutional Animal Care and Use Committee [37]. Xenografts were resected when the size was approximately 1.5 cm in diameter. Specimens were histologically confirmed to be osteosarcoma by a pathologist (HD, AH) using standard hematoxylin and eosin staining and were named M160xeno and M63xeno corresponding numerically with their respective parental cell lines.

Construction of SAGE Libraries

Total RNA was extracted with an RNeasy® mini Kit (Qiagen, Valencia, CA, USA) according to the manufacturer’s instructions. SAGE libraries of normal human osteoblasts, two primary osteosarcoma tumor specimens, two osteosarcoma xenografts (M160xeno, M63xeno), and two cultured osteosarcoma cell lines (SaOS-2, OS160CL) were constructed with an iSAGETM Kit from InvitrogenTM (Carlsbad, CA, USA) according to the manufacturer’s instructions. Sequencing reactions were performed using a BigDye® Terminator V3.1 Cycle Sequencing Kit (Applied Biosystems®, Foster City, CA, USA), and data were generated with an ABI 3100 sequencer (Applied Biosystems®). To yield tags, the raw data were processed with the SAGE2000 Version 4.5 software (Invitrogen) provided by the kit manufacturer. The SAGE library of mesenchymal stem cells, MSC89R, using the same methodology as described by Tremain et al. [32]. The SAGEmap database, downloaded from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/SAGE), was used as a reference for tag UniGene identification [13]. Two additional SAGE libraries were included for analysis, including a single cell-derived MSC SAGE library named CFUF35 using micro-SAGE methodology with 16,407 tags in total [32] and a MSC SAGE library using the standard SAGE methodology with 202,962 tags in total [30].

Quantitative Real-time RT-PCR

Real-time fluorescent quantitative RT-PCR was used to validate a fraction of genes identified by SAGE to be differentially expressed using predesigned primers and probes (Applied Biosystems®) including inhibin β A, Assay ID: Hs00170103_m1; Smad4, Hs00232068_m1; CTGF, Hs00170014_m1; IGFBP7 (Mac25), Hs00266026_m1; p63, Hs00186613_m1; IGFBP6, Hs00181853_m1; IGFBP4, Hs00181767_m1; IGFBP3, Hs00181211_m1. The housekeeping gene glyceraldehyde 3-phosphate dehydrogenase (GAPDH) was used as an endogenous control for template loading. Thermal cycling was performed with a 7500-fast real-time PCR system (Applied Biosystems®) using a 2 × TaqMan® Fast Universal PCR Master Mix provided by the manufacturer [37]. The relative quantity was calculated based on standard 2−ΔT method as described previously [37]. In total, eight SAGE libraries were constructed, including six of osteosarcomas, one of MSCs (MSC89R), and one of osteoblasts. At least 30,000 tags were obtained for each library to ensure sufficient coverage for the majority of the transcripts. The osteoblast SAGE library was constructed after 2 weeks of osteoblastic induction of the MSCs. Osteoblastic differentiation was observed by increased mRNA expression of alkaline phosphatase and type Iα collagen by RT-PCR (data not shown). It was further confirmed by a steep increase in the number of tags representing collagen type Iα in the SAGE library of osteoblasts and by lack of cartilage markers such as type II and type X collagen, which were present in the SAGE library of MSC89R. A clustering analysis was performed using BRB-Array Tools Version 3.6 (Biometrics Research Branch, Division of Cancer Treatment & Diagnosis, National Cancer Institute, Bethesda, MD, USA), including eight SAGE libraries established in this study and two additional libraries of MSCs [30, 32] (Fig. 2). Considering the wide range of variation in the six osteosarcomas profiled in this study, a two-step enrichment was adapted in the statistical analyses to identify the common transcriptional alterations in all osteosarcomas and to filter out factors contributed by the individual samples possibly attributable to the growth environment (Fig. 3). A software package, the FosterTool, was designed and developed by one of us (WZ) to facilitate the statistical analyses of the data obtained by SAGE and is available for open access at http://fosterfoundation.com. In the first step, each SAGE library of osteosarcoma was pairwise compared with that of MSC89R and of osteoblast, respectively. In the second step, only gene tags significantly differentially expressed in every osteosarcoma versus osteoblast and MSC89R simultaneously were pooled as true characteristic genes of osteosarcoma. Gene tags were grouped according to their functions as described in the database (http://www.ncbi.nlm.nih.gov).

Fig. 2
figure 2

Eight SAGE libraries obtained in this study, along with CFUF35, a single cell-derived MSC SAGE library established using MicroSAGE technique [32] (dashed boxes) and another human MSC SAGE library (MSCZago) using standard methodology (dashed boxes) [30] were normalized to tags/50,000 in total and were analyzed using BRB-Array Tools Version 3.6. A dendrogram was drawn to reveal the relationship between the SAGE libraries. The total numbers of tags obtained for each of the libraries before normalization are shown.

Fig. 3
figure 3

A two-step enrichment was adapted in the statistical analyses to identify the common transcriptional alteration in all osteosarcomas and to filter out factors contributed by the individual samples possibly owing to the microenvironment. In Step 1, each osteosarcoma SAGE library was compared with that of osteoblasts and MSC89R independently. Genes (tags), which were differentially present between tumor and normal, are pooled together. A second step identifies genes, which are commonly downregulated in osteosarcoma compared with normal osteoblasts (OB) and MSC89R.

Statistical Analyses

All statistical analyses were performed by statisticians (YZ, WZ) at the Department of Applied Mathematics and Statistics (State University of New York at Stony Brook, Stony Brook, NY, USA). A computer program was developed (NN) at the Department of Computer Science (State University of New York at Stony Brook) to facilitate the parallel comparison between the SAGE libraries. All statistical analyses were two-tailed and a probability less than 0.05 was regarded as statistically significant. The clustering analysis was performed using BRB-Array tools Version 3.6.

Results

Four categories of genes accounted for the dominant majority of the genes that were downregulated in osteosarcoma SAGE libraries (92%; 60 of 65) compared with their normal counterparts. The first group of seven genes identified by SAGE includes those involved in the IGF signaling (Table 1). Three IGFBPs and three IGFBPrPs, seen abundantly in MSCs and osteoblasts, were minimally present in any of the six osteosarcoma SAGE libraries despite IGF-1 levels that were similar in MSC89R (129/50,000 tags), osteoblasts (66/50,000 tags), and osteosarcomas (mean, 67/50,000 tags). The differential expression was validated by TaqMan® real-time RT-PCR in a subset of the genes (Fig. 4), including IGFBP3, IGFBP4, IGFBP6, IGFBPrP1 (IGFBP7), and IGFBPrP3. The fold difference (Fig. 4) measured by SAGE (normalized to tags per 50,000 total tags in each library) and detected by real-time RT-PCR in these eight genes was highly comparable (r2 = 0.8813).The second group of genes found to be downregulated in osteosarcoma includes eight genes involved in TGF- β signaling (Table 1). The identified genes are involved in regulating TGF-β signaling at various levels, including ligand (inhibin β), ligand release (TGF-β latent binding protein), ligand activation (thrombospondin 1), receptor internalization, and signal transduction (caveolin 1). Alterations also are present in extracellular antagonists (follistatin-like 1, Gremlin1, and CSRP1), including members of the IGFBPrPs (CTGF, IGFBP7, and CYR61) in the IGF-1 axis, which are important regulators of TGF-β signaling [1, 5, 14]. The third group tags identified to be downregulated in osteosarcoma encompass a group of genes largely regulated by TGF-β signaling (Table 2). These include genes involved in the extracellular matrix (ECM) formation (Type IV collagens, Type VI collagen, vimentin), ECM remodeling (SERPINs, TIMPs, LOX), and many constituents of the cytoskeleton [15].

Table 1 The differentially expressed SAGE tags*
Fig. 4
figure 4

The quantities of eight gene tags measured directly by SAGE are shown in the x-axis after normalization (tags/50,000 tags). The fold difference of mRNA expressions as measured by TaqMan® real-time RT-PCR is shown in the y-axis using predesigned primers and probes (Applied Biosystems). The genes were validated included inhibin β A, CTGF, IGFBP7 (IGFBPrP1), p63, IGFBP6, IGFBP4, and IGFBP3. The correlation coefficient between these two methods is calculated as r2 = 0.8813.

Table 2 Differentially expressed SAGE tags* involved in ECM and cytoskeleton in osteosarcoma as compared with its normal counterparts

Six genes involved in cell cycle regulation and apoptosis were downregulated in osteosarcoma (Table 3), including p63 and p21, both of which also potentially are involved in TGF-β signaling [21]. Additional genes (tags) with unknown functions that also were downregulated are shown (Table 3). In contrast to numerous genes downregulated in osteosarcoma, there was no gene identified as being consistently overexpressed in osteosarcomas at the high stringency statistical criteria applied in this analysis.

Table 3 The differentially expressed SAGE tags* involved in cell cycle regulation and others in osteosarcoma compared with its normal counterparts

Discussion

Osteosarcoma is the most common primary bone tumor in children and young adults. Unfortunately, the prognosis of the patients with this disease has not improved during the past three decades, therefore better understanding of this disease is needed. In theory, osteosarcoma could arise from osteoblasts or the more primitive MSCs [22]. Our study provides a possible molecular basis of the disease by comparing the transcriptional profile of osteosarcoma with that of its potential normal counterparts, osteoblasts and MSCs.

There are limitations to our study. We used libraries from primary materials, cell lines, and xenografts. Although a primary tumor specimen from a biopsy may be the optimal material, these are available only in small amounts because of clinical standards of care and prioritization of tissue for diagnostic purposes. Patient specimens obtained after preoperative chemotherapy usually contain at least partly necrotic tissue. Cultured cell lines and xenograft tissues are more reproducible and can be expanded, but transcriptome differences from the primary material are likely [23]. We therefore used a more stringent criterion for statistical analysis, only genes that were commonly deregulated in all osteosarcoma SAGE libraries were selected. No genes were identified to be consistently overexpressed in osteosarcoma across all the specimens included in this study. This might reflect the variance of the materials we used, including biopsy specimens, cell lines, and xenografts. This also might represent the extreme genetic instability and complexity of the genome in this disease [24].

Our data suggest that dysregulation of the IGF signaling axis might represent an essential step in the tumorigenesis of osteosarcoma. IGF-1 is the mediator of the anabolic effects of growth hormone during rapid bone growth, and IGF-1 levels peak during puberty [28]. IGF-1 level is associated with body weight and bone growth in animal models [2, 17, 36]. Sutter et al. [31] reported that a single IGF-1 allele was a major determinant of the size of dogs. Canine osteosarcoma is most commonly seen in large breeds [20]. The IGFBPs and IGFBPrPs generally are believed to be inhibitory to the mitogenic and antiapoptotic effects of IGF-1, and their expression usually is stimulated by IGF-1 [9]. It appears that this autoregulatory mechanism is disrupted in osteosarcoma. Simultaneous loss of all the IGFBPs and IGFBPrPs in osteosarcoma compared with normal bone cells is striking. This was similarly found in a study using comparable methods [12]. Cells may obtain self-sufficiency by maintaining IGF-1 activity through downregulation of the IGFBPs. This might provide new targets for therapy for this disease [12].

In our study, disruption of TGF-β in osteosarcoma is suggested downregulation of genes of this superfamily at multiple levels (Table 1) and their target genes (Table 2). Even genes such as vimentin, which is widely regarded as a marker for a mesenchymal cellular origin by pathologists, was identified to be downregulated in osteosarcoma in this study. The TGF-β superfamily is involved in virtually every aspect of cellular activity. Osteoblast differentiation is regulated by the TGF-β signaling pathway, including the BMPs [5]. Mesenchymal cells are distinct from epithelial cells in their mechanisms of cell-cell communication. Instead of direct cell-cell contact through various junctions in epithelial cells, mesenchymal cells exist in a large amount of ECM mostly produced by themselves. The ECM and the cell-matrix interactions are important for cell proliferation, survival, and migration [6]. Loss of TGF-β signaling is a hallmark of tumorigenesis in many cancers of epithelial origin [29]. Further study on TGF-β signaling in this disease may shed light on the understanding of the disease.

Some genes consistently downregulated in osteosarcoma, however, were found to be upregulated during the osteoblastic induction from MSCs, including the IGFBPs and genes in the TGF-β/BMP signaling cascade, in agreement with previous studies [1, 2, 5, 9, 14, 28]. This suggests that osteosarcoma perhaps is more primitive than MSCs, the progenitor of osteoblasts, or may suggest a trend of dedifferentiation in osteosarcoma. The question of cell of origin of osteosarcoma is still debated [3, 22, 35]. Our study provides comprehensive evidence that links osteosarcoma with skeletal development. The dysregulated genes identified in osteosarcoma, such as components in IGF and TGF-β/BMP signaling, are major players regulating normal bone growth as suggested in an animal model [35]. These findings suggest that future therapeutic strategies might be directed toward promoting cell differentiation instead of (or in addition to) more-conventional chemotherapeutic approaches. Mutations on p53 and Rb genes were found in osteosarcoma. We did not find the gene expression levels to be different among the osteosarcoma specimens versus normal tissues in this study. This suggests the loss of function of the tumor suppressors is probably the mechanism, instead of expression. Six genes involved in cell cycle regulation and apoptosis were downregulated in osteosarcoma (Table 3), including p63 and p21. Further study is needed to clarify the role of these genes in osteosarcoma.

For the first time, a highly clustered transcriptional profile, which is well preserved in osteosarcomas despite being derived from different materials grown in various conditions, was revealed by SAGE. This well-coordinated expression pattern suggests that profound alterations in the signaling axes of IGF-1 and TGF-β, in concert with cell cycle regulators, may be involved in the pathogenesis of osteosarcoma (Fig. 5). This provides a basis for further investigation to better understand the disease and to identify these pathways as potential new therapeutic targets. This study suggests the therapeutic value of inhibiting the IGF-1 and TGF pathways. Direct inhibitors of the IGF-1 signaling pathway exist in the form of IGF-1R antibodies. Although directly targeting the TGF pathway is not possible, R-spondin is one example of many drugs that target the TGF pathway and could be tested in patients with osteosarcoma. Preclinical testing of inhibitors of the IGF-1 and TGF pathways seems warranted, and with additional promising preclinical data, trials of these inhibitors in patients with osteosarcoma would be warranted.

Fig. 5
figure 5

During osteogenesis, MSCs are activated by the growth hormone (GH)-IGF axis to enter a proliferation phase and proceed to osteoblastic differentiation regulated by the TGF-β/BMPs. After matrix production, the majority of mature osteoblasts undergo apoptosis with only a few becoming osteocytes. Thus, sequential molecular events may contribute to the pathogenesis of osteosarcoma: (1) downregulation of IGFBPs free IGF-1 to stimulate unopposed cell proliferation; (2) TGF-β/BMPs signaling disrupts differentiation; and (3) alteration of apoptosis and cell cycle regulation.