In this issue of Annals of Surgical Oncology, Lim et al. 1 report on prognostic gene expression signatures from tumor and nontumor liver tissues from hepatocellular carcinoma (HCC) patients who underwent surgical resection at a single institution in Korea. The signatures were developed from a set of patients with relatively early stage diseases, and then evaluated in an independent validation cohort of patients with more advanced stage diseases. The authors report that their gene signature derived from the tumor tissues and not their gene signature from the nontumor tissue accurately predicted disease-free survival in the validation cohort. Their findings are consistent with previously published data suggesting that tumor-derived molecular information is more prognostic in advanced stage HCC, whereas nontumor liver-derived molecular information is more prognostic in earlier stage HCC.24

Prognostic gene signatures using genome-wide gene expression microarrays have been explored in many types of cancer with an eye toward improving prognostic models, guiding treatment recommendations, and identifying molecular pathways to target for therapy. In the case of HCC, dozens of prognostic gene signatures have been reported.5 HCC differs from most cancers in that there are two biological processes to simultaneously consider. HCC commonly arises within a liver compromised by cirrhosis, thereby requiring effective management of two disease processes. Thus, death from disease progression may arise from distinctly different causes—recurrence and/or dissemination of the HCC, de novo new HCC arising in the cirrhotic liver, and complications of progressive cirrhosis including hepatic decompensation. Patients with advanced stage HCC are more likely to die due to progression of the primary HCC tumor, whereas patients with earlier stage diseases are more likely to die due to additional primary HCC tumors (e.g., clonally distinct from the first HCC) or complication of cirrhosis.2,3 Nonetheless, in many cases, the precise causes are difficult to accurately determine. Accordingly, development of prognostic gene signatures for HCC requires careful consideration and delineation of the patient population as well as careful consideration of specific end points for training the signature.

Of note, Lim et al. chose disease-free survival for the end point on which to train their prognostic gene signatures. This end point combines all three processes noted above—recurrence and/or dissemination of HCC, de novo new HCC arising in the cirrhotic liver, and complications of progressive cirrhosis. Thus, rather than capturing a distinct biological process, this signature likely captures a blend of biological processes. And because the nature of the blend is highly dependent on the patient population, it is difficult to determine whether this signature can be accurately applied to other patient populations. In this sense, the use of more specific clinical outcomes, e.g., time to recurrence and time to HCC-specific death, to train the signature would provide for a more straightforward interpretation of the results as well as determination of their applicability to other patient populations.

Rapidly growing public genomics databases such as NCBI’s Gene Expression Omnibus (www.ncbi.nlm.nih.gov/geo) have enabled in silico cross-cohort validation of prognostic gene signatures.6 The authors are encouraged to deposit their genomic data set to such databases, as systematic evaluation of previously reported gene signatures on the authors’ data sets will help determine whether there is any unique advantage of their signature.6 In addition, validation of the authors’ gene signature in other publicly reported patient cohorts will provide information on the potential clinical utility of the signature, given that the signature was derived from a single-assay platform and single-institution patient cohort.

The methodology of signature gene selection is another important component to ensure robust predictive/prognostic performance of a signature. In this study, Lim et al. used a parsimonious method to choose genes independently associated with disease-free survival. With this approach, one would expect that each gene in the signature represents activation status of a distinct molecular pathway involved in the disease. This is theoretically an effective strategy to capture maximum information with minimum number of measurements. However, this could be a limitation in analyzing real-world clinical specimens, in which gene expression measurements are less reproducible at single gene level, thereby increasing the likelihood of missing in the signature information from key pathways. This issue is particularly relevant given the recent trend toward use of partially degraded RNA isolated from archived formalin-fixed tissues for expression profiling.7,8 One possible solution to the problem of missing important genes in the signature may be to include multiple genes to more robustly capture the status of each molecular pathway, even when the signals from some genes are missed as a result of, e.g., sample quality issues.9

One of the major challenges in genome-wide gene expression microarrays has been experimental variation that results from amplification of cDNA by polymerase chain reaction or similar techniques. Recent development of highly sensitive signal detection technologies has enabled digital counting of transcripts without amplification, which may have a potential to overcome this limitation.10,11 When this progress in technology is combined with the research community’s movement toward public sharing of data sets for cross-cohort validation, the future for prognostic gene signatures in all cancers—including HCC—is bright. These gene signatures are expected to provide a better understanding of the prognosis of patients with cirrhosis and HCC. More importantly, it is a reasonable expectation that these signatures will divulge important clues on the molecular pathways that are critically important for these diseases (both cirrhosis and HCC) and for which effective targeting will lead to effective therapies.