Abstract
This chapter describes the Cancer Biomedical Informatics Grid (caBIG®) project, a collaboration among the National Cancer Institute (NCI) and numerous discovery organizations, mostly NCI-designated cancer centers. The rationale for the caBIG® project opens the review, followed by description and analysis of the pilot phase (2004–2007). Changes in caBIG® as it transitions to an enterprise phase are evaluated. The chapter concludes with a review of important papers that either describe caBIG® components or are based on the initiative.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amin W, Parwani AV, Schmandt L, Mohanty SK, Farhar G, Pople AK, Winters SB, Whelan NB, Schneider AM, Milnes JT, Valdivieso A, Feldman M, Pass HI, Dhir R, Melamed J, Becich MJ (2008) National Mesothelioma Virtual Bank: a standard based biospecimen and clinical data resource to enhance translational research. BMC Cancer 8:236
Beck JR, Bondy JC (2007) The cancer biomedical informatics grid (caBIG): infrastructure and applications for a worldwide research community. Stud Health Technol Inform 129:330–334
Buetow KH (2005) Cyberinfrastructure: empowering a “third way” in biomedical research. Science 308:821–824
Buetow KH (2009) An infrastructure for interconnecting research institutions. Drug Discov Today 14:605–610
Channin DS, Mongkolwat P, Kleper V, Sepukar K, Rubin DL (2009) The caBIG annotation and image markup project. J Digital Imag (in press)
Cimino JJ, Hayamizu TF, Bodenreider O, Davis B, Stafford GA, Ringwald M (2009) The caBIG terminology review process. J Biomed Inform. doi:10.1016/j.jbi.2008.12.003
Freimuth RR, Schauer MW, Lodha P, Govindrao P, Nagarajan R, Chute CG (2008) caBIG compatibility review system: software to support the evaluation of applications using defined interoperability criteria. AMIA Annu Symp Proc 197–201
Fridsma DB, Evans J, Hastak S, Mead CN (2008) The BRIDG Project: A technical report. J Am Med Inform Assoc 15:130–137
Gurcan MN, Pan T, Sharma A, Kurc T, Oster S, Langella S, Hastings S, Siddiqui KM, Siegel EL, Saltz J (2007) GridIMAGE: a novel use of grid computing to support interactive human and computer-assisted detection decision support. J Digit Imaging 20:160–171
Hanash S (2004) Integrated global profiling of cancer. Nat Rev Cancer 4:638–643
Huang T, Shenoy PJ, Sinha R, Graiser M, Bumpers KW, Flowers CR (2009) Development of the Lymphoma Enterprise Architecture Database: a caBIG silver level compliant system. Cancer Inform 8:45–64
Kaiser J (2004) Von Eschenbach revises the NCI agenda. Science 303:1952
Kunz I, Lin M-C, Frey L (2009) Metadata mapping and reuse in caBIG. BMC Bioinformatics 10(S2):S4
Langella S, Oster S, Hastings S, Siebenlist F, Phillips J, Ervin D, Permar J, Kurc T, Saltz J (2007) The cancer biomedical informatics grid (caBIG) security infrastructure. AMIA Annu Symp Proc 433–437
Madhaven S, Zenklusen JC, Kotliarov Y, Sahni H, Fine HA, Buetow K (2009) Rembrandt: helping personalized medicine become a reality through integrative translational research. Mol Cancer Res 7:157–167
Manion FJ, Robbins RJ, Weems WA, Crowley RS (2009) Security and privacy requirements for a multi-institutional cancer research data grid: an interview-based study. BMC Med Inform Decis Mak 9:31
McConnell P, Dash RC, Chilukuri R, Pietrobon R, Johnson K, Annechiarico R, Cuticchia AJ (2008) The cancer translational research informatics platform. BMC Med Inform Decis Mak 8:60
National Cancer Institute. The caBIG Pilot Phase: Report: 2003-2007. https://cabig.nci.nih.gov/overview/pilotreport/. Accessed 3 Jan 2009
Phillips J, Chilukuri R, Fragoso G, Warzel D, Covitz PA (2006) The caCORE software development kit: streamlining construction of biomedical information services. BMC Biomed Inform Decis Mak 6:2
Prior FW, Erickson BJ, Tarbox L (2007) Open source software projects of the caBIG in vivo imaging workspace software special interest group. J Digit Imaging 20(S1):94–100
Saltz J, Oster S, Hastings S, Langella S, Kurc T, Sanchez W, Kher M, Manisundaram A, Shanbhag K, Covitz P (2006) caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid. Bioinformatics 22:1910–1916
Shironoshita EP, Jean-Mary YR, Bradley RM, Kabuka MR (2008) semCDI: a query formulation for semantic data integration in caBIG. J Am Med Inform Assoc 15:559–568
Tchuvatkina O, Shimoni L, Ochs MF, Moloshok T (2006) Proteomics LIMS: a caBIG project, year 1. AMIA Annu Symp Proc 1116
Tobias J, Chilukuri R, Komatsoulis GA, Mohanty S, Sioutos N, Warzel DB, Wright LW, Crowley RS (2006) The CAP cancer protocols – a case study of caCORE based data standards implementation to integrate with the cancer biomedical informatics grid. BMC Med Inform Decis Mak 6:25
Zhu Y, Wang Z, Feng Y, Xuan J, Miller DJ, Hoffman EP, Wang Y (2006) Phenotypic-specific gene module discovery using a diagnostic tree and caBIG VISDA. Conf Proc IEEE Eng Med Biol Soc 1:5767–5770
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: caBIG‚ Tools and Technologies at the End of the Pilot Phase
Appendix: caBIG‚ Tools and Technologies at the End of the Pilot Phase
1.1 Clinical Trials Management
-
Cancer Central Clinical Database (C3D): a clinical trials data management system based on Oracle Clinical, including protocol building, remote data capture, and review.
-
Cancer Central Clinical Participant Registry (C3PR): a web-based application for patient trial registry.
-
Clinical Data Exchange/Lab Integration Hub (caXchange): an open-source software tool used to manage laboratory data during a clinical trial.
-
Clinical Trials Object Data System (CTODS): a virtual data warehouse for clinical trials data that can capture trial information and deliver it de-identified.
-
Clinical Data System (CDS): a stand-alone data submission infrastructure for NCI-supported clinical trials.
-
Patient Study Calendar (PSC): an open-source application that can create and edit study calendar templates and manage them during a study.
-
Cancer Adverse Event Reporting System (caAERS): an open-source software tool that is used to collect process and report adverse events that occur during clinical trials.
-
caTRIP: a meta-system that allows users to query across a number of caBIG data services, based on Common Data Elements (CDEs) and integrate results for viewing and further analysis.
-
Federal Investigator Registry of Biomedical Information Research Data (FIREBIRD): a system that automates the Form 1572 registration process for clinical trials investigators.
1.2 Biospecimen Banking
-
caTissue Core: a tissue bank repository tool for biospecimen inventory, tracking, and basic annotation.
-
Cancer Text Information Extraction System (caTIES): a system to extract coded cancer information from free-text surgical pathology reports.
-
caTissue Clinical Annotation Engine (CAE): a web-based user interface for standards-based manual annotation of biospecimens.
1.3 Image Analysis
-
National Cancer Imaging Archive (NCIA): a searchable national repository of cancer images, integrated with clinical and genomic data.
1.4 Data Mapping
-
caAdapter: an open-source tool set that facilitates data mapping, validation, and transformation among various data sources and standard formats.
1.5 General Research Tools
-
Electronic Laboratory Management Information Resource (caELMIR): a Laboratory Information Management System (LIMS) for recording experimental data.
-
geWorkbench: an open-source software platform for genomic data integration.
-
GenePattern: another software platform for genomic data integration.
-
caIntegrator: a translational informatics platform that can integrate clinical and molecular information across patients and trials.
-
caBench-to-Bedside (caB2B): a caGrid client that permits users to search caGrid data services.
-
caArray: an open-source microarray data management system, that supports other caBIG® tools like geWorkbench and GenePattern.
-
Protein Information Resource (gridPIR): a data resource for genomic and proteomic resource that connects to relevant PIR databases.
-
Cancer Models Database (caMOD): a resource for animal models for human cancer, involving submission and search capabilities.
-
Bioconductor: an established open-source collection of software packages for high-throughput genome analysis.
1.6 Genome Analysis
-
SEED: a tool for making and sharing genomic annotations, and for access and analysis of annotations.
-
Transcript Annotation Prioritization and Screening System (TRAPSS): an analytical tool for screening gene sequences for mutations, including notable ones.
-
Function Express (caFE): a tool for annotation of probes on microarrays.
-
GoMiner: a tool for biological interpretation using the Gene Ontology.
-
GeneConnect: a caBIG® mapping service that interlinks various genomic identifiers.
1.7 Protein Analysis
-
RProteomics: a package for analyzing mass spectrometry proteomics data.
-
Q5: a classification tool for expression-dependent proteomic data.
-
Proteomics Laboratory Information Management System (protLIMS): a prototype LIMS for proteomics.
-
Cancer Molecular Pages (CMP): an automated annotation system for cancer-related proteins.
-
Computational Proteomics Analysis System (CPAS): an open-source tool for peptide scoring based on the Trans Proteomics Pipeline.
1.8 Pathway Analysis
-
Quantitative Pathway Analysis in Cancer (QPACA): a tool that provides a set of routines for analysis of microarray data in the context of genetic pathways.
-
Reactome: a service that establishes the Reactome system as a caGrid service.
-
Pathway Interaction Database: a curated database of information about known biomolecular interactions and cellular signaling processes.
-
Pathway Tools: a suite of tools that interact on an open-source pathway database.
1.9 Statistical Analysis
-
Visual Statistical Data Analyzer (VISDA): an analytical tool for cluster modeling and visualization.
-
Distance Weighted Discrimination (DWD): a tool that corrects microarray analyses by reducing systematic bias.
1.10 Core Infrastructure
-
caGrid: the underlying network architecture, based on the Globus toolkit, that balances local access control and central services.
-
BRIDG Model: a collaborative data model for clinical research.
-
caCORE: an open source group of software products developed by NCICB that provide the tools for open source, interoperable applications that are caGrid compliant.
-
Cancer Bioinformatic Infrastructure Objects (caBIO): a set of software objects that provide a programming interface to caCORE.
-
NCI Enterprise Vocabulary Server (EVS): a service that produces metathesaurus entities to support standard controlled vocabularies for caBIG® projects and products.
-
caDSR: a metadata registry in caCORE that stores and manages CDEs developed by participants and NCI-supported organizations.
-
caCORE Software Development Kit (caCORE SDK): a set of tools designed to aid in a caBIG® compliant, semantically integrated system.
1.11 Vocabularies
-
LexBIG: a set of software and services to load, publish, and access caBIG® compliant vocabulary.
-
Mouse–Human Anatomy Mapping Ontology (MHAP): a mapping and harmonization of murine and human anatomical descriptors.
-
Cancer Nutrition Ontology: a unified set of Nutrition vocabularies and mappings.
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Beck, J.R. (2010). The Cancer Biomedical Informatics Grid (caBIG‚): An Evolving Community for Cancer Research. In: Ochs, M., Casagrande, J., Davuluri, R. (eds) Biomedical Informatics for Cancer Research. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-5714-6_9
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5714-6_9
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5712-2
Online ISBN: 978-1-4419-5714-6
eBook Packages: MedicineMedicine (R0)