The Challenges in Blood Proteomic Biomarker Discovery
Although discovering proteomic biomarker by using mass spectrometry technique is promising, its rate of introducing proteomic biomarker approved by the US Food and Drug Administration is falling every year and nearly 1 per year on an average since 1998. Apparently, there is a big gap between biomarker discovery and biomarker validation. Here, we reviewed the challenges appearing in the three key stages for the pipeline of proteomic biomarker, that is, blood sample preparation, bioinformatics algorithms for biomarker candidate discovery, and validation and clinical application of proteomic biomarkers. To analyze and explain the reasons for the gap between biomarker discovery and validation, we covered areas ranging from the techniques/methods used in biomarker discovery and their related biological backgrounds to the existing problems in these techniques/methods.
KeywordsFeature Selection Linear Discriminant Analysis Discrete Wavelet Transform Feature Subset Peak Detection
This research is funded by the Bioinformatics Core Research Grant at The Methodist Research Institute, Cornell University. Dr. Zhou is partially funded by The Methodist Hospital Scholarship Award. He and Dr. Wong are also partially funded by NIH grants R01LM08696, R01LM009161, and R01AG028928. The authors have declared no conflict of interest.
- Diamond DL, Y Zhang et al (2003) Use of ProteinChip array surface enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS) to identify thymosin beta-4, a differentially secreted protein from lymphoblastoid cell lines. J Am Soc Mass Spectrom 14(7):760–765PubMedCrossRefGoogle Scholar
- Fung ET, Enderwick C (2002) ProteinChip clinical proteomics: computational challenges and solutions. Biotechniques Suppl:34–38, 40–41Google Scholar
- Itoh SG, Okamoto Y (2007) Effective sampling in the configurational space of a small peptide by the multicanonical-multioverlap algorithm. Phys Rev E Stat Nonlin Soft Matter Phys 76(2, Part 2):026705Google Scholar
- Wang P, Tang H et al (2006) Normalization regarding non-random missing values in high-throughput mass spectrometry data. Pac Symp Biocomput 315–326Google Scholar
- Yewdell JW (2003) Immunology. Hide and seek in the peptidome. Science 301(5638):1334–1335Google Scholar