Quantitative Biology

, Volume 1, Issue 4, pp 261–271

NPEST: a nonparametric method and a database for transcription start site prediction

  • Tatiana Tatarinova
  • Alona Kryshchenko
  • Martin Triska
  • Mehedi Hassan
  • Denis Murphy
  • Michael Neely
  • Alan Schumitzky
Research Article

DOI: 10.1007/s40484-013-0022-2

Cite this article as:
Tatarinova, T., Kryshchenko, A., Triska, M. et al. Quant Biol (2013) 1: 261. doi:10.1007/s40484-013-0022-2

Abstract

In this paper we present NPEST, a novel tool for the analysis of expressed sequence tags (EST) distributions and transcription start site (TSS) prediction. This method estimates an unknown probability distribution of ESTs using a maximum likelihood (ML) approach, which is then used to predict positions of TSS. Accurate identification of TSS is an important genomics task, since the position of regulatory elements with respect to the TSS can have large effects on gene regulation, and performance of promoter motif-finding methods depends on correct identification of TSSs. Our probabilistic approach expands recognition capabilities to multiple TSS per locus that may be a useful tool to enhance the understanding of alternative splicing mechanisms. This paper presents analysis of simulated data as well as statistical analysis of promoter regions of a model dicot plant Arabidopsis thaliana. Using our statistical tool we analyzed 16520 loci and developed a database of TSS, which is now publicly available at www.glacombio.net/NPEST.

Keywords

transcription start site (TSS) nonparametric maximum likelihood 
Download to read the full article text

Copyright information

© Higher Education Press and Springer-Verlag GmbH 2013

Authors and Affiliations

  • Tatiana Tatarinova
    • 1
  • Alona Kryshchenko
    • 1
  • Martin Triska
    • 1
    • 2
  • Mehedi Hassan
    • 2
  • Denis Murphy
    • 2
  • Michael Neely
    • 1
  • Alan Schumitzky
    • 1
  1. 1.Children’s Hospital Los Angeles and Keck School of MedicineUniversity of Southern CaliforniaLos AngelesUSA
  2. 2.Genomics and Computational Biology research groupUniversity of South WalesTreforest, WalesUK

Personalised recommendations