Scale-Dependent Statistics of the Numbers of Transcripts and Protein Sequences Encoded in the Genome

  • Chapter
Computational and Statistical Approaches to Genomics


Copyright information

© 2006 Springer Science+Business Media, Inc.

About this chapter

Cite this chapter

Kuznetsov, V.A. (2006). Scale-Dependent Statistics of the Numbers of Transcripts and Protein Sequences Encoded in the Genome. In: Zhang, W., Shmulevich, I. (eds) Computational and Statistical Approaches to Genomics. Springer, Boston, MA. https://doi.org/10.1007/0-387-26288-1_10
