Abstract
Clustering and long-range correlations in the nucleotide sequences of different categories of organisms are studied. As a result of clustering, the size distribution of coding and non-coding DNA regions is estimated analytically using the Generalised Central Limit Theorem.The alternation of coding regions (which follow a short range size distribution) with non-coding regions (which follow a long range size distribution in higher organisms) leads to DNA structures which have a striking resemblance to random Cantor Fractals. For lower organisms (such as viruses, procaryotes etc.) long-range correlations are sporadically observed and the DNA structures do not present fractality.Statistical models are proposed based on biologically motivated dynamical mechanisms (such as aggregation of oligonucleotides, influx and DNA length reduction), which can account for the above statistical features.
Similar content being viewed by others
REFERENCES
C. K. Peng, S. V. Buldyrev, A. L. Goldberger, S. Havlin, F. Sciortino, M. Simons and H. E. Stanley, Nature 356:168 (1992).
W. Li and K. Kaneko, Europhys. Lett. 17:655 (1992).
Y. Almirantis and A. Provata, J. Stat. Phys. 97:233 (1999).
A. Provata and Y. Almirantis, Fractals 8:15 (2000).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Provata, A., Almirantis, Y. Statistical Dynamics of Clustering in the Genome Structure. Journal of Statistical Physics 101, 709 (2000). https://doi.org/10.1023/A:1026490911614
Issue Date:
DOI: https://doi.org/10.1023/A:1026490911614