Return to search

Model for Long-range Correlations in DNA Sequences

We address the problem of the DNA sequences developing a "dynamical" method based on the assumption that the statistical properties of DNA paths are determined by the joint action of two processes, one deterministic, with long-range correlations, and the other random and delta correlated. The generator of the deterministic evolution is a nonlinear map, belonging to a class of maps recently tailored to mimic the processes of weak chaos responsible for the birth of anomalous diffusion. It is assumed that the deterministic process corresponds to unknown biological rules which determine the DNA path, whereas the noise mimics the influence of an infinite-dimensional environment on the biological process under study.
We prove that the resulting diffusion process, if the effect of the random process is neglected, is an a-stable Levy process with 1 < a < 2. We also show that, if the diffusion process is determined by the joint action of the deterministic and the random process, the correlation effects of the "deterministic dynamics" are cancelled on the short-range scale, but show up in the long-range one. We denote our prescription to generate statistical sequences as the Copying Mistake Map (CMM).
We carry out our analysis of several DNA sequences, and of their CMM realizations, with a variety of techniques, and we especially focus on a method of regression to equilibrium, which we call the Onsager Analysis. With these techniques we establish the statistical equivalence of the real DNA sequences with their CMM realizations. We show that long-range correlations are present in exons as well as in introns, but are difficult to detect, since the exon "dynamics" is shown to be determined by theentaglement of three distinct and independent CMM's.
Finally we study the validity of the stationary assumption in DNA sequences and we discuss a biological model for the short-range random process based on a folding mechanism of the nucleic acid in the cell nucleus.

Identiferoai:union.ndltd.org:unt.edu/info:ark/67531/metadc279189
Date12 1900
CreatorsAllegrini, Paolo
ContributorsWest, Bruce J., Grigolini, Paolo, Deering, William D., Kowalski, Jacek M., Shanley, Mark Stephen
PublisherUniversity of North Texas
Source SetsUniversity of North Texas
LanguageEnglish
Detected LanguageEnglish
TypeThesis or Dissertation
Formatxii, 128 leaves: ill., Text
RightsPublic, Copyright, Copyright is held by the author, unless otherwise noted. All rights reserved., Allegrini, Paolo

Page generated in 0.0022 seconds