We analyze the distribution of RNA secondary structures given by the Knudsen-Hein stochastic context-free grammar used in the prediction program Pfold. Our main theorem gives relations between the expected number of these motifs--independent of the grammar probabilities. These relations are a consequence of proving that the distribution of base pairs, of helices, and of different types of loops is asymptotically Gaussian in this model of RNA folding. Proof techniques use singularity analysis of probability generating functions. We also demonstrate that these asymptotic results capture well the expected number of RNA base pairs in native ribosomal structures, and certain other aspects of their predicted secondary structures. In particular, we find that the predicted structures largely satisfy the expected relations, although the native structures do not.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4081518PMC
http://dx.doi.org/10.1007/s00285-013-0750-yDOI Listing

Publication Analysis

Top Keywords

stochastic context-free
8
context-free grammar
8
model rna
8
rna folding
8
secondary structures
8
expected number
8
base pairs
8
structures
5
asymptotic distribution
4
distribution motifs
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!