RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences
- Brown Univ., Providence, RI (United States). Dept. of Mathematics
- Brown Univ., Providence, RI (United States). Division of Applied Mathematics. Center for Computational Molecular Biology
Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainly in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions. Results: Our analysis of three publically available datasets showed a substantial improvement in RNA structure prediction by RNAG over extant prediction methods. Additionally, our analysis of 17 RNA families showed that the RNAG sampled structures were generally compact around their ensemble centroids, and at least 11 families had at least two well-separated clusters of predicted structures. In general, the distance between a reference structure and our predicted structure was large relative to the variation among structures within an ensemble. Availability: The Perl implementation of the RNAG algorithm and the data necessary to reproduce the results described in Sections 3.1 and 3.2 are available at http://ccmbweb.ccv.brown.edu/rnag.html
- Research Organization:
- Brown Univ., Providence, RI (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- Grant/Contract Number:
- FG02-04ER63942
- OSTI ID:
- 1625276
- Journal Information:
- Bioinformatics, Vol. 27, Issue 18; ISSN 1367-4803
- Publisher:
- Oxford University PressCopyright Statement
- Country of Publication:
- United States
- Language:
- English
CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction
|
journal | February 2013 |
Fighting against uncertainty: An essential issue in bioinformatics | preprint | January 2013 |
RNA secondary structure prediction from multi-aligned sequences | preprint | January 2013 |
Similar Records
Bayesian computational approaches for gene regulation studies of bioethanol and biohydrogen production
RNA modeling using Gibbs sampling and stochastic context free grammars