Unknown

Dataset Information

0

TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.


ABSTRACT: This paper presents TurboFold II, an extension of the TurboFold algorithm for predicting secondary structures for multiple RNA homologs. TurboFold II augments the structure prediction capabilities of TurboFold by additionally providing multiple sequence alignments. Probabilities for alignment of nucleotide positions between all pairs of input sequences are iteratively estimated in TurboFold II by incorporating information from both the sequence identity and secondary structures. A multiple sequence alignment is obtained from these probabilities by using a probabilistic consistency transformation and a hierarchically computed guide tree. To assess TurboFold II, its sequence alignment and structure predictions were compared with leading tools, including methods that focus on alignment alone and methods that provide both alignment and structure prediction. TurboFold II has comparable alignment accuracy with MAFFT and higher accuracy than other tools. TurboFold II also has comparable structure prediction accuracy as the original TurboFold algorithm, which is one of the most accurate methods. TurboFold II is part of the RNAstructure software package, which is freely available for download at http://rna.urmc.rochester.edu under a GPL license.

SUBMITTER: Tan Z 

PROVIDER: S-EPMC5714223 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.

Tan Zhen Z   Fu Yinghan Y   Sharma Gaurav G   Mathews David H DH  

Nucleic acids research 20171101 20


This paper presents TurboFold II, an extension of the TurboFold algorithm for predicting secondary structures for multiple RNA homologs. TurboFold II augments the structure prediction capabilities of TurboFold by additionally providing multiple sequence alignments. Probabilities for alignment of nucleotide positions between all pairs of input sequences are iteratively estimated in TurboFold II by incorporating information from both the sequence identity and secondary structures. A multiple seque  ...[more]

Similar Datasets

| S-EPMC4267632 | biostudies-literature
| S-EPMC7671329 | biostudies-literature
| S-EPMC5408826 | biostudies-other
| S-EPMC2366961 | biostudies-literature
| S-EPMC3273805 | biostudies-literature
| S-EPMC2952876 | biostudies-literature
| S-EPMC3819574 | biostudies-literature
| S-EPMC1579236 | biostudies-literature