Dataset Information

GISSD: Group I Intron Sequence and Structure Database.


ABSTRACT: The Group I Intron Sequence and Structure Database (GISSD) is a specialized and comprehensive database for group I introns, focusing on the integration of useful group I intron information from available databases and providing de novo data that is essential for understanding these introns at a systematic level. This database presents 1789 complete intron records, including the nucleotide sequence of each annotated intron plus 15 nt of the upstream and downstream exons, and the pseudoknot-containing secondary structures predicted by integrating comparative sequence analyses and minimum free energy algorithms. These introns represent all 14 subgroups, with their structure-based alignments being separately provided. Both structure predictions and alignments were done manually and iteratively adjusted, which yielded a reliable consensus structure for each subgroup. These consensus structures allowed us to judge the confidence of 20 085 group I introns previously found by the INFERNAL program and to classify them into subgroups automatically. The database provides intron-associated taxonomy information from GenBank, allowing one to view the detailed distribution of all group I introns. CDSs residing in introns and 3D structure information are also integrated if available. About 17 000 group I introns have been validated in this database; approximately 95% of them belong to the IC3 subgroup and reside in the chloroplast tRNA(Leu) gene. The GISSD database can be accessed at http://www.rna.whu.edu.cn/gissd/

SUBMITTER: Zhou Y 

PROVIDER: S-EPMC2238919 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

S-EPMC308312 | biostudies-other
S-EPMC5154033 | biostudies-literature
S-EPMC4406475 | biostudies-literature
S-EPMC4197185 | biostudies-literature
S-EPMC1370676 | biostudies-literature
S-EPMC65690 | biostudies-literature
S-EPMC102468 | biostudies-literature
S-EPMC3670821 | biostudies-literature
S-EPMC3074136 | biostudies-literature
S-EPMC2574657 | biostudies-literature