Dataset Information

P_RNA_scaffolder: a fast and accurate genome scaffolder using paired-end RNA-sequencing reads.


ABSTRACT:

BACKGROUND: Obtaining complete gene structures is a major goal of genome assembly. Some gene regions are fragmented in both low-quality and high-quality assemblies, so new approaches are needed to recover them. Genomes are widely transcribed, generating messenger and non-coding RNAs. These widespread transcripts can be used to scaffold genomes and complete transcribed regions.

RESULTS: We present P_RNA_scaffolder, a fast and accurate tool that uses paired-end RNA-sequencing reads to scaffold genomes. The tool aims to improve the completeness of both protein-coding and non-coding genes. When it was applied to scaffold human contigs, the structures of both protein-coding genes and circular RNAs were almost completely recovered and equivalent to those in a complete genome, especially for long proteins and long circular RNAs. Tested in various species, P_RNA_scaffolder exhibited higher speed and efficiency than the existing state-of-the-art scaffolders. The tool also improved the contiguity of genome assemblies generated by current mate-pair scaffolding and by third-generation single-molecule sequencing assembly.

CONCLUSIONS: P_RNA_scaffolder can improve the contiguity of genome assemblies and benefit gene prediction. The tool is available at http://www.fishbrowser.org/software/P_RNA_scaffolder.
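The general idea of scaffolding with paired-end reads can be illustrated with a minimal sketch. It is not P_RNA_scaffolder's actual algorithm, which the abstract does not describe: the function name `contig_links`, the `min_pairs` threshold, and the toy input are all hypothetical. The sketch assumes each read pair has already been aligned and reduced to the pair of contigs its two mates hit, and it counts pairs whose mates land on different contigs as evidence that those contigs belong to the same transcribed region.

```python
from collections import Counter


def contig_links(pair_alignments, min_pairs=2):
    """Count read pairs that connect two different contigs.

    pair_alignments: iterable of (contig_of_mate1, contig_of_mate2);
    None marks an unaligned mate. Hypothetical input format.
    min_pairs: illustrative support threshold, not taken from the tool.
    """
    support = Counter()
    for c1, c2 in pair_alignments:
        # Skip pairs with an unaligned mate or both mates on one contig.
        if c1 is None or c2 is None or c1 == c2:
            continue
        # Treat the link as unordered: (a, b) and (b, a) are the same link.
        support[tuple(sorted((c1, c2)))] += 1
    # Keep only links supported by enough independent read pairs.
    return {link: n for link, n in support.items() if n >= min_pairs}


# Toy example with made-up contig names.
pairs = [
    ("ctg1", "ctg2"),
    ("ctg2", "ctg1"),
    ("ctg1", "ctg1"),
    ("ctg3", "ctg4"),
    ("ctg2", None),
]
links = contig_links(pairs)
```

A real scaffolder would additionally use mate orientation and position to order and orient the linked contigs; this sketch only shows how supporting pairs are tallied.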

SUBMITTER: Zhu BH 

PROVIDER: S-EPMC5834899 | biostudies-literature | 2018 Mar

REPOSITORIES: biostudies-literature

Publications

P_RNA_scaffolder: a fast and accurate genome scaffolder using paired-end RNA-sequencing reads.

Zhu Bai-Han, Xiao Jun, Xue Wei, Xu Gui-Cai, Sun Ming-Yuan, Li Jiong-Tang

BMC Genomics, 2018-03-02, issue 1


Similar Datasets

| S-EPMC3614465 | biostudies-other
| S-EPMC4074385 | biostudies-literature
| S-EPMC8513298 | biostudies-literature
| S-EPMC4234483 | biostudies-literature
| S-EPMC3076424 | biostudies-literature
| S-EPMC4582294 | biostudies-literature
| S-EPMC5029459 | biostudies-literature
| S-EPMC3158087 | biostudies-literature
| S-EPMC7035700 | biostudies-literature
| S-EPMC7168855 | biostudies-literature