Dataset Information

Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes.


ABSTRACT: High-throughput RNA sequencing (RNA-seq) provides a comprehensive picture of the transcriptome, including the identity, structure, quantity, and variability of expressed transcripts in cells, through the assembly of sequenced short RNA-seq reads. Although the reference-based approach guarantees the high quality of the resulting transcriptome, it is only applicable when the relevant reference genome is available. Here, we developed a pseudo-reference-based assembly (PRA) method that reconstructs a transcriptome using a linear regression function of the optimized mapping parameters and the genetic distances to the closest species. Using the linear model, we reconstructed the transcriptomes of four birds (the White Leghorn, turkey, duck, and zebra finch) with the Gallus gallus genome as a pseudo-reference, and of three primates (the chimpanzee, gorilla, and macaque) with the human genome as a pseudo-reference. The resulting transcriptomes show that the PRAs outperformed the de novo approach for species within a mutation rate of about 10% among orthologous transcriptomes, enough to cover species as distantly related as chicken and duck. Taken together, we suggest that the PRA method can be used as a tool for reconstructing transcriptome maps of vertebrates whose genomes have not yet been sequenced.
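
The linear model mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which mapping parameters were optimized or what values were obtained, so everything concrete here is an assumption: the parameter is represented by a hypothetical "mismatch allowance", the function names `fit_line` and `predict` are made up for this example, and the numbers are invented placeholders rather than results from the study. The sketch fits an ordinary least-squares line to (genetic distance, best parameter) pairs and then predicts a parameter value for a species at a new distance.

```python
# Illustrative sketch only: the numbers below are invented placeholders,
# NOT values reported by the authors.

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Evaluate the fitted line at x."""
    return slope * x + intercept


# Hypothetical training pairs: genetic distance to the pseudo-reference
# (as a fraction) and the mismatch allowance that mapped reads best.
distances = [0.01, 0.03, 0.05, 0.08]
best_mismatch = [2.0, 4.0, 6.0, 9.0]

slope, intercept = fit_line(distances, best_mismatch)

# Predicted setting for a species at a hypothetical 10% distance.
predicted = predict(slope, intercept, 0.10)
```

In this sketch the fitted line is only used to choose a mapping setting for a species that lacks its own genome; the reads would then be aligned to the pseudo-reference with that setting and assembled as in a conventional reference-based workflow.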

SUBMITTER: Nam K 

PROVIDER: S-EPMC4808791 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

Pseudo-Reference-Based Assembly of Vertebrate Transcriptomes.

Kyoungwoo Nam, Heesu Jeong, Jin-Wu Nam

Genes, 2016-02-24, issue 3


Similar Datasets

| S-EPMC7462077 | biostudies-literature
| S-EPMC6481552 | biostudies-literature
| S-EPMC5226840 | biostudies-literature
| S-EPMC4330339 | biostudies-literature
| S-EPMC6545731 | biostudies-literature
| S-EPMC4856438 | biostudies-literature
| S-EPMC3562798 | biostudies-other
| S-EPMC4305238 | biostudies-literature
| S-EPMC8708430 | biostudies-literature
| S-EPMC3707018 | biostudies-literature