Dataset Information

Baiting out a full length sequence from unmapped RNA-seq data.

ABSTRACT:

Background

As a powerful tool, RNA-Seq has been widely used in various studies. Usually, unmapped RNA-seq reads have been considered as useless and been trashed or ignored.

Results

We develop a strategy to mining the full length sequence by unmapped reads combining with specific reverse transcription primers design and high throughput sequencing. In this study, we salvage 36 unmapped reads from standard RNA-Seq data and randomly select one 149 bp read as a model. Specific reverse transcription primers are designed to amplify its both ends, followed by next generation sequencing. Then we design a statistical model based on power law distribution to estimate its integrality and significance. Further, we validate it by Sanger sequencing. The result shows that the full length is 1556 bp, with insertion mutations in microsatellite structure.

Conclusion

We believe this method would be a useful strategy to extract the sequences information from the unmapped RNA-seq data. Further, it is an alternative way to get the full length sequence of unknown cDNA.

SUBMITTER: Li D

PROVIDER: S-EPMC8626966 | biostudies-literature | 2021 Nov

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Baiting out a full length sequence from unmapped RNA-seq data.

Li Dongwei D Huang Qitong Q Huang Lei L Wen Jikai J Luo Jing J Li Qing Q Peng Yanling Y Zhang Yubo Y

BMC genomics 20211127 1

<h4>Background</h4>As a powerful tool, RNA-Seq has been widely used in various studies. Usually, unmapped RNA-seq reads have been considered as useless and been trashed or ignored.<h4>Results</h4>We develop a strategy to mining the full length sequence by unmapped reads combining with specific reverse transcription primers design and high throughput sequencing. In this study, we salvage 36 unmapped reads from standard RNA-Seq data and randomly select one 149 bp read as a model. Specific reverse ...[more]

PMID: 34837950

Dataset Information

Baiting out a full length sequence from unmapped RNA-seq data.

Background

Results

Conclusion

Publications

Baiting out a full length sequence from unmapped RNA-seq data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Baiting out a full length sequence from unmapped RNA-seq data
2021-04-22 | GSE172487 | GEO

Baiting out a full length sequence from unmapped RNA-seq data
| PRJNA723548 | ENA

CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.
| S-EPMC7299299 | biostudies-literature

Full-length transcriptome assembly from RNA-Seq data without a reference genome.
| S-EPMC3571712 | biostudies-literature

Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco.
| S-EPMC6936136 | biostudies-literature

Comparative evaluation of full-length isoform quantification from RNA-Seq.
| S-EPMC8145802 | biostudies-literature

Obstacles to detecting isoforms using full-length scRNA-seq data.
| S-EPMC7087381 | biostudies-literature

Full-length genome sequence of segmented RNA virus from ticks was obtained using small RNA sequencing data.
| S-EPMC7493057 | biostudies-literature

Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.
| S-EPMC4673975 | biostudies-literature

Comprehensive assembly of novel transcripts from unmapped human RNA-Seq data and their association with cancer.
| S-EPMC4562499 | biostudies-literature