
Dataset Information

The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats.


ABSTRACT: Unmapped next-generation sequencing reads are typically ignored even though they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High-quality reads were selected and enriched for biologically relevant sequences; similarity-based analysis revealed clustering similar to previously reported phylogenetic trees. Our results demonstrate that, on average, 20% of all unmapped reads harbor sequences that can be used to improve reference genomes and generate hypotheses on potential genotype-phenotype relationships. Analysis pipelines would benefit from incorporating the described methods, and reference genomes would benefit from inclusion of the genomic segments obtained through these efforts.

SUBMITTER: van der Weide RH 

PROVIDER: S-EPMC4976967 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats.

van der Weide Robin H, Simonis Marieke, Hermsen Roel, Toonen Pim, Cuppen Edwin, de Ligt Joep

PLoS One, 2016-08-08; 8



Similar Datasets

| S-EPMC4815510 | biostudies-literature
| S-EPMC6668410 | biostudies-literature
| S-EPMC10585868 | biostudies-literature
| S-EPMC4402702 | biostudies-literature
| S-EPMC6323668 | biostudies-literature
| S-EPMC6683435 | biostudies-literature
| S-EPMC6052005 | biostudies-literature
| S-EPMC3347568 | biostudies-literature
| S-EPMC6805598 | biostudies-literature
| S-EPMC7979878 | biostudies-literature