Dataset Information

An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus.

ABSTRACT: Advances in Next Generation Sequencing technologies have enabled the generation of millions of sequences from microorganisms. However, distinguishing the sequence of a novel species from sequencing errors remains a technical challenge when the novel species is highly divergent from the closest known species. To solve such a problem, we developed a new method called Optimistic Protein Assembly from Reads (OPAR). This method is based on the assumption that protein sequences could be more conserved than the nucleotide sequences encoding them. By taking advantage of metagenomics, bioinformatics and conventional Sanger sequencing, our method successfully identified all coding regions of the mouse picobirnavirus for the first time. The salvaged sequences indicated that segment 1 of this virus was more divergent from its homologues in other Picobirnaviridae species than segment 2. For this reason, only segment 2 of mouse picobirnavirus has been detected in previous studies. OPAR web tool is available at http://bioinformatics.czc.hokudai.ac.jp/opar/.

SUBMITTER: Gonzalez G

PROVIDER: S-EPMC5223137 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus.

Gonzalez Gabriel G Sasaki Michihito M Burkitt-Gray Lucy L Kamiya Tomonori T Tsuji Noriko M NM Sawa Hirofumi H Ito Kimihito K

Scientific reports 20170110

Advances in Next Generation Sequencing technologies have enabled the generation of millions of sequences from microorganisms. However, distinguishing the sequence of a novel species from sequencing errors remains a technical challenge when the novel species is highly divergent from the closest known species. To solve such a problem, we developed a new method called Optimistic Protein Assembly from Reads (OPAR). This method is based on the assumption that protein sequences could be more conserved ...[more]

PMID: 28071766

Dataset Information

An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus.

Publications

An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Targeted assembly of short sequence reads.
| S-EPMC3092772 | biostudies-literature

Evaluation of CircRNA Sequence Assembly Methods Using Long Reads.
| S-EPMC8882733 | biostudies-literature

Complete genome sequence of a novel picobirnavirus, otarine picobirnavirus, discovered in California sea lions.
| S-EPMC3372223 | biostudies-literature

Parallel, tag-directed assembly of locally derived short sequence reads.
| S-EPMC2848820 | biostudies-literature

Sequence verification of synthetic DNA by assembly of sequencing reads.
| S-EPMC3592409 | biostudies-literature

Morus alba Raw protein sequence reads
2024-02-08 | PXD049451 |

Improving transcriptome assembly through error correction of high-throughput sequence reads.
| S-EPMC3728768 | biostudies-literature

Metagenomic Assembly and Draft Genome Sequence of an Uncharacterized Prevotella sp. from Nelore Rumen.
| S-EPMC4498113 | biostudies-literature

Assembly and Analysis of Unmapped Genome Sequence Reads Reveal Novel Sequence and Variation in Dogs.
| S-EPMC6052005 | biostudies-literature

The Challenges of Analysing Highly Diverse Picobirnavirus Sequence Data.
| S-EPMC6316005 | biostudies-literature