Unknown

Dataset Information

0

Cenote-Taker 2 democratizes virus discovery and sequence annotation.


ABSTRACT: Viruses, despite their great abundance and significance in biological systems, remain largely mysterious. Indeed, the vast majority of the perhaps hundreds of millions of viral species on the planet remain undiscovered. Additionally, many viruses deposited in central databases like GenBank and RefSeq are littered with genes annotated as 'hypothetical protein' or the equivalent. Cenote-Taker 2, a virus discovery and annotation tool available on command line and with a graphical user interface with free high-performance computation access, utilizes highly sensitive models of hallmark virus genes to discover familiar or divergent viral sequences from user-input contigs. Additionally, Cenote-Taker 2 uses a flexible set of modules to automatically annotate the sequence features of contigs, providing more gene information than comparable tools. The outputs include readable and interactive genome maps, virome summary tables, and files that can be directly submitted to GenBank. We expect Cenote-Taker 2 to facilitate virus discovery, annotation, and expansion of the known virome.

SUBMITTER: Tisza MJ 

PROVIDER: S-EPMC7816666 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Cenote-Taker 2 democratizes virus discovery and sequence annotation.

Tisza Michael J MJ   Belford Anna K AK   Domínguez-Huerta Guillermo G   Bolduc Benjamin B   Buck Christopher B CB  

Virus evolution 20201230 1


Viruses, despite their great abundance and significance in biological systems, remain largely mysterious. Indeed, the vast majority of the perhaps hundreds of millions of viral species on the planet remain undiscovered. Additionally, many viruses deposited in central databases like GenBank and RefSeq are littered with genes annotated as 'hypothetical protein' or the equivalent. Cenote-Taker 2, a virus discovery and annotation tool available on command line and with a graphical user interface wit  ...[more]

Similar Datasets

| S-EPMC6376018 | biostudies-literature
| S-EPMC7245624 | biostudies-literature
| S-EPMC2974822 | biostudies-literature
| S-EPMC4374605 | biostudies-literature
| S-EPMC7412958 | biostudies-literature
| S-EPMC2739203 | biostudies-literature
| S-EPMC5015703 | biostudies-literature
| S-EPMC310682 | biostudies-literature
| S-EPMC1287883 | biostudies-literature
| S-EPMC6886511 | biostudies-literature