
Dataset Information


Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut.


ABSTRACT: Metagenomic sequence analysis is rapidly becoming the primary source of virus discovery [1-3]. A substantial majority of the currently available virus genomes come from metagenomics, and some of these represent extremely abundant viruses, even if never grown in the laboratory. A particularly striking case of a virus discovered via metagenomics is crAssphage, which is by far the most abundant human-associated virus known, comprising up to 90% of sequences in the gut virome [4]. Over 80% of the predicted proteins encoded in the approximately 100 kilobase crAssphage genome showed no significant similarity to available protein sequences, precluding classification of this virus and hampering further study. Here we combine a comprehensive search of genomic and metagenomic databases with sensitive methods for protein sequence analysis to identify an expansive, diverse group of bacteriophages related to crAssphage and predict the functions of the majority of phage proteins, in particular those that comprise the structural, replication and expression modules. Most, if not all, of the crAss-like phages appear to be associated with diverse bacteria from the phylum Bacteroidetes, which includes some of the most abundant bacteria in the human gut microbiome and is also common in various other habitats. These findings provide a basis for experimental characterization of the most abundant but poorly understood members of the human-associated virome.

SUBMITTER: Yutin N 

PROVIDER: S-EPMC5736458 | biostudies-literature | 2018 Jan

REPOSITORIES: biostudies-literature


Publications

Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut.

Natalya Yutin, Kira S. Makarova, Ayal B. Gussow, Mart Krupovic, Anca Segall, Robert A. Edwards, Eugene V. Koonin

Nature Microbiology, 2017-11-13, issue 1



Similar Datasets

| S-EPMC6235969 | biostudies-literature
| S-EPMC7895897 | biostudies-literature
| S-EPMC4111155 | biostudies-literature
| S-EPMC8530359 | biostudies-literature
| EGAS00001006260 | EGA
| S-EPMC9226167 | biostudies-literature
| S-EPMC5812488 | biostudies-literature
| S-EPMC5743082 | biostudies-literature
| S-EPMC4069399 | biostudies-literature
| S-EPMC10055259 | biostudies-literature