Unknown

Dataset Information

0

Extension of the viral ecology in humans using viral profile hidden Markov models.


ABSTRACT: When human samples are sequenced, many assembled contigs are "unknown", as conventional alignments find no similarity to known sequences. Hidden Markov models (HMM) exploit the positions of specific nucleotides in protein-encoding codons in various microbes. The algorithm HMMER3 implements HMM using a reference set of sequences encoding viral proteins, "vFam". We used HMMER3 analysis of "unknown" human sample-derived sequences and identified 510 contigs distantly related to viruses (Anelloviridae (n = 1), Baculoviridae (n = 34), Circoviridae (n = 35), Caulimoviridae (n = 3), Closteroviridae (n = 5), Geminiviridae (n = 21), Herpesviridae (n = 10), Iridoviridae (n = 12), Marseillevirus (n = 26), Mimiviridae (n = 80), Phycodnaviridae (n = 165), Poxviridae (n = 23), Retroviridae (n = 6) and 89 contigs related to described viruses not yet assigned to any taxonomic family). In summary, we find that analysis using the HMMER3 algorithm and the "vFam" database greatly extended the detection of viruses in biospecimens from humans.

SUBMITTER: Bzhalava Z 

PROVIDER: S-EPMC5774701 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extension of the viral ecology in humans using viral profile hidden Markov models.

Bzhalava Zurab Z   Hultin Emilie E   Dillner Joakim J  

PloS one 20180119 1


When human samples are sequenced, many assembled contigs are "unknown", as conventional alignments find no similarity to known sequences. Hidden Markov models (HMM) exploit the positions of specific nucleotides in protein-encoding codons in various microbes. The algorithm HMMER3 implements HMM using a reference set of sequences encoding viral proteins, "vFam". We used HMMER3 analysis of "unknown" human sample-derived sequences and identified 510 contigs distantly related to viruses (Anellovirida  ...[more]

Similar Datasets

| S-EPMC2770071 | biostudies-literature
| S-EPMC3356369 | biostudies-literature
| S-EPMC5860389 | biostudies-literature
| S-EPMC2685388 | biostudies-literature
| S-EPMC2883304 | biostudies-literature
| S-EPMC8097282 | biostudies-literature
| S-EPMC4553831 | biostudies-literature
| S-EPMC4867884 | biostudies-other
| S-EPMC6437899 | biostudies-literature
| S-EPMC4139300 | biostudies-literature