Unknown

Dataset Information

0

Analysis of genomic-length HBV sequences to determine genotype and subgenotype reference sequences.


ABSTRACT: Hepatitis B virus (HBV) is a diverse, partially double-stranded DNA virus, with 9 genotypes (A-I), and a putative 10th genotype (J), characterized thus far. Given the broadening interest in HBV sequencing, there is an increasing requirement for a consistent, unified approach to HBV genotype and subgenotype classification. We set out to generate an updated resource of reference sequences using the diversity of all genomic-length HBV sequences available in public databases. We collated and aligned genomic-length HBV sequences from public databases and used maximum-likelihood phylogenetic analysis to identify genotype clusters. Within each genotype, we examined the phylogenetic support for currently defined subgenotypes, as well as identifying well-supported clades and deriving reference sequences for them. Based on the phylogenies generated, we present a comprehensive set of HBV reference sequences at the genotype and subgenotype level. All of the generated data, including the alignments, phylogenies and chosen reference sequences, are available online (https://doi.org/10.6084/m9.figshare.8851946) as a simple open-access resource.

SUBMITTER: McNaughton AL 

PROVIDER: S-EPMC7416611 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Analysis of genomic-length HBV sequences to determine genotype and subgenotype reference sequences.

McNaughton Anna L AL   Revill Peter A PA   Littlejohn Margaret M   Matthews Philippa C PC   Ansari M Azim MA  

The Journal of general virology 20200301 3


Hepatitis B virus (HBV) is a diverse, partially double-stranded DNA virus, with 9 genotypes (A-I), and a putative 10th genotype (J), characterized thus far. Given the broadening interest in HBV sequencing, there is an increasing requirement for a consistent, unified approach to HBV genotype and subgenotype classification. We set out to generate an updated resource of reference sequences using the diversity of all genomic-length HBV sequences available in public databases. We collated and aligned  ...[more]

Similar Datasets

| S-EPMC7825714 | biostudies-literature
| S-EPMC2935930 | biostudies-literature
| S-EPMC3477104 | biostudies-literature
| S-EPMC3749532 | biostudies-literature
| S-EPMC4190083 | biostudies-literature
| S-EPMC3510619 | biostudies-literature
| S-EPMC3523008 | biostudies-literature
| S-EPMC4355778 | biostudies-literature
| S-EPMC3607908 | biostudies-literature
| S-EPMC6124380 | biostudies-literature