Unknown

Dataset Information

0

Viral coinfection analysis using a MinHash toolkit.


ABSTRACT:

Background

Human papillomavirus (HPV) is a common sexually transmitted infection associated with cervical cancer that frequently occurs as a coinfection of types and subtypes. Highly similar sublineages that show over 100-fold differences in cancer risk are not distinguishable in coinfections with current typing methods.

Results

We describe an efficient set of computational tools, rkmh, for analyzing complex mixed infections of related viruses based on sequence data. rkmh makes extensive use of MinHash similarity measures, and includes utilities for removing host DNA and classifying reads by type, lineage, and sublineage. We show that rkmh is capable of assigning reads to their HPV type as well as HPV16 lineage and sublineages.

Conclusions

Accurate read classification enables estimates of percent composition when there are multiple infecting lineages or sublineages. While we demonstrate rkmh for HPV with multiple sequencing technologies, it is also applicable to other mixtures of related sequences.

SUBMITTER: Dawson ET 

PROVIDER: S-EPMC6626348 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications


<h4>Background</h4>Human papillomavirus (HPV) is a common sexually transmitted infection associated with cervical cancer that frequently occurs as a coinfection of types and subtypes. Highly similar sublineages that show over 100-fold differences in cancer risk are not distinguishable in coinfections with current typing methods.<h4>Results</h4>We describe an efficient set of computational tools, rkmh, for analyzing complex mixed infections of related viruses based on sequence data. rkmh makes ex  ...[more]

Similar Datasets

| S-EPMC5957777 | biostudies-other
| S-EPMC4915045 | biostudies-literature
| PRJEB56223 | ENA
| S-EPMC7177155 | biostudies-literature
| S-EPMC7423146 | biostudies-literature
| S-EPMC8428259 | biostudies-literature
| S-EPMC5538707 | biostudies-other
| S-EPMC10824537 | biostudies-literature
| S-EPMC6166523 | biostudies-literature
| S-EPMC7185664 | biostudies-literature