
Dataset Information


HPV DeepSeq: An Ultra-Fast Method of NGS Data Analysis and Visualization Using Automated Workflows and a Customized Papillomavirus Database in CLC Genomics Workbench.


ABSTRACT: Next-generation sequencing (NGS) has made human papillomavirus (HPV) virome profiling feasible for in-depth investigation of viral evolution and pathogenesis. However, viral computational analysis remains a bottleneck due to semantic discrepancies between computational tools and curated reference genomes. To address this, we developed and tested automated workflows for HPV taxonomic profiling and visualization using a customized papillomavirus database in the CLC Microbial Genomics Module. HPV genomes from Papilloma Virus Episteme were customized and incorporated into CLC "ready-to-use" workflows for stepwise data processing comprising: (1) Taxonomic Analysis, (2) Estimate Alpha/Beta Diversities, and (3) Map Reads to Reference. Low-grade (n = 95) and high-grade (n = 60) Pap smears were tested, with the following collective runtimes: Taxonomic Analysis (36 min); Alpha/Beta Diversities (5 s); Map Reads (45 min). Converting tabular output to visualizations took 1-2 keystrokes. Biodiversity analysis comparing low-grade (LSIL) and high-grade squamous intraepithelial lesions (HSIL) revealed loss of species richness and gain of dominance by HPV-16 in HSIL. Integrating clinically relevant, taxonomized HPV reference genomes within automated workflows proved to be an ultra-fast method of virome profiling. The entire process, named "HPV DeepSeq", provides a simple, accurate, and practical means of NGS data analysis for a broad range of applications in viral research.

SUBMITTER: Shen-Gunther J 

PROVIDER: S-EPMC8398645 | biostudies-literature | 2021 Aug

REPOSITORIES: biostudies-literature


Publications

HPV DeepSeq: An Ultra-Fast Method of NGS Data Analysis and Visualization Using Automated Workflows and a Customized Papillomavirus Database in CLC Genomics Workbench.

Shen-Gunther Jane, Xia Qingqing, Cai Hong, Wang Yufeng

Pathogens (Basel, Switzerland), 2021 Aug 13; issue 8



Similar Datasets

| S-EPMC9331699 | biostudies-literature
| S-EPMC3857499 | biostudies-literature
| S-EPMC9861985 | biostudies-literature
| S-EPMC3618517 | biostudies-literature
| S-EPMC3889118 | biostudies-literature
| S-EPMC4133763 | biostudies-literature
| S-EPMC10777859 | biostudies-literature
| S-EPMC10063409 | biostudies-literature
| S-EPMC8935747 | biostudies-literature