Dataset Information

CANT-HYD: A Curated Database of Phylogeny-Derived Hidden Markov Models for Annotation of Marker Genes Involved in Hydrocarbon Degradation.


ABSTRACT: Many pathways for hydrocarbon degradation have been discovered, yet there are no dedicated tools to identify and predict the hydrocarbon degradation potential of microbial genomes and metagenomes. Here we present the Calgary approach to ANnoTating HYDrocarbon degradation genes (CANT-HYD), a database of 37 HMMs of marker genes involved in anaerobic and aerobic degradation pathways of aliphatic and aromatic hydrocarbons. Using this database, we identify understudied or overlooked hydrocarbon degradation potential in many phyla. We also demonstrate its application in analyzing high-throughput sequence data by predicting hydrocarbon utilization in large metagenomic datasets from diverse environments. CANT-HYD is available at https://github.com/dgittins/CANT-HYD-HydrocarbonBiodegradation.

SUBMITTER: Khot V 

PROVIDER: S-EPMC8767102 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

Publications

CANT-HYD: A Curated Database of Phylogeny-Derived Hidden Markov Models for Annotation of Marker Genes Involved in Hydrocarbon Degradation.

Varada Khot, Jackie Zorz, Daniel A. Gittins, Anirban Chakraborty, Emma Bell, María A. Bautista, Alexandre J. Paquette, Alyse K. Hawley, Breda Novotnik, Casey R. J. Hubert, Marc Strous, Srijak Bhatnagar

Frontiers in Microbiology, 2022-01-07


Similar Datasets

S-EPMC5860389 | biostudies-literature
S-EPMC5385569 | biostudies-literature
S-EPMC6950343 | biostudies-literature
S-EPMC9623898 | biostudies-literature
S-EPMC4300491 | biostudies-literature
S-EPMC6658007 | biostudies-literature
S-EPMC4231724 | biostudies-literature
S-EPMC10491390 | biostudies-literature
S-EPMC8830650 | biostudies-literature
S-EPMC10510128 | biostudies-literature