Cytochrome P450 diversity in the tree of life.
Ontology highlight
ABSTRACT: Sequencing in all areas of the tree of life has produced >300,000 cytochrome P450 (CYP) sequences that have been mined and collected. Nomenclature has been assigned to >41,000 CYP sequences and the majority of the remainder has been sorted by BLAST searches into clans, families and subfamilies in preparation for naming. The P450 sequence space is being systematically explored and filled in. Well-studied groups like vertebrates are covered in greater depth while new insights are being added into uncharted territories like horseshoe crab (Limulus polyphemus), tardigrades (Hypsibius dujardini), velvet worm (Euperipatoides_rowelli), and basal land plants like hornworts, liverworts and mosses. CYPs from the fungi, one of the most diverse groups, are being explored and organized as nearly 800 fungal species are now sequenced. The CYP clan structure in fungi is emerging with 805 CYP families sorting into 32 CYP clans. >3000 bacterial sequences are named, mostly from terrestrial or freshwater sources. Of 18,379 bacterial sequences downloaded from the CYPED database, all are >43% identical to named CYPs. Therefore, they fit in the 602 named P450 prokaryotic families. Diversity in this group is becoming saturated, however 25% of 3305 seawater bacterial P450s did not match known P450 families, indicating marine bacterial CYPs are not as well sampled as land/freshwater based bacterial CYPs. Future sequencing plans of the Genome 10K project, i5k and GIGA (Global Invertebrate Genomics Alliance) are expected to produce more than one million cytochrome P450 sequences by 2020. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.
SUBMITTER: Nelson DR
PROVIDER: S-EPMC5681887 | biostudies-literature | 2018 Jan
REPOSITORIES: biostudies-literature
ACCESS DATA