Dataset Information

HyDe: A Python Package for Genome-Scale Hybridization Detection.


ABSTRACT: The analysis of hybridization and gene flow among closely related taxa is a common goal for researchers studying speciation and phylogeography. Many methods for hybridization detection use simple site pattern frequencies from observed genomic data and compare them to null models that predict an absence of gene flow. The theory underlying the detection of hybridization using these site pattern probabilities exploits the relationship between the coalescent process for gene trees within population trees and the process of mutation along the branches of the gene trees. For certain models, site patterns are predicted to occur in equal frequency (i.e., their difference is 0), producing a set of functions called phylogenetic invariants. In this article, we introduce HyDe, a software package for detecting hybridization using phylogenetic invariants arising under the coalescent model with hybridization. HyDe is written in Python and can be used interactively or through the command line using pre-packaged scripts. We demonstrate the use of HyDe on simulated data, as well as on two empirical data sets from the literature. We focus in particular on identifying individual hybrids within population samples and on distinguishing between hybrid speciation and gene flow. HyDe is freely available as an open source Python package under the GNU GPL v3 on both GitHub (https://github.com/pblischak/HyDe) and the Python Package Index (PyPI: https://pypi.python.org/pypi/phyde).
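The idea of comparing site pattern frequencies whose expected difference is 0 can be illustrated with a minimal sketch. This is not HyDe's implementation or API: the function names `site_pattern_counts` and `pattern_differences`, the pattern labels `xxyy`, `xyxy`, and `xyyx`, and the toy sequences are all hypothetical and chosen only for illustration. The sketch counts two-state site patterns across four aligned sequences and reports the differences between pattern counts. Turning those differences into a test statistic would require the model described in the article.

```python
from collections import Counter

VALID_BASES = set("ACGT")


def site_pattern_counts(seqs):
    """Count two-state site patterns across four aligned sequences.

    Hypothetical helper for illustration only (not part of HyDe).
    Columns containing anything other than A, C, G, or T are skipped.
    """
    if len(seqs) != 4 or len({len(s) for s in seqs}) != 1:
        raise ValueError("expected four aligned sequences of equal length")
    counts = Counter({"xxyy": 0, "xyxy": 0, "xyyx": 0})
    for a, b, c, d in zip(*seqs):
        if not {a, b, c, d} <= VALID_BASES:
            continue  # skip gaps and ambiguity codes
        if a == b and c == d and a != c:
            counts["xxyy"] += 1
        elif a == c and b == d and a != b:
            counts["xyxy"] += 1
        elif a == d and b == c and a != b:
            counts["xyyx"] += 1
    return counts


def pattern_differences(counts):
    """Differences between pattern counts, using xyxy as the reference.

    Which differences are expected to be 0 depends on the model and on
    the taxon order; this only shows the bookkeeping.
    """
    return {
        "xxyy - xyxy": counts["xxyy"] - counts["xyxy"],
        "xyyx - xyxy": counts["xyyx"] - counts["xyxy"],
    }


# Toy alignment (made-up data): one string per taxon, one column per site.
toy_seqs = ["AAAAAA", "ACCAAN", "CACCAC", "CCACAC"]
toy_counts = site_pattern_counts(toy_seqs)
toy_diffs = pattern_differences(toy_counts)
print(dict(toy_counts))
print(toy_diffs)
```

For real analyses, the HyDe package itself should be used, following the documentation in its GitHub repository.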

SUBMITTER: Blischak PD 

PROVIDER: S-EPMC6454532 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

Publications

HyDe: A Python Package for Genome-Scale Hybridization Detection.

Blischak PD, Chifman J, Wolfe AD, Kubatko LS

Systematic Biology, 2018-09-01, issue 5


Similar Datasets

| S-EPMC6006996 | biostudies-literature
| S-EPMC10833567 | biostudies-literature
| S-EPMC8685811 | biostudies-literature
| S-EPMC4837986 | biostudies-literature
| S-EPMC7597035 | biostudies-literature
| S-EPMC8168212 | biostudies-literature
| S-EPMC8275978 | biostudies-literature
| S-EPMC8138882 | biostudies-literature
| S-EPMC10997433 | biostudies-literature
| S-EPMC3364938 | biostudies-literature