
Dataset Information


Imputing missing distances in molecular phylogenetics.


ABSTRACT: Missing data are frequently encountered in molecular phylogenetics, but no accurate distance imputation method has been available for distance-based phylogenetic reconstruction. The general framework for distance imputation is to explore tree space and distance values to find an optimal combination of output tree and imputed distances. Here I develop a least-squares method coupled with multivariate optimization to impute multiple missing distances in a distance matrix, or from a set of aligned sequences with missing genes in which some sequences share no homologous sites (so their distances need to be imputed). I show that phylogenetic trees can be inferred from distance matrices with about 10% of distances missing, and that the resulting tree is almost as accurate as the tree inferred from full information. The new method has two advantages over a recently published one: it does not assume a molecular clock, and it is more accurate (comparable to the maximum likelihood method, based on simulated sequences). I have implemented the function in the DAMBE software, which is freely available at http://dambe.bio.uottawa.ca.
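The least-squares idea in the abstract can be illustrated with a minimal sketch. This is not the DAMBE implementation: it assumes a single fixed 5-taxon topology, ((A,B),C,(D,E)), whereas the method described above also explores tree space, and it places no constraint on branch-length sign. All names here (`BRANCHES`, `PATHS`, `solve_linear`, `impute_missing`) and the example branch lengths are hypothetical. For a fixed topology, the missing distance that minimizes the squared error is the path length on the tree fitted to the observed distances, which is what the sketch computes.

```python
# Assumed (hypothetical) topology ((A,B),C,(D,E)): five terminal branches
# a..e and two internal branches "ab" and "de".
BRANCHES = ["a", "b", "c", "d", "e", "ab", "de"]

# Branches on the path between each pair of taxa under that topology.
PATHS = {
    ("A", "B"): ["a", "b"],
    ("A", "C"): ["a", "ab", "c"],
    ("A", "D"): ["a", "ab", "de", "d"],
    ("A", "E"): ["a", "ab", "de", "e"],
    ("B", "C"): ["b", "ab", "c"],
    ("B", "D"): ["b", "ab", "de", "d"],
    ("B", "E"): ["b", "ab", "de", "e"],
    ("C", "D"): ["c", "de", "d"],
    ("C", "E"): ["c", "de", "e"],
    ("D", "E"): ["d", "e"],
}


def solve_linear(matrix, rhs):
    """Solve a square linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def impute_missing(distances):
    """Fit branch lengths to observed distances; return imputed missing ones.

    `distances` maps a taxon pair to a float, or to None when missing.
    """
    observed = [(p, d) for p, d in distances.items() if d is not None]
    # Design matrix: one row per observed pair, 1.0 where a branch is on the path.
    rows = [[1.0 if b in PATHS[p] else 0.0 for b in BRANCHES] for p, _ in observed]
    y = [d for _, d in observed]
    n = len(BRANCHES)
    # Normal equations (X^T X) beta = X^T y for ordinary least squares.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    xty = [sum(r[i] * v for r, v in zip(rows, y)) for i in range(n)]
    lengths = dict(zip(BRANCHES, solve_linear(xtx, xty)))
    # Imputed distance = sum of fitted branch lengths along the pair's path.
    return {p: sum(lengths[b] for b in PATHS[p])
            for p, d in distances.items() if d is None}


# Made-up branch lengths, used only to generate an additive example matrix.
TRUE = {"a": 0.1, "b": 0.2, "c": 0.3, "d": 0.15, "e": 0.25, "ab": 0.05, "de": 0.1}
distances = {p: sum(TRUE[b] for b in path) for p, path in PATHS.items()}
distances[("A", "E")] = None  # pretend A and E share no homologous sites

imputed = impute_missing(distances)
print(imputed)
```

Because the example matrix is exactly additive, the imputed A-E distance should come out close to the true path length of 0.5. With real data, a fuller treatment would repeat the fit over candidate topologies and keep the tree with the smallest residual sum of squares.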

SUBMITTER: Xia X 

PROVIDER: S-EPMC6063210 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature


Publications

Imputing missing distances in molecular phylogenetics.

Xuhua Xia

PeerJ, 2018-07-24



Similar Datasets

| S-EPMC7274349 | biostudies-literature
2012-01-11 | PRD000375 | Pride
| S-EPMC8159923 | biostudies-literature
| S-EPMC8059005 | biostudies-literature
| S-EPMC8357088 | biostudies-literature
| S-EPMC8580266 | biostudies-literature
| S-EPMC8596520 | biostudies-literature
| S-EPMC7350980 | biostudies-literature
| S-EPMC1951527 | biostudies-literature
| S-EPMC5980335 | biostudies-literature