Unknown

Dataset Information

0

Inferring phylogenies from RAD sequence data.


ABSTRACT: Reduced-representation genome sequencing represents a new source of data for systematics, and its potential utility in interspecific phylogeny reconstruction has not yet been explored. One approach that seems especially promising is the use of inexpensive short-read technologies (e.g., Illumina, SOLiD) to sequence restriction-site associated DNA (RAD)--the regions of the genome that flank the recognition sites of restriction enzymes. In this study, we simulated the collection of RAD sequences from sequenced genomes of different taxa (Drosophila, mammals, and yeasts) and developed a proof-of-concept workflow to test whether informative data could be extracted and used to accurately reconstruct "known" phylogenies of species within each group. The workflow consists of three basic steps: first, sequences are clustered by similarity to estimate orthology; second, clusters are filtered by taxonomic coverage; and third, they are aligned and concatenated for "total evidence" phylogenetic analysis. We evaluated the performance of clustering and filtering parameters by comparing the resulting topologies with well-supported reference trees and we were able to identify conditions under which the reference tree was inferred with high support. For Drosophila, whole genome alignments allowed us to directly evaluate which parameters most consistently recovered orthologous sequences. For the parameter ranges explored, we recovered the best results at the low ends of sequence similarity and taxonomic representation of loci; these generated the largest supermatrices with the highest proportion of missing data. Applications of the method to mammals and yeasts were less successful, which we suggest may be due partly to their much deeper evolutionary divergence times compared to Drosophila (crown ages of approximately 100 and 300 versus 60 Mya, respectively). RAD sequences thus appear to hold promise for reconstructing phylogenetic relationships in younger clades in which sufficient numbers of orthologous restriction sites are retained across species.

SUBMITTER: Rubin BE 

PROVIDER: S-EPMC3320897 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inferring phylogenies from RAD sequence data.

Rubin Benjamin E R BE   Ree Richard H RH   Moreau Corrie S CS  

PloS one 20120406 4


Reduced-representation genome sequencing represents a new source of data for systematics, and its potential utility in interspecific phylogeny reconstruction has not yet been explored. One approach that seems especially promising is the use of inexpensive short-read technologies (e.g., Illumina, SOLiD) to sequence restriction-site associated DNA (RAD)--the regions of the genome that flank the recognition sites of restriction enzymes. In this study, we simulated the collection of RAD sequences fr  ...[more]

Similar Datasets

| S-EPMC4179140 | biostudies-literature
| S-EPMC2674047 | biostudies-literature
| S-EPMC5889004 | biostudies-literature
| S-EPMC3773407 | biostudies-literature
| S-EPMC3148245 | biostudies-literature
| S-EPMC7044161 | biostudies-literature
| S-EPMC7514434 | biostudies-literature
| S-EPMC10439365 | biostudies-literature
| S-EPMC10776241 | biostudies-literature
| S-EPMC2664478 | biostudies-literature