Dataset Information

Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals.


ABSTRACT:

Background

Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it is likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related.
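The assignment procedure described above can be sketched in a few lines of Python. This is an illustrative reading of the rule as stated in the abstract, not the authors' code: the function names `cpg_sites` and `assign_cpg_classes` are hypothetical, and the sketch assumes an ungapped pairwise alignment of equal-length sequences.

```python
def cpg_sites(seq):
    """Return the set of 0-based positions that lie inside a CG dinucleotide."""
    seq = seq.upper()
    sites = set()
    for i in range(len(seq) - 1):
        if seq[i] == "C" and seq[i + 1] == "G":
            sites.update((i, i + 1))
    return sites


def assign_cpg_classes(seq_a, seq_b):
    """Label each aligned site 'CpG' or 'non-CpG'.

    A site is labelled 'CpG' if it is part of a CG dinucleotide in
    EITHER sequence (the presence/absence rule). Assumes the two
    sequences are already aligned, ungapped, and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    cpg = cpg_sites(seq_a) | cpg_sites(seq_b)
    return ["CpG" if i in cpg else "non-CpG" for i in range(len(seq_a))]


if __name__ == "__main__":
    # Only the first sequence carries a CG, yet both of its sites are labelled CpG.
    print(assign_cpg_classes("ACGT", "ATGT"))
```

Because the rule takes the union over both sequences, a CpG that is present in only one of them is enough to place its two sites in the CpG class.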

Results

Using simulations of DNA sequence evolution, we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because in a two-branch tree a minimum of two substitutions is required to misassign a true ancestral CpG site as non-CpG, whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition, such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias.
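The asymmetry between the two kinds of misassignment can be made concrete with a small worked example. The sketch below is not the authors' simulation; it simply applies the presence/absence rule to three hand-picked dinucleotide histories on a two-branch tree. The helper names `has_cpg` and `assigned_as_cpg` and the example sequences are hypothetical.

```python
def has_cpg(dinuc):
    """True if the dinucleotide is CG."""
    return dinuc.upper() == "CG"


def assigned_as_cpg(desc_a, desc_b):
    """Presence/absence rule: CpG if either descendant shows CG."""
    return has_cpg(desc_a) or has_cpg(desc_b)


def n_substitutions(ancestor, desc_a, desc_b):
    """Count base differences between the ancestor and both descendants."""
    return sum(x != y for d in (desc_a, desc_b) for x, y in zip(ancestor, d))


# (ancestor, descendant A, descendant B); all sequences are illustrative.
cases = [
    ("TG", "CG", "TG"),  # non-CpG ancestor, one substitution creates a CG
    ("CG", "TG", "CG"),  # CpG ancestor, one substitution: CG survives in B
    ("CG", "TG", "CA"),  # CpG ancestor, two substitutions: CG lost in both
]

for ancestor, a, b in cases:
    truth = "CpG" if has_cpg(ancestor) else "non-CpG"
    called = "CpG" if assigned_as_cpg(a, b) else "non-CpG"
    subs = n_substitutions(ancestor, a, b)
    print(f"ancestor {ancestor} ({truth}), {subs} substitution(s) -> called {called}")
```

In the first case a single substitution is enough for a true non-CpG site to be called CpG. In the second and third cases the CpG ancestor is only miscalled as non-CpG once both lineages have lost the dinucleotide, which takes at least two substitutions.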

Conclusion

Our study demonstrates that a common method of assigning sites into CpG and non-CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment.

SUBMITTER: Gaffney DJ 

PROVIDER: S-EPMC2576242 | biostudies-literature | 2008 Sep

REPOSITORIES: biostudies-literature

Publications

Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals.

Daniel J Gaffney, Peter D Keightley

BMC Evolutionary Biology, 2008-09-30



Similar Datasets

| S-EPMC2847748 | biostudies-literature
| S-EPMC5080320 | biostudies-literature
| S-EPMC4512549 | biostudies-literature
| S-EPMC34419 | biostudies-literature
2022-06-24 | GSE206526 | GEO
| S-EPMC5604096 | biostudies-literature
| S-EPMC3887100 | biostudies-literature
| S-EPMC8495791 | biostudies-literature
| S-EPMC2674423 | biostudies-literature
| S-EPMC7462845 | biostudies-literature