Unknown

Dataset Information

0

Efficient approach to unique single-nucleotide polymorphism discovery.


ABSTRACT: Single-nucleotide polymorphisms (SNPs) are the most frequently found DNA sequence variations in the human genome. It has been argued that a dense set of SNP markers can be used to identify genetic factors associated with complex disease traits. Because all high-throughput genotyping methods require precise sequence knowledge of the SNPs, any SNP discovery approach must involve both the determination of DNA sequence and allele frequencies. Furthermore, high-throughput genotyping also requires a genomic DNA amplification step, making it necessary to develop sequence-tagged sites (STSs) that amplify only the DNA fragment containing the SNP and nothing else from the rest of the genome. In this report, we demonstrate the utility of a SNP-screening approach that yields the DNA sequence and allele frequency information while screening out duplications with minimal cost and effort. Our approach is based on the use of a homozygous complete hydatidiform mole (CHM) as the reference. With this homozygous reference, one can identify and estimate the allele frequencies of common SNPs with a pooled DNA-sequencing approach (rather than having to sequence numerous individuals as is commonly done). More importantly, the CHM reference is preferable to a single individual reference because it reveals readily any duplicated regions of the genome amplified by the PCR assay before the duplicated sequences are found in GenBank. This approach reduces the cost of SNP discovery by 60% and eliminates the costly development of SNP markers that cannot be amplified uniquely from the genome.

SUBMITTER: Taillon-Miller P 

PROVIDER: S-EPMC310784 | biostudies-literature | 1999 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient approach to unique single-nucleotide polymorphism discovery.

Taillon-Miller P P   Piernot E E EE   Kwok P Y PY  

Genome research 19990501 5


Single-nucleotide polymorphisms (SNPs) are the most frequently found DNA sequence variations in the human genome. It has been argued that a dense set of SNP markers can be used to identify genetic factors associated with complex disease traits. Because all high-throughput genotyping methods require precise sequence knowledge of the SNPs, any SNP discovery approach must involve both the determination of DNA sequence and allele frequencies. Furthermore, high-throughput genotyping also requires a g  ...[more]

Similar Datasets

| S-EPMC1318490 | biostudies-literature
| S-EPMC521117 | biostudies-literature
| S-EPMC3128068 | biostudies-literature
| S-EPMC3146956 | biostudies-literature
| S-EPMC5340323 | biostudies-literature
| S-EPMC5325534 | biostudies-literature
| S-EPMC6892549 | biostudies-literature
| S-EPMC2745763 | biostudies-literature
| S-EPMC5015895 | biostudies-literature
| S-EPMC5817121 | biostudies-literature