Dataset Information

A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees.


ABSTRACT: The use of dense single nucleotide polymorphism (SNP) data in genetic linkage analysis of large pedigrees is impeded by significant technical, methodological and computational challenges. Here we describe Superlink-Online SNP, a new powerful online system that streamlines the linkage analysis of SNP data. It features a fully integrated flexible processing workflow comprising both well-known and novel data analysis tools, including SNP clustering, erroneous data filtering, exact and approximate LOD calculations and maximum-likelihood haplotyping. The system draws its power from thousands of CPUs, performing data analysis tasks orders of magnitude faster than a single computer. By providing an intuitive interface to sophisticated state-of-the-art analysis tools coupled with high computing capacity, Superlink-Online SNP helps geneticists unleash the potential of SNP data for detecting disease genes.

Computations performed by Superlink-Online SNP are automatically parallelized using novel paradigms, and executed on an unlimited number of private or public CPUs. One novel service is large-scale approximate Markov Chain-Monte Carlo (MCMC) analysis. The accuracy of the results is reliably estimated by running the same computation on multiple CPUs and evaluating the Gelman-Rubin Score to set aside unreliable results. Another service within the workflow is a novel parallelized exact algorithm for inferring maximum-likelihood haplotyping. The reported system enables genetic analyses that were previously infeasible. We demonstrate the system capabilities through a study of a large complex pedigree affected with metabolic syndrome.

Superlink-Online SNP is freely available for researchers at http://cbl-hap.cs.technion.ac.il/superlink-snp. The system source code can also be downloaded from the system website.

Contact: omerw@cs.technion.ac.il

Supplementary data are available at Bioinformatics online.
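
The abstract mentions running the same MCMC computation on multiple CPUs and using the Gelman-Rubin score to set aside unreliable results. The sketch below illustrates the standard Gelman-Rubin potential scale reduction factor for several equal-length chains of a scalar quantity. It is not the Superlink-Online SNP implementation: the function names `gelman_rubin` and `is_reliable` and the cutoff of 1.1 are assumptions chosen for illustration (1.1 is a common rule of thumb, not a value stated in the source).

```python
from statistics import mean, variance


def gelman_rubin(chains):
    """Return the Gelman-Rubin potential scale reduction factor.

    `chains` is a list of equal-length lists, one per independent MCMC run,
    each holding samples of the same scalar quantity.
    """
    m = len(chains)
    if m < 2:
        raise ValueError("need at least two chains")
    n = len(chains[0])
    if n < 2 or any(len(c) != n for c in chains):
        raise ValueError("chains must have equal length of at least 2")

    chain_means = [mean(c) for c in chains]
    # W: average of the within-chain sample variances.
    w = mean([variance(c) for c in chains])
    # B: between-chain variance, scaled by the chain length.
    b = n * variance(chain_means)
    if w == 0:
        raise ValueError("within-chain variance is zero")

    # Pooled estimate of the posterior variance.
    var_hat = (n - 1) / n * w + b / n
    return (var_hat / w) ** 0.5


def is_reliable(chains, threshold=1.1):
    """Hypothetical filter: accept a result only if the chains agree.

    The threshold is an assumed, conventional cutoff.
    """
    return gelman_rubin(chains) < threshold
```

Values close to 1 suggest that the independent runs agree with one another; values well above 1 suggest that the chains have not converged, so the corresponding result would be set aside.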

SUBMITTER: Silberstein M 

PROVIDER: S-EPMC3546794 | biostudies-literature | 2013 Jan

REPOSITORIES: biostudies-literature

Publications

A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees.

Mark Silberstein, Omer Weissbrod, Lars Otten, Anna Tzemach, Andrei Anisenia, Oren Shtark, Dvir Tuberg, Eddie Galfrin, Irena Gannon, Adel Shalata, Zvi U. Borochowitz, Rina Dechter, Elizabeth Thompson, Dan Geiger

Bioinformatics (Oxford, England), 2012-11-18, issue 2


Similar Datasets

| S-EPMC2680848 | biostudies-other
| S-EPMC3290352 | biostudies-literature
| S-EPMC4237330 | biostudies-literature
| S-EPMC4143705 | biostudies-literature
| S-EPMC4879118 | biostudies-literature
| S-EPMC5079601 | biostudies-literature
| S-EPMC10898342 | biostudies-literature
| S-EPMC379157 | biostudies-other
| S-EPMC5408770 | biostudies-literature
| S-EPMC4023913 | biostudies-literature