Dataset Information

dbVOR: a database system for importing pedigree, phenotype and genotype data and exporting selected subsets.


ABSTRACT: When studying the genetics of a human trait, we typically have to manage both genome-wide and targeted genotype data. People and markers can overlap between genotyping experiments, and the overlap can introduce several kinds of problems. Most of the time the overlapping genotypes are the same, but sometimes they differ. Occasionally, the lab will return genotypes using a different allele labeling scheme (for example 1/2 vs A/C). Sometimes, the genotype for a person/marker index is unreliable or missing. Further, over time some markers are merged and bad samples are re-run under a different sample name. We need a consistent picture of the subset of data we have chosen to work with, even though multiple data sources may contain conflicting measurements. We have developed the dbVOR database, which is designed to hold data efficiently for both genome-wide and targeted experiments. The data are indexed for fast retrieval by person and marker. In addition, we store pedigree and phenotype data for our subjects. The dbVOR database allows us to select subsets of the data by several different criteria and to merge their results into a coherent and consistent whole. Data may be filtered by family, person, trait value, markers, chromosomes, and chromosome ranges. The results can be presented in columnar, Mega2, or PLINK format. dbVOR serves our needs well. It is freely available from https://watson.hgen.pitt.edu/register. Documentation for dbVOR can be found at https://watson.hgen.pitt.edu/register/docs/dbvor.html.
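
The reconciliation problem described in the abstract can be illustrated with a minimal Python sketch. It is not dbVOR's implementation: the record layout `(source, person, marker, genotype)`, the function names `normalize` and `merge_calls`, the per-source `allele_maps`, and the use of `None` for a missing call are all assumptions made for illustration. The sketch relabels alleles to a common scheme, treats genotypes as unordered, ignores missing calls, and flags a person/marker pair as a conflict when sources disagree.

```python
from collections import defaultdict


def normalize(genotype, allele_map=None):
    """Return an unordered genotype as a sorted tuple, or None if missing.

    allele_map (hypothetical) translates a source's labels, e.g. 1/2 to A/C.
    """
    if genotype is None:
        return None
    a, b = genotype
    if allele_map:
        a, b = allele_map.get(a, a), allele_map.get(b, b)
    return tuple(sorted((a, b)))


def merge_calls(records, allele_maps=None):
    """Merge (source, person, marker, genotype) records from several sources.

    Returns (merged, conflicts). merged maps (person, marker) to one agreed
    genotype, or None when every call is missing or the sources disagree.
    conflicts maps (person, marker) to the distinct calls that disagreed.
    """
    allele_maps = allele_maps or {}
    seen = defaultdict(set)
    for source, person, marker, genotype in records:
        call = normalize(genotype, allele_maps.get((source, marker)))
        seen[(person, marker)]  # touch the key so all-missing pairs are kept
        if call is not None:
            seen[(person, marker)].add(call)

    merged, conflicts = {}, {}
    for key, calls in seen.items():
        if len(calls) == 1:
            merged[key] = next(iter(calls))
        else:
            merged[key] = None
            if calls:  # more than one distinct call: a genuine conflict
                conflicts[key] = sorted(calls)
    return merged, conflicts


# Illustrative data only: two hypothetical sources, one using 1/2 labels.
records = [
    ("gwas", "P1", "rs1", ("A", "C")),
    ("targeted", "P1", "rs1", ("2", "1")),
    ("gwas", "P2", "rs1", ("A", "A")),
    ("targeted", "P2", "rs1", ("A", "C")),
    ("gwas", "P3", "rs1", None),
]
allele_maps = {("targeted", "rs1"): {"1": "A", "2": "C"}}
merged, conflicts = merge_calls(records, allele_maps)
```

In this example P1 agrees across sources once the 1/2 labels are translated, P2 is reported as a conflict, and P3 stays missing. Setting a conflicting pair to missing is only one possible policy; a real system might instead prefer a trusted source.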

SUBMITTER: Baron RV 

PROVIDER: S-EPMC4407391 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

Publications

dbVOR: a database system for importing pedigree, phenotype and genotype data and exporting selected subsets.

Baron RV, Conley YP, Gorin MB, Weeks DE

BMC Bioinformatics, 2015 Mar 18


Similar Datasets

| S-EPMC2790299 | biostudies-literature
| S-EPMC5048068 | biostudies-literature
| S-EPMC2860197 | biostudies-literature
| S-EPMC5241210 | biostudies-literature
| S-EPMC6906681 | biostudies-literature
| S-EPMC4326710 | biostudies-literature
| S-EPMC7779071 | biostudies-literature
| S-EPMC4152273 | biostudies-literature
| PRJEB48275 | ENA
| S-EPMC2699508 | biostudies-literature