Unknown

Dataset Information

0

Using BioBin to explore rare variant population stratification.


ABSTRACT: Rare variants (RVs) will likely explain additional heritability of many common complex diseases; however, the natural frequencies of rare variation across and between human populations are largely unknown. We have developed a powerful, flexible collapsing method called BioBin that utilizes prior biological knowledge using multiple publicly available database sources to direct analyses. Variants can be collapsed according to functional regions, evolutionary conserved regions, regulatory regions, genes, and/or pathways without the need for external files. We conducted an extensive comparison of rare variant burden differences (MAF < 0.03) between two ancestry groups from 1000 Genomes Project data, Yoruba (YRI) and European descent (CEU) individuals. We found that 56.86% of gene bins, 72.73% of intergenic bins, 69.45% of pathway bins, 32.36% of ORegAnno annotated bins, and 9.10% of evolutionary conserved regions (shared with primates) have statistically significant differences in RV burden. Ongoing efforts include examining additional regional characteristics using regulatory regions and protein binding domains. Our results show interesting variant differences between two ancestral populations and demonstrate that population stratification is a pervasive concern for sequence analyses.

SUBMITTER: Moore CB 

PROVIDER: S-EPMC3638724 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using BioBin to explore rare variant population stratification.

Moore Carrie B CB   Wallace John R JR   Frase Alex T AT   Pendergrass Sarah A SA   Ritchie Marylyn D MD  

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing 20130101


Rare variants (RVs) will likely explain additional heritability of many common complex diseases; however, the natural frequencies of rare variation across and between human populations are largely unknown. We have developed a powerful, flexible collapsing method called BioBin that utilizes prior biological knowledge using multiple publicly available database sources to direct analyses. Variants can be collapsed according to functional regions, evolutionary conserved regions, regulatory regions,  ...[more]

Similar Datasets

| S-EPMC8463695 | biostudies-literature
| S-EPMC3701690 | biostudies-literature
| S-EPMC6283567 | biostudies-other
| S-EPMC5114546 | biostudies-other
| S-EPMC4135410 | biostudies-literature
| S-EPMC3465327 | biostudies-literature
| S-EPMC7077175 | biostudies-literature
| S-EPMC10794901 | biostudies-literature
| S-EPMC4406348 | biostudies-literature
| S-EPMC8219685 | biostudies-literature