
Dataset Information


A multiple coefficient of determination-based method for parsing SNPs that correlate with mRNA expression.


ABSTRACT: In this study, we present a novel, multiple coefficient of determination (R<sup>2</sup><sub>M</sub>)-based method for parsing SNPs located within the chromosomal neighborhood of a gene into semi-independent families, each of which corresponds to one or more functional variants that regulate transcription of the gene. Specifically, our method utilizes a matrix equation framework to calculate R<sup>2</sup><sub>M</sub> values for SNPs within a chromosome region of interest (ROI) based upon the choices of 1-4 "index" SNPs (iSNPs) that serve as proxies for underlying regulatory variants. Exhaustive testing of sets of 1-4 candidate iSNPs identifies iSNP models that best account for estimated R<sup>2</sup> values derived from single-variable linear regression analysis of correlations between mRNA expression and genotypes of individual SNPs. Subsequent genotype-based estimation of pairwise r<sup>2</sup> linkage disequilibrium (LD) coefficients between each iSNP and the other ROI SNPs allows the SNPs to be parsed into semi-independent families. Analysis of mRNA expression and genotype data downloaded from the Gene Expression Omnibus (GEO) and the database of Genotypes and Phenotypes (dbGaP) demonstrates the usefulness of this method for parsing SNPs based on experimental data. We believe that this method will be widely applicable for analyzing the genetic basis of mRNA expression and for visualizing the contributions of multiple genetic variants to the regulation of individual genes.
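The quantities named in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes genotypes are coded as 0/1/2 allele dosages, uses the textbook identity that the multiple coefficient of determination equals c^T Rxx^-1 c (c: correlations between the response and the predictors, Rxx: correlations among the predictors), and estimates pairwise LD r2 as the squared Pearson correlation of dosages. The search criterion is a deliberate simplification: candidate iSNP sets of one fixed size are ranked by the R2M of expression regressed on the set, whereas the abstract describes fitting the per-SNP single-variable R2 values. All function names, the `min_r2` threshold, and the toy data are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.

    Assumes neither sequence is constant (a monomorphic SNP divides by zero).
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting.

    Assumes A is non-singular, i.e. no predictor is an exact linear
    combination of the others.
    """
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def multiple_r2(target, predictors):
    """Multiple coefficient of determination: c^T Rxx^-1 c."""
    c = [pearson(target, p) for p in predictors]
    rxx = [[pearson(p, q) for q in predictors] for p in predictors]
    w = solve(rxx, c)
    return sum(ci * wi for ci, wi in zip(c, w))


def best_isnp_set(expression, genotypes, k):
    """Exhaustively score every k-SNP set (simplified criterion, see lead-in)."""
    scored = (
        (multiple_r2(expression, [genotypes[s] for s in combo]), combo)
        for combo in combinations(sorted(genotypes), k)
    )
    r2, combo = max(scored)
    return combo, r2


def parse_families(genotypes, isnps, min_r2=0.2):
    """Assign each non-index SNP to the iSNP with which it has the highest LD r2.

    SNPs whose best r2 falls below min_r2 (a hypothetical cutoff) stay unassigned.
    """
    families = {i: [] for i in isnps}
    for snp in sorted(genotypes):
        if snp in isnps:
            continue
        r2, index = max(
            (pearson(genotypes[snp], genotypes[i]) ** 2, i) for i in isnps
        )
        if r2 >= min_r2:
            families[index].append(snp)
    return families


# Toy data (invented for illustration): eight samples, four SNPs.
genotypes = {
    "snpA": [0, 0, 1, 1, 2, 2, 0, 1],
    "snpB": [0, 0, 1, 1, 2, 2, 0, 2],
    "snpC": [0, 1, 0, 1, 0, 1, 2, 2],
    "snpD": [0, 1, 0, 1, 0, 2, 2, 2],
}
expression = [a + 0.5 * c for a, c in zip(genotypes["snpA"], genotypes["snpC"])]

best, best_r2 = best_isnp_set(expression, genotypes, k=2)
families = parse_families(genotypes, best)
print(best, round(best_r2, 3), families)
```

Because R2M cannot decrease when a predictor is added, this sketch compares only sets of equal size; choosing between models with different numbers of iSNPs would need an additional criterion that is not specified here.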

SUBMITTER: Song F 

PROVIDER: S-EPMC6934451 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature


Publications

A multiple coefficient of determination-based method for parsing SNPs that correlate with mRNA expression.

Fan Song, Yu Tao, Yue Sun, David Saffen

Scientific Reports, 2019 Dec 27, issue 1



Similar Datasets

S-EPMC2851566 | biostudies-literature
S-EPMC2956011 | biostudies-literature
S-EPMC2874662 | biostudies-other
S-EPMC5446698 | biostudies-literature
S-EPMC4696787 | biostudies-literature
S-EPMC3112071 | biostudies-literature
S-EPMC3036059 | biostudies-literature
S-EPMC2863064 | biostudies-literature
S-EPMC7611212 | biostudies-literature
S-EPMC4664237 | biostudies-literature