Unknown

Dataset Information

0

A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data.


ABSTRACT: Variation in gene expression is thought to make a significant contribution to phenotypic diversity among individuals within populations. Although high-throughput cDNA sequencing offers a unique opportunity to delineate the genome-wide architecture of regulatory variation, new statistical methods need to be developed to capitalize on the wealth of information contained in RNA-seq data sets. To this end, we developed a powerful and flexible hierarchical Bayesian model that combines information across loci to allow both global and locus-specific inferences about allele-specific expression (ASE). We applied our methodology to a large RNA-seq data set obtained in a diploid hybrid of two diverse Saccharomyces cerevisiae strains, as well as to RNA-seq data from an individual human genome. Our statistical framework accurately quantifies levels of ASE with specified false-discovery rates, achieving high reproducibility between independent sequencing platforms. We pinpoint loci that show unusual and biologically interesting patterns of ASE, including allele-specific alternative splicing and transcription termination sites. Our methodology provides a rigorous, quantitative, and high-resolution tool for profiling ASE across whole genomes.

SUBMITTER: Skelly DA 

PROVIDER: S-EPMC3202289 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data.

Skelly Daniel A DA   Johansson Marnie M   Madeoy Jennifer J   Wakefield Jon J   Akey Joshua M JM  

Genome research 20110826 10


Variation in gene expression is thought to make a significant contribution to phenotypic diversity among individuals within populations. Although high-throughput cDNA sequencing offers a unique opportunity to delineate the genome-wide architecture of regulatory variation, new statistical methods need to be developed to capitalize on the wealth of information contained in RNA-seq data sets. To this end, we developed a powerful and flexible hierarchical Bayesian model that combines information acr  ...[more]

Similar Datasets

| S-EPMC4425276 | biostudies-literature
| S-EPMC4642818 | biostudies-literature
| S-EPMC3218220 | biostudies-literature
| S-EPMC5340573 | biostudies-literature
| S-EPMC8594885 | biostudies-literature
| S-EPMC3333886 | biostudies-other
| S-EPMC4423382 | biostudies-literature
| S-EPMC7477012 | biostudies-literature
| S-EPMC8288148 | biostudies-literature
| S-EPMC10200579 | biostudies-literature