Unknown

Dataset Information

0

MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays.


ABSTRACT: Affymetrix has recently developed whole-transcript GeneChips-'Gene' and 'Exon' arrays-which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated estimates. We propose a method, Multi-Mapping Bayesian Gene eXpression (MMBGX), which disaggregates the signal at 'multi-match' probes. When applied to Gene arrays, MMBGX removes the upward bias of gene-level expression estimates. When applied to Exon arrays, it can further disaggregate the signal between alternative transcripts of the same gene, providing expression estimates of individual splice variants. We demonstrate the performance of MMBGX on simulated data and a tissue mixture data set. We then show that MMBGX can estimate the expression of alternative isoforms within one experimental condition, confirming our results by RT-PCR. Finally, we show that our method for detecting differential splicing has a lower error rate than standard exon-level approaches on a previously validated colon cancer data set.

SUBMITTER: Turro E 

PROVIDER: S-EPMC2800219 | biostudies-literature | 2010 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays.

Turro Ernest E   Lewin Alex A   Rose Anna A   Dallman Margaret J MJ   Richardson Sylvia S  

Nucleic acids research 20091023 1


Affymetrix has recently developed whole-transcript GeneChips-'Gene' and 'Exon' arrays-which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated esti  ...[more]

Similar Datasets

| S-EPMC2703633 | biostudies-literature
| S-EPMC2585104 | biostudies-literature
| S-EPMC2728531 | biostudies-literature
| S-EPMC1316121 | biostudies-literature
| S-EPMC2841991 | biostudies-literature
| S-EPMC3012675 | biostudies-literature
| S-EPMC7156646 | biostudies-literature
| S-EPMC5896116 | biostudies-literature
| S-EPMC1913542 | biostudies-literature
| S-EPMC3091630 | biostudies-literature