Unknown

Dataset Information

0

Identification of Differentially Expressed Genes in RNA-seq Data of Arabidopsis thaliana: A Compound Distribution Approach.


ABSTRACT: Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product, which may be proteins. A gene is declared differentially expressed if an observed difference or change in read counts or expression levels between two experimental conditions is statistically significant. To identify differentially expressed genes between two conditions, it is important to find statistical distributional property of the data to approximate the nature of differential genes. In the present study, the focus is mainly to investigate the differential gene expression analysis for sequence data based on compound distribution model. This approach was applied in RNA-seq count data of Arabidopsis thaliana and it has been found that compound Poisson distribution is more appropriate to capture the variability as compared with Poisson distribution. Thus, fitting of appropriate distribution to gene expression data provides statistically sound cutoff values for identifying differentially expressed genes.

SUBMITTER: Anjum A 

PROVIDER: S-EPMC4827276 | biostudies-other | 2016 Apr

REPOSITORIES: biostudies-other

altmetric image

Publications

Identification of Differentially Expressed Genes in RNA-seq Data of Arabidopsis thaliana: A Compound Distribution Approach.

Anjum Arfa A   Jaggi Seema S   Varghese Eldho E   Lall Shwetank S   Bhowmik Arpan A   Rai Anil A  

Journal of computational biology : a journal of computational molecular cell biology 20160307 4


Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product, which may be proteins. A gene is declared differentially expressed if an observed difference or change in read counts or expression levels between two experimental conditions is statistically significant. To identify differentially expressed genes between two conditions, it is important to find statistical distributional property of the data to approximate the nature of differen  ...[more]

Similar Datasets

| S-EPMC6604381 | biostudies-literature
| S-EPMC5592911 | biostudies-literature
| S-EPMC3906980 | biostudies-literature
| S-EPMC5349981 | biostudies-literature
| S-EPMC9670425 | biostudies-literature
| S-EPMC6284200 | biostudies-literature
| S-EPMC8582999 | biostudies-literature
| S-EPMC9140427 | biostudies-literature
| S-EPMC3488134 | biostudies-literature
| S-EPMC6090096 | biostudies-literature