Unknown

Dataset Information

0

Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function.


ABSTRACT:

Background

A central task in contemporary biosciences is the identification of biological processes showing response in genome-wide differential gene expression experiments. Two types of analysis are common. Either, one generates an ordered list based on the differential expression values of the probed genes and examines the tail areas of the list for over-representation of various functional classes. Alternatively, one monitors the average differential expression level of genes belonging to a given functional class. So far these two types of method have not been combined.

Results

We introduce a scoring function, Gene Set Z-score (GSZ), for the analysis of functional class over-representation that combines two previous analysis methods. GSZ encompasses popular functions such as correlation, hypergeometric test, Max-Mean and Random Sets as limiting cases. GSZ is stable against changes in class size as well as across different positions of the analysed gene list in tests with randomized data. GSZ shows the best overall performance in a detailed comparison to popular functions using artificial data. Likewise, GSZ stands out in a cross-validation of methods using split real data. A comparison of empirical p-values further shows a strong difference in favour of GSZ, which clearly reports better p-values for top classes than the other methods. Furthermore, GSZ detects relevant biological themes that are missed by the other methods. These observations also hold when comparing GSZ with popular program packages.

Conclusion

GSZ and improved versions of earlier methods are a useful contribution to the analysis of differential gene expression. The methods and supplementary material are available from the website http://ekhidna.biocenter.helsinki.fi/users/petri/public/GSZ/GSZscore.html.

SUBMITTER: Toronen P 

PROVIDER: S-EPMC2761411 | biostudies-literature | 2009 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function.

Törönen Petri P   Ojala Pauli J PJ   Marttinen Pekka P   Holm Liisa L  

BMC bioinformatics 20090923


<h4>Background</h4>A central task in contemporary biosciences is the identification of biological processes showing response in genome-wide differential gene expression experiments. Two types of analysis are common. Either, one generates an ordered list based on the differential expression values of the probed genes and examines the tail areas of the list for over-representation of various functional classes. Alternatively, one monitors the average differential expression level of genes belongin  ...[more]

Similar Datasets

| S-EPMC3625125 | biostudies-literature
2022-05-04 | PXD030435 | Pride
2023-07-20 | PXD038367 | Pride
| S-EPMC3025713 | biostudies-literature
2023-07-20 | PXD038394 | Pride
| S-EPMC4710236 | biostudies-literature
| S-EPMC8728032 | biostudies-literature
| S-EPMC7229423 | biostudies-literature
2023-07-20 | PXD038377 | Pride
| S-EPMC2646862 | biostudies-other