Dataset Information

Theme discovery from gene lists for identification and viewing of multiple functional groups.


ABSTRACT:

Background

High-throughput methods of the genome era produce vast amounts of data in the form of gene lists. These lists are large and difficult to interpret without advanced computational or bioinformatic tools. Most existing methods analyse a gene list as a single entity, although it comprises multiple gene groups associated with separate biological functions. It is therefore imperative to define and visualize the gene groups with distinct functions within a gene list.

Results

To analyse the functional heterogeneity within a gene list, we have developed a method that clusters genes into groups with homogeneous functions. The method uses Non-negative Matrix Factorization (NMF) to create several clustering results with varying numbers of clusters. The obtained clustering results are combined into a simple graphical presentation showing the functional groups over-represented in the analysed gene list. We demonstrate its performance on two data sets and show results that improve upon existing methods. The comparison also shows that our method creates a simpler view that aids the discovery of biological themes within the list and discards less informative classes from the results.
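
The clustering step described above can be illustrated with a minimal sketch. It is not the authors' software: it assumes the gene list is encoded as a non-negative gene-by-annotation-class matrix, uses the standard Lee-Seung multiplicative updates for NMF, and assigns each gene to the factor with the largest weight, which is a common convention rather than something stated in the abstract. The function names (`nmf`, `cluster_genes`, `cluster_over_range`) and the toy matrix are hypothetical. How the clusterings are then combined into the graphical presentation is not specified here and is not attempted.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def nmf(v, k, iterations=500, seed=0, eps=1e-9):
    """Approximate non-negative v (genes x classes) as w @ h with rank k.

    Uses Lee-Seung multiplicative updates for the squared error.
    Assumption: this is a generic NMF variant, not necessarily the
    one used in the paper.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iterations):
        # Update h: h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(m)]
             for a in range(k)]
        # Update w: w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(n)]
    return w, h


def cluster_genes(v, k, **kwargs):
    """Assign each gene (row) to the factor with the largest weight in w.

    Assumption: argmax assignment is an illustrative convention.
    """
    w, _ = nmf(v, k, **kwargs)
    return [max(range(k), key=lambda a: row[a]) for row in w]


def cluster_over_range(v, ks, **kwargs):
    """Produce one clustering per requested number of clusters."""
    return {k: cluster_genes(v, k, **kwargs) for k in ks}


# Toy data (made up): 6 genes x 4 annotation classes, 1 = gene annotated.
# Genes 0-2 share classes 0-1; genes 3-5 share classes 2-3.
toy = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

results = cluster_over_range(toy, ks=[2, 3])
for k, labels in results.items():
    print(k, labels)
```

Cluster labels are arbitrary integers, so only the grouping matters, not which number a group receives. With more clusters than the data supports (here, three), some clusters may be empty or a coherent group may be split, which is one reason for comparing results across several cluster counts.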

Conclusion

The presented method and associated software are useful for identifying and interpreting the biological functions associated with gene lists, and are especially suited to the analysis of large lists.

SUBMITTER: Pehkonen P 

PROVIDER: S-EPMC1190153 | biostudies-literature | 2005 Jun

REPOSITORIES: biostudies-literature

Publications

Theme discovery from gene lists for identification and viewing of multiple functional groups.

Pehkonen P, Wong G, Törönen P

BMC Bioinformatics, 2005-06-29


Similar Datasets

S-EPMC8301326 | biostudies-literature
S-EPMC4287672 | biostudies-literature
S-EPMC9252805 | biostudies-literature
S-EPMC2615629 | biostudies-literature
S-EPMC4987867 | biostudies-literature
S-EPMC2949900 | biostudies-literature
S-EPMC5493535 | biostudies-other
S-EPMC6983382 | biostudies-literature
S-EPMC2375021 | biostudies-literature
S-EPMC7817695 | biostudies-literature