Unknown

Dataset Information

0

CORAZON: a web server for data normalization and unsupervised clustering based on expression profiles.


ABSTRACT: OBJECTIVE:Data normalization and clustering are mandatory steps in gene expression and downstream analyses, respectively. However, user-friendly implementations of these methodologies are available exclusively under expensive licensing agreements, or in stand-alone scripts developed, reflecting on a great obstacle for users with less computational skills. RESULTS:We developed an online tool called CORAZON (Correlations Analyses Zipper Online), which implements three unsupervised learning methods to cluster gene expression datasets in a friendly environment. It allows the usage of eight gene expression normalization/transformation methodologies and the attribute's influence. The normalizations requiring the gene length only could be performed to RNA-seq, meanwhile the others can be used with microarray and/or NanoString data. Clustering methodologies performances were evaluated through five models with accuracies between 92 and 100%. We applied our tool to obtain functional insights of non-coding RNAs (ncRNAs) based on Gene Ontology enrichment of clusters in a dataset generated by the ENCODE project. The clusters where the majority of transcripts are coding genes were enriched in Cellular, Metabolic, Transports, and Systems Development categories. Meanwhile, the ncRNAs were enriched in the Detection of Stimulus, Sensory Perception, Immunological System, and Digestion categories. CORAZON source-code is freely available at https://gitlab.com/integrativebioinformatics/corazon and the web-server can be accessed at http://corazon.integrativebioinformatics.me .

SUBMITTER: Ramos TAR 

PROVIDER: S-EPMC7359491 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

CORAZON: a web server for data normalization and unsupervised clustering based on expression profiles.

Ramos Thaís A R TAR   Maracaja-Coutinho Vinicius V   Ortega J Miguel JM   do Rêgo Thaís G TG  

BMC research notes 20200714 1


<h4>Objective</h4>Data normalization and clustering are mandatory steps in gene expression and downstream analyses, respectively. However, user-friendly implementations of these methodologies are available exclusively under expensive licensing agreements, or in stand-alone scripts developed, reflecting on a great obstacle for users with less computational skills.<h4>Results</h4>We developed an online tool called CORAZON (Correlations Analyses Zipper Online), which implements three unsupervised l  ...[more]

Similar Datasets

| S-EPMC6602472 | biostudies-literature
| S-EPMC7319577 | biostudies-literature
| S-EPMC4987925 | biostudies-literature
| S-EPMC8535304 | biostudies-literature
| S-EPMC1160230 | biostudies-literature
| S-EPMC3125752 | biostudies-literature
| S-EPMC7050097 | biostudies-literature
| S-EPMC6722089 | biostudies-literature
| S-EPMC5570229 | biostudies-literature
| S-EPMC10192332 | biostudies-literature