Unknown

Dataset Information

0

A clustering-based approach for efficient identification of microRNA combinatorial biomarkers.


ABSTRACT: MicroRNAs (miRNAs) have great potential serving as tumor biomarkers and therapeutic targets. As the rapid development of high-throughput experimental technology, gene expression experiments have become more and more specialized and diversified. The complex data structure has brought great challenge for the identification of biomarkers. In the meantime, current statistical and machine learning methods for detecting biomarkers have the problem of low reliability and biased criteria.This study aims to select combinatorial miRNA biomarkers, which have higher sensitivity and specificity than single-gene biomarkers. In order to avoid exhaustive search and redundant information, miRNAs are firstly clustered, then the combinations of representative cluster members are assessed as potential biomarkers. Both the criteria for the partition of clusters and selection of representative members are based on Fisher linear discriminant analysis (FDA). The FDA-based criterion has been demonstrated to be superior to three other criteria in selecting representative members, and also good at refining clusters. In the comparison with eight common feature selection methods, this clustering-based method performs the best with regard to the discriminative ability of selected biomarkers.Our experimental results demonstrate that the clustering-based method can identify microRNA combinatorial biomarkers with high accuracy and efficiency. Our method and data are available to the public upon request.

SUBMITTER: Yang Y 

PROVIDER: S-EPMC5374636 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A clustering-based approach for efficient identification of microRNA combinatorial biomarkers.

Yang Yang Y   Huang Ning N   Hao Luning L   Kong Wei W  

BMC genomics 20170314 Suppl 2


<h4>Background</h4>MicroRNAs (miRNAs) have great potential serving as tumor biomarkers and therapeutic targets. As the rapid development of high-throughput experimental technology, gene expression experiments have become more and more specialized and diversified. The complex data structure has brought great challenge for the identification of biomarkers. In the meantime, current statistical and machine learning methods for detecting biomarkers have the problem of low reliability and biased crite  ...[more]

Similar Datasets

| S-EPMC3715582 | biostudies-literature
| S-EPMC6034686 | biostudies-literature
| S-EPMC4345154 | biostudies-literature
| S-EPMC3459543 | biostudies-literature
2017-06-28 | GSE83270 | GEO
| S-EPMC11369284 | biostudies-literature
| S-EPMC8619582 | biostudies-literature
| S-EPMC6110866 | biostudies-literature
| S-EPMC7424406 | biostudies-literature
| S-EPMC8479235 | biostudies-literature