Unknown

Dataset Information

0

Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants.


ABSTRACT:

Background

Since the development of sequencing technology, an enormous amount of genetic information has been generated, and human cancer analysis using this information is drawing attention. As the effects of variants on human cancer become known, it is important to find cancer-associated variants among countless variants.

Results

We propose a new filter-based feature selection method applicable for extracting cancer-associated somatic variants considering correlations of data. Both variants associated with the activation and deactivation of cancer's characteristics are analyzed using dual correlation filters. The multiobjective optimization is utilized to consider two types of variants simultaneously without redundancy. To overcome high computational complexity problem, we calculate the correlation-based weight to select significant variants instead of directly searching for the optimal subset of variants. The proposed algorithm is applied to the identification of melanoma metastasis or breast cancer stage, and the classification results of the proposed method are compared with those of conventional single correlation filter-based method.

Conclusions

We verified that the proposed dual correlation filter-based method can extract cancer-associated variants related to the characteristics of human cancer.

SUBMITTER: Seo H 

PROVIDER: S-EPMC7596964 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants.

Seo Hyein H   Cho Dong-Ho DH  

BMC bioinformatics 20201030 1


<h4>Background</h4>Since the development of sequencing technology, an enormous amount of genetic information has been generated, and human cancer analysis using this information is drawing attention. As the effects of variants on human cancer become known, it is important to find cancer-associated variants among countless variants.<h4>Results</h4>We propose a new filter-based feature selection method applicable for extracting cancer-associated somatic variants considering correlations of data. B  ...[more]

Similar Datasets

| S-EPMC9322764 | biostudies-literature
| S-EPMC7397300 | biostudies-literature
| S-EPMC6642261 | biostudies-literature
| S-EPMC10782922 | biostudies-literature
| S-EPMC4342225 | biostudies-literature
| S-EPMC4804474 | biostudies-literature
| S-EPMC7270206 | biostudies-literature
| S-EPMC8446846 | biostudies-literature
| S-EPMC5638869 | biostudies-literature
| S-EPMC9163069 | biostudies-literature