Unknown

Dataset Information

0

ILRC: a hybrid biomarker discovery algorithm based on improved L1 regularization and clustering in microarray data.


ABSTRACT:

Background

Finding significant genes or proteins from gene chip data for disease diagnosis and drug development is an important task. However, the challenge comes from the curse of the data dimension. It is of great significance to use machine learning methods to find important features from the data and build an accurate classification model.

Results

The proposed method has proved superior to the published advanced hybrid feature selection method and traditional feature selection method on different public microarray data sets. In addition, the biomarkers selected using our method show a match to those provided by the cooperative hospital in a set of clinical cleft lip and palate data.

Method

In this paper, a feature selection algorithm ILRC based on clustering and improved L1 regularization is proposed. The features are firstly clustered, and the redundant features in the sub-clusters are deleted. Then all the remaining features are iteratively evaluated using ILR. The final result is given according to the cumulative weight reordering.

Conclusion

The proposed method can effectively remove redundant features. The algorithm's output has high stability and classification accuracy, which can potentially select potential biomarkers.

SUBMITTER: Yu K 

PROVIDER: S-EPMC8532312 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

ILRC: a hybrid biomarker discovery algorithm based on improved L1 regularization and clustering in microarray data.

Yu Kun K   Xie Weidong W   Wang Linjie L   Li Wei W  

BMC bioinformatics 20211022 1


<h4>Background</h4>Finding significant genes or proteins from gene chip data for disease diagnosis and drug development is an important task. However, the challenge comes from the curse of the data dimension. It is of great significance to use machine learning methods to find important features from the data and build an accurate classification model.<h4>Results</h4>The proposed method has proved superior to the published advanced hybrid feature selection method and traditional feature selection  ...[more]

Similar Datasets

| S-EPMC3684607 | biostudies-literature
| S-EPMC5504766 | biostudies-literature
| S-EPMC3996860 | biostudies-other
| S-EPMC3002369 | biostudies-literature
| S-EPMC3563403 | biostudies-literature
2010-10-08 | GSE18495 | GEO
| S-EPMC3984869 | biostudies-other
| S-EPMC6409843 | biostudies-other
2010-10-08 | E-GEOD-18495 | biostudies-arrayexpress
| S-EPMC1090559 | biostudies-literature