Unknown

Dataset Information

0

Word Embedding Reveals Cyfra 21-1 as a Biomarker for Chronic Obstructive Pulmonary Disease.


ABSTRACT:

Background

Although patients with chronic obstructive pulmonary disease (COPD) experience high morbidity and mortality worldwide, few biomarkers are available for COPD. Here, we analyzed potential biomarkers for the diagnosis of COPD by using word embedding.

Methods

To determine which biomarkers are likely to be associated with COPD, we selected respiratory disease-related biomarkers. Degrees of similarity between the 26 selected biomarkers and COPD were measured by word embedding. And we infer the similarity with COPD through the word embedding model trained in the large-capacity medical corpus, and search for biomarkers with high similarity among them. We used Word2Vec, Canonical Correlation Analysis, and Global Vector for word embedding. We evaluated the associations of selected biomarkers with COPD parameters in a cohort of patients with COPD.

Results

Cytokeratin 19 fragment (Cyfra 21-1) was selected because of its high similarity and its significant correlation with the COPD phenotype. Serum Cyfra 21-1 levels were determined in patients with COPD and controls (4.3 ± 5.9 vs. 3.9 ± 3.6 ng/mL, P = 0.611). The emphysema index was significantly correlated with the serum Cyfra 21-1 level (correlation coefficient = 0.219, P = 0.015).

Conclusion

Word embedding may be used for the discovery of biomarkers for COPD and Cyfra 21-1 may be used as a biomarker for emphysema. Additional studies are needed to validate Cyfra 21-1 as a biomarker for COPD.

SUBMITTER: Heo J 

PROVIDER: S-EPMC8422037 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2003-07-16 | GSE475 | GEO
2008-05-31 | GSE8581 | GEO
2014-08-14 | E-GEOD-60399 | biostudies-arrayexpress
| S-EPMC8756227 | biostudies-literature
| PRJNA647843 | ENA
2014-08-14 | GSE60399 | GEO
| S-EPMC7358425 | biostudies-literature
| S-EPMC9397727 | biostudies-literature
| S-EPMC7172377 | biostudies-literature
| S-EPMC6603062 | biostudies-literature