Unknown

Dataset Information

0

Discovery of disease- and drug-specific pathways through community structures of a literature network.


ABSTRACT:

Motivation

In light of the massive growth of the scientific literature, text mining is increasingly used to extract biological pathways. Though multiple tools explore individual connections between genes, diseases and drugs, few extensively synthesize pathways for specific diseases and drugs.

Results

Through community detection of a literature network, we extracted 3444 functional gene groups that represented biological pathways for specific diseases and drugs. The network linked Medical Subject Headings (MeSH) terms of genes, diseases and drugs that co-occurred in publications. The resulting communities detected highly associated genes, diseases and drugs. These significantly matched current knowledge of biological pathways and predicted future ones in time-stamped experiments. Likewise, disease- and drug-specific communities also recapitulated known pathways for those given diseases and drugs. Moreover, diseases sharing communities had high comorbidity with each other and drugs sharing communities had many common side effects, consistent with related mechanisms. Indeed, the communities robustly recovered mutual targets for drugs [area under Receiver Operating Characteristic curve (AUROC)=0.75] and shared pathogenic genes for diseases (AUROC=0.82). These data show that literature communities inform not only just known biological processes but also suggest novel disease- and drug-specific mechanisms that may guide disease gene discovery and drug repurposing.

Availability and implementation

Application tools are available at http://meteor.lichtargelab.org.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Pham M 

PROVIDER: S-EPMC7103064 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Discovery of disease- and drug-specific pathways through community structures of a literature network.

Pham Minh M   Wilson Stephen S   Govindarajan Harikumar H   Lin Chih-Hsu CH   Lichtarge Olivier O  

Bioinformatics (Oxford, England) 20200301 6


<h4>Motivation</h4>In light of the massive growth of the scientific literature, text mining is increasingly used to extract biological pathways. Though multiple tools explore individual connections between genes, diseases and drugs, few extensively synthesize pathways for specific diseases and drugs.<h4>Results</h4>Through community detection of a literature network, we extracted 3444 functional gene groups that represented biological pathways for specific diseases and drugs. The network linked  ...[more]

Similar Datasets

| S-EPMC4944831 | biostudies-literature
| S-EPMC5741828 | biostudies-literature
| S-EPMC7361499 | biostudies-literature
| S-EPMC3878363 | biostudies-other
| S-EPMC8096770 | biostudies-literature
| S-EPMC9051440 | biostudies-literature
| S-EPMC3817400 | biostudies-literature
| S-EPMC5860178 | biostudies-literature
| S-EPMC7097144 | biostudies-literature
| S-EPMC3317484 | biostudies-literature