Unknown

Dataset Information

0

FAVA: high-quality functional association networks inferred from scRNA-seq and proteomics data.


ABSTRACT:

Motivation

Protein networks are commonly used for understanding how proteins interact. However, they are typically biased by data availability, favoring well-studied proteins with more interactions. To uncover functions of understudied proteins, we must use data that are not affected by this literature bias, such as single-cell RNA-seq and proteomics. Due to data sparseness and redundancy, functional association analysis becomes complex.

Results

To address this, we have developed FAVA (Functional Associations using Variational Autoencoders), which compresses high-dimensional data into a low-dimensional space. FAVA infers networks from high-dimensional omics data with much higher accuracy than existing methods, across a diverse collection of real as well as simulated datasets. FAVA can process large datasets with over 0.5 million conditions and has predicted 4210 interactions between 1039 understudied proteins. Our findings showcase FAVA's capability to offer novel perspectives on protein interactions. FAVA functions within the scverse ecosystem, employing AnnData as its input source.

Availability and implementation

Source code, documentation, and tutorials for FAVA are accessible on GitHub at https://github.com/mikelkou/fava. FAVA can also be installed and used via pip/PyPI as well as via the scverse ecosystem https://github.com/scverse/ecosystem-packages/tree/main/packages/favapy.

SUBMITTER: Koutrouli M 

PROVIDER: S-EPMC10868155 | biostudies-literature | 2024 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

FAVA: high-quality functional association networks inferred from scRNA-seq and proteomics data.

Koutrouli Mikaela M   Nastou Katerina K   Piera Líndez Pau P   Bouwmeester Robbin R   Rasmussen Simon S   Martens Lennart L   Jensen Lars Juhl LJ  

Bioinformatics (Oxford, England) 20240201 2


<h4>Motivation</h4>Protein networks are commonly used for understanding how proteins interact. However, they are typically biased by data availability, favoring well-studied proteins with more interactions. To uncover functions of understudied proteins, we must use data that are not affected by this literature bias, such as single-cell RNA-seq and proteomics. Due to data sparseness and redundancy, functional association analysis becomes complex.<h4>Results</h4>To address this, we have developed  ...[more]

Similar Datasets

| S-EPMC11274664 | biostudies-literature
| S-EPMC10519377 | biostudies-literature
| S-EPMC10168282 | biostudies-literature
| S-EPMC11874984 | biostudies-literature
| S-EPMC9364370 | biostudies-literature
| S-EPMC2248599 | biostudies-literature
| S-EPMC11891664 | biostudies-literature
| S-EPMC11741621 | biostudies-literature
| S-EPMC7141853 | biostudies-literature
| S-EPMC11846686 | biostudies-literature