Unknown

Dataset Information

0

Sfaira accelerates data and model reuse in single cell genomics.


ABSTRACT: Single-cell RNA-seq datasets are often first analyzed independently without harnessing model fits from previous studies, and are then contextualized with public data sets, requiring time-consuming data wrangling. We address these issues with sfaira, a single-cell data zoo for public data sets paired with a model zoo for executable pre-trained models. The data zoo is designed to facilitate contribution of data sets using ontologies for metadata. We propose an adaption of cross-entropy loss for cell type classification tailored to datasets annotated at different levels of coarseness. We demonstrate the utility of sfaira by training models across anatomic data partitions on 8 million cells.

SUBMITTER: Fischer DS 

PROVIDER: S-EPMC8386039 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6777348 | biostudies-literature
| S-EPMC6443043 | biostudies-literature
| S-EPMC8748196 | biostudies-literature
| S-EPMC7460789 | biostudies-literature
| S-EPMC7045518 | biostudies-literature
| S-EPMC6065048 | biostudies-literature
| S-EPMC3792178 | biostudies-literature
| S-EPMC4905655 | biostudies-literature
| S-EPMC8701080 | biostudies-literature
| S-EPMC5282606 | biostudies-literature