Unknown

Dataset Information

0

PyNVR: investigating factors affecting feature selection from scRNA-seq data for lineage reconstruction.


ABSTRACT: MOTIVATION:The emergence of single-cell RNA-sequencing has enabled analyses that leverage transitioning cell states to reconstruct pseudotemporal trajectories. Multidimensional data sparsity, zero inflation and technical variation necessitate the selection of high-quality features that feed downstream analyses. Despite the development of numerous algorithms for the unsupervised selection of biologically relevant features, their differential performance remains largely unaddressed. RESULTS:We implemented the neighborhood variance ratio (NVR) feature selection approach as a Python package with substantial improvements in performance. In comparing NVR with multiple unsupervised algorithms such as dpFeature, we observed striking differences in features selected. We present evidence that quantifiable dataset properties have observable and predictable effects on the performance of these algorithms. AVAILABILITY AND IMPLEMENTATION:pyNVR is freely available at https://github.com/KenLauLab/NVR. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Chen B 

PROVIDER: S-EPMC6596893 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

pyNVR: investigating factors affecting feature selection from scRNA-seq data for lineage reconstruction.

Chen Bob B   Herring Charles A CA   Lau Ken S KS  

Bioinformatics (Oxford, England) 20190701 13


<h4>Motivation</h4>The emergence of single-cell RNA-sequencing has enabled analyses that leverage transitioning cell states to reconstruct pseudotemporal trajectories. Multidimensional data sparsity, zero inflation and technical variation necessitate the selection of high-quality features that feed downstream analyses. Despite the development of numerous algorithms for the unsupervised selection of biologically relevant features, their differential performance remains largely unaddressed.<h4>Res  ...[more]

Similar Datasets

| S-EPMC9175189 | biostudies-literature
| S-EPMC6547431 | biostudies-literature
| S-EPMC6755101 | biostudies-literature
| S-EPMC7585212 | biostudies-literature
| S-EPMC7141853 | biostudies-literature
| S-EPMC10547911 | biostudies-literature
| S-EPMC8157426 | biostudies-literature
| S-EPMC7234837 | biostudies-literature
| S-EPMC5888655 | biostudies-literature
| S-EPMC7647120 | biostudies-literature