Unknown

Dataset Information

0

Orthogonal outlier detection and dimension estimation for improved MDS embedding of biological datasets.


ABSTRACT: Conventional dimensionality reduction methods like Multidimensional Scaling (MDS) are sensitive to the presence of orthogonal outliers, leading to significant defects in the embedding. We introduce a robust MDS method, called DeCOr-MDS (Detection and Correction of Orthogonal outliers using MDS), based on the geometry and statistics of simplices formed by data points, that allows to detect orthogonal outliers and subsequently reduce dimensionality. We validate our methods using synthetic datasets, and further show how it can be applied to a variety of large real biological datasets, including cancer image cell data, human microbiome project data and single cell RNA sequencing data, to address the task of data cleaning and visualization.

SUBMITTER: Li W 

PROVIDER: S-EPMC10448701 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

altmetric image

Publications

Orthogonal outlier detection and dimension estimation for improved MDS embedding of biological datasets.

Li Wanxin W   Mirone Jules J   Prasad Ashok A   Miolane Nina N   Legrand Carine C   Dao Duc Khanh K  

Frontiers in bioinformatics 20230810


Conventional dimensionality reduction methods like Multidimensional Scaling (MDS) are sensitive to the presence of orthogonal outliers, leading to significant defects in the embedding. We introduce a robust MDS method, called <i>DeCOr-MDS</i> (Detection and Correction of Orthogonal outliers using MDS), based on the geometry and statistics of simplices formed by data points, that allows to detect orthogonal outliers and subsequently reduce dimensionality. We validate our methods using synthetic d  ...[more]

Similar Datasets

| S-EPMC10865689 | biostudies-literature
| S-EPMC4980871 | biostudies-literature
| S-EPMC6454425 | biostudies-literature
| S-EPMC10655845 | biostudies-literature
| S-EPMC8771813 | biostudies-literature
| S-EPMC3777433 | biostudies-literature
| S-EPMC3961718 | biostudies-literature
2020-05-18 | GSE125020 | GEO
| S-EPMC11323683 | biostudies-literature
| S-EPMC8213704 | biostudies-literature