Dataset Information

On the stability of log-rank test under labeling errors.


ABSTRACT:

Motivation

The log-rank test is widely used to assess the statistical significance of observed differences in survival when comparing two or more groups. The test rests on several assumptions that support the validity of its calculations. One implicit assumption is that no errors occur in the labeling of the samples, that is, that the mapping between samples and groups is perfectly correct. In this work we investigate how test results may be affected when the original labeling contains some errors.

Results

We introduce and define the uncertainty that arises from labeling errors in the log-rank test. To deal with this uncertainty, we develop a novel algorithm for efficiently calculating a stability interval around the original log-rank p-value, and we prove its correctness. We demonstrate our algorithm on several datasets.
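
To make the idea of a stability interval concrete, the following is a minimal brute-force sketch, not the efficient algorithm described above and not the LoRSI API. It assumes that the interval is the range of two-group log-rank p-values obtained when at most k sample labels are flipped; that definition, and the names logrank_p and brute_force_interval, are illustrative assumptions. The enumeration grows combinatorially with k, so it is only feasible for small inputs.

```python
import math
from itertools import combinations


def logrank_p(times, events, groups):
    """Two-group log-rank p-value (chi-square approximation, 1 d.o.f.).

    times  : observed time per sample
    events : 1 if the event was observed, 0 if censored
    groups : group label per sample, 0 or 1
    """
    obs_minus_exp = 0.0  # observed minus expected events in group 1
    var = 0.0            # hypergeometric variance, summed over event times
    event_times = sorted({t for t, e in zip(times, events) if e})
    for t in event_times:
        n = sum(1 for ti in times if ti >= t)  # at risk, both groups
        n1 = sum(1 for ti, g in zip(times, groups) if ti >= t and g == 1)
        d = sum(1 for ti, e in zip(times, events) if ti == t and e)
        d1 = sum(1 for ti, e, g in zip(times, events, groups)
                 if ti == t and e and g == 1)
        obs_minus_exp += d1 - d * n1 / n
        if n > 1:
            var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    if var == 0:
        # Degenerate case (for example, one group is empty): report p = 1.
        return 1.0
    chi2 = obs_minus_exp ** 2 / var
    # Survival function of a chi-square variable with 1 degree of freedom.
    return math.erfc(math.sqrt(chi2 / 2))


def brute_force_interval(times, events, groups, k):
    """Return (min_p, original_p, max_p) over all relabelings with <= k flips.

    Illustrative assumption: a labeling error moves a sample to the other
    group, and the interval is the range of the resulting p-values.
    """
    p0 = logrank_p(times, events, groups)
    lo = hi = p0
    for r in range(1, k + 1):
        for flips in combinations(range(len(groups)), r):
            relabeled = list(groups)
            for i in flips:
                relabeled[i] = 1 - relabeled[i]
            p = logrank_p(times, events, relabeled)
            lo = min(lo, p)
            hi = max(hi, p)
    return lo, p0, hi
```

Calling brute_force_interval(times, events, groups, k) on a small cohort returns the lowest p-value, the original p-value, and the highest p-value found within k label flips. A wide gap between the first and last value suggests that the significance call is sensitive to mislabeling.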

Availability

We provide a Python implementation, called LoRSI, for calculating the stability interval using our algorithm. It is available at https://github.com/YakhiniGroup/LoRSI

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Galili B 

PROVIDER: S-EPMC8652036 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8649467 | biostudies-literature
| S-EPMC5793689 | biostudies-literature
| S-EPMC8592474 | biostudies-literature
| S-EPMC4013236 | biostudies-literature
| S-EPMC7850908 | biostudies-literature
| S-EPMC9305601 | biostudies-literature
| S-EPMC6690426 | biostudies-literature
| S-EPMC8944499 | biostudies-literature
| S-EPMC2795392 | biostudies-literature