Dataset Information

On Model Evaluation Under Non-constant Class Imbalance

ABSTRACT: Many real-world classification problems are significantly class-imbalanced to detriment of the class of interest. The standard set of proper evaluation metrics is well-known but the usual assumption is that the test dataset imbalance equals the real-world imbalance. In practice, this assumption is often broken for various reasons. The reported results are then often too optimistic and may lead to wrong conclusions about industrial impact and suitability of proposed techniques. We introduce methods (Supplementary code related to techniques described in this paper is available at: https://github.com/CiscoCTA/nci_eval) focusing on evaluation under non-constant class imbalance. We show that not only the absolute values of commonly used metrics, but even the order of classifiers in relation to the evaluation metric used is affected by the change of the imbalance rate. Finally, we demonstrate that using subsampling in order to get a test dataset with class imbalance equal to the one observed in the wild is not necessary, and eventually can lead to significant errors in classifier’s performance estimate.

SUBMITTER: Krzhizhanovskaya V

PROVIDER: S-EPMC7303692 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Similar Datasets

Project description:Viscoelasticity is a fundamental property of virtually all biological materials, and proteinaceous, fibrous materials that constitute the extracellular matrix (ECM) are no exception. Viscoelasticity may be particularly important in the ECM since cells can apply mechanical stress resulting from cell contractility over very long periods of time. However, measurements of ECM fiber response to long-term constant force loading are scarce, despite the increasing recognition that mechanical strain regulates the biological function of some ECM fibers. We developed a dual micropipette system that applies constant force to single fibers for up to 8?h. We utilized this system to study the time dependent response of fibronectin (Fn) fibers to constant force, as Fn fibers exhibit tremendous extensibility before mechanical failure as well as strain dependent alterations in biological properties. These data demonstrate the Fn fibers continue to stretch under constant force loading for at least 8?h and that this long-term creep results in plastic deformation of Fn fibers, in contrast to elastic deformation of Fn fibers under short-term, but fast loading rate extension. These data demonstrate that physiologically-relevant loading may impart mechanical features to Fn fibers by switching them into an extended state that may have altered biological functions. STATEMENT OF SIGNIFICANCE: Measurements of extracellular matrix (ECM) fiber response to constant force loading are scarce, so we developed a novel technique for applying constant force to single ECM fibers. We used this technique to measure constant force creep of fibronectin fibers since these fibers have been shown to be mechanotransducers whose functions can be altered by mechanical strain. We found that fibronectin fibers creep under constant force loading for the duration of the experiment and that this creep behavior resembles a power law. Furthermore, we found that constant force creep results in plastic deformation of the fibers, which suggests that the mechanobiological switching of fibronectin can only occur once after long-term loading.

Dataset Information

On Model Evaluation Under Non-constant Class Imbalance

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets