Ontology highlight
ABSTRACT:
SUBMITTER: Bernhardt M
PROVIDER: S-EPMC8897392 | biostudies-literature | 2022 Mar
REPOSITORIES: biostudies-literature
Bernhardt Mélanie M Castro Daniel C DC Tanno Ryutaro R Schwaighofer Anton A Tezcan Kerem C KC Monteiro Miguel M Bannur Shruthi S Lungren Matthew P MP Nori Aditya A Glocker Ben B Alvarez-Valle Javier J Oktay Ozan O
Nature communications 20220304 1
Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have a confounding effect on the assessment of model performance. Nevertheless, employing experts to remove label noise by fully re-annotating large datasets is infeasible in resource-constrained settings, such as healthcare. This work advocates for a data-driven approach to prioritising samples for re-annotation-which we term "active label cleaning". We propose to rank instance ...[more]