Ontology highlight
ABSTRACT:
SUBMITTER: Johndrow JE
PROVIDER: S-EPMC5963577 | biostudies-literature | 2018 Jun
REPOSITORIES: biostudies-literature
Johndrow J E JE Lum K K Dunson D B DB
Biometrika 20180319 2
There has been substantial recent interest in record linkage, where one attempts to group the records pertaining to the same entities from one or more large databases that lack unique identifiers. This can be viewed as a type of microclustering, with few observations per cluster and a very large number of clusters. We show that the problem is fundamentally hard from a theoretical perspective and, even in idealized cases, accurate entity resolution is effectively impossible unless the number of e ...[more]