Ontology highlight
ABSTRACT:
SUBMITTER: Li R
PROVIDER: S-EPMC7872589 | biostudies-literature | 2020 Jun
REPOSITORIES: biostudies-literature
Li Rundong R Gatterbauer Wolfgang W Riedewald Mirek M
Proceedings. ACM-SIGMOD International Conference on Management of Data 20200601
We consider running-time optimization for band-joins in a distributed system, e.g., the cloud. To balance load across worker machines, input has to be partitioned, which causes duplication. We explore how to resolve this tension between <i>maximum load per worker</i> and <i>input duplication</i> for band-joins between two relations. Previous work suffered from high optimization cost or considered partitionings that were too restricted (resulting in suboptimal join performance). Our main insight ...[more]