Unknown

Dataset Information

0

Mining whole genome sequence data to efficiently attribute individuals to source populations.


ABSTRACT: Whole genome sequence (WGS) data could transform our ability to attribute individuals to source populations. However, methods that efficiently mine these data are yet to be developed. We present a minimal multilocus distance (MMD) method which rapidly deals with these large data sets as well as methods for optimally selecting loci. This was applied on WGS data to determine the source of human campylobacteriosis, the geographical origin of diverse biological species including humans and proteomic data to classify breast cancer tumours. The MMD method provides a highly accurate attribution which is computationally efficient for extended genotypes. These methods are generic, easy to implement for WGS and proteomic data and have wide application.

SUBMITTER: Perez-Reche FJ 

PROVIDER: S-EPMC7376179 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mining whole genome sequence data to efficiently attribute individuals to source populations.

Pérez-Reche Francisco J FJ   Rotariu Ovidiu O   Lopes Bruno S BS   Forbes Ken J KJ   Strachan Norval J C NJC  

Scientific reports 20200722 1


Whole genome sequence (WGS) data could transform our ability to attribute individuals to source populations. However, methods that efficiently mine these data are yet to be developed. We present a minimal multilocus distance (MMD) method which rapidly deals with these large data sets as well as methods for optimally selecting loci. This was applied on WGS data to determine the source of human campylobacteriosis, the geographical origin of diverse biological species including humans and proteomic  ...[more]

Similar Datasets

| S-EPMC7955157 | biostudies-literature
| S-EPMC6797713 | biostudies-literature
| S-EPMC6485071 | biostudies-literature
| S-EPMC7863413 | biostudies-literature
| S-EPMC2637895 | biostudies-other
| S-EPMC6896509 | biostudies-literature
| S-EPMC4216928 | biostudies-literature
| S-EPMC3907355 | biostudies-literature
| S-EPMC4027178 | biostudies-literature
| S-EPMC9553944 | biostudies-literature