Unknown

Dataset Information

0

SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation.


ABSTRACT:

Motivation

Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference.

Results

We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS.

Availabilityand implementation

SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears).

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Manching H 

PROVIDER: S-EPMC8097754 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4942178 | biostudies-literature
| PRJEB54098 | ENA
| S-EPMC5209871 | biostudies-literature
| S-EPMC4512549 | biostudies-literature
| S-EPMC8525220 | biostudies-literature
| S-EPMC3519909 | biostudies-literature
| S-EPMC7782867 | biostudies-literature
| S-EPMC5995191 | biostudies-literature
| S-EPMC3899994 | biostudies-literature
| S-EPMC6837550 | biostudies-literature