Unknown

Dataset Information

0

Haplosaurus computes protein haplotypes for use in precision drug design.


ABSTRACT: Selecting the most appropriate protein sequences is critical for precision drug design. Here we describe Haplosaurus, a bioinformatic tool for computation of protein haplotypes. Haplosaurus computes protein haplotypes from pre-existing chromosomally-phased genomic variation data. Integration into the Ensembl resource provides rapid and detailed protein haplotypes retrieval. Using Haplosaurus, we build a database of unique protein haplotypes from the 1000 Genomes dataset reflecting real-world protein sequence variability and their prevalence. For one in seven genes, their most common protein haplotype differs from the reference sequence and a similar number differs on their most common haplotype between human populations. Three case studies show how knowledge of the range of commonly encountered protein forms predicted in populations leads to insights into therapeutic efficacy. Haplosaurus and its associated database is expected to find broad applications in many disciplines using protein sequences and particularly impactful for therapeutics design.

SUBMITTER: Spooner W 

PROVIDER: S-EPMC6175845 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications


Selecting the most appropriate protein sequences is critical for precision drug design. Here we describe Haplosaurus, a bioinformatic tool for computation of protein haplotypes. Haplosaurus computes protein haplotypes from pre-existing chromosomally-phased genomic variation data. Integration into the Ensembl resource provides rapid and detailed protein haplotypes retrieval. Using Haplosaurus, we build a database of unique protein haplotypes from the 1000 Genomes dataset reflecting real-world pro  ...[more]

Similar Datasets

| S-EPMC6060444 | biostudies-literature
| S-EPMC6349207 | biostudies-literature
2012-12-21 | GSE36691 | GEO
2012-12-21 | E-GEOD-36691 | biostudies-arrayexpress
| S-EPMC3164996 | biostudies-literature
| S-EPMC10598866 | biostudies-literature
2016-08-24 | E-GEOD-85952 | biostudies-arrayexpress
2016-08-24 | GSE85952 | GEO
| S-EPMC3440625 | biostudies-literature
2024-10-27 | GSE279349 | GEO