Unknown

Dataset Information

0

Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle.


ABSTRACT: BACKGROUND:There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next-generation sequencing data challenging, since it uses genomic coordinates. STUDY DESIGN AND METHODS:The conventional cDNA reference sequences for all known RBC and PLT genes that correlate with antigen expression were aligned to the human reference genome. The alignments allowed conversion of conventional cDNA nucleotide positions to the corresponding genomic coordinates. RBC and PLT antigen prediction was then performed using the human reference genome and whole genome sequencing (WGS) data with serologic confirmation. RESULTS:Some major differences and alignment issues were found when attempting to convert the conventional cDNA to human reference genome sequences for the following genes: ABO, A4GALT, RHD, RHCE, FUT3, ACKR1 (previously DARC), ACHE, FUT2, CR1, GCNT2, and RHAG. However, it was possible to create usable alignments, which facilitated the prediction of all RBC and PLT antigens with a known molecular basis from WGS data. Traditional serologic typing for 18 RBC antigens were in agreement with the WGS-based antigen predictions, providing proof of principle for this approach. CONCLUSION:Detailed mapping of conventional cDNA annotated RBC and PLT alleles can enable accurate prediction of RBC and PLT antigens from whole genomic sequencing data.

SUBMITTER: Lane WJ 

PROVIDER: S-EPMC5019240 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle.

Lane William J WJ   Westhoff Connie M CM   Uy Jon Michael JM   Aguad Maria M   Smeland-Wagman Robin R   Kaufman Richard M RM   Rehm Heidi L HL   Green Robert C RC   Silberstein Leslie E LE  

Transfusion 20151203 3


<h4>Background</h4>There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next-generation sequencing data challenging, since it uses  ...[more]

Similar Datasets

| S-EPMC3217398 | biostudies-literature
| S-EPMC6438177 | biostudies-literature
2006-10-23 | GSE5721 | GEO
| S-EPMC8794781 | biostudies-literature
2010-05-26 | E-GEOD-10761 | biostudies-arrayexpress
| S-EPMC8206199 | biostudies-literature
2010-04-09 | GSE20873 | GEO
| S-EPMC7171411 | biostudies-literature
| S-EPMC2442168 | biostudies-literature
2008-03-08 | GSE10761 | GEO