Project description:HLA-I molecules bind short peptides and present them to CD8+ T cells for TCR recognition. The length of HLA-I ligands typically ranges from 8 to 12 amino acids, but high variability is observed between different alleles. Here we used recent HLA peptidomics data to analyze in an unbiased way peptide length distributions over 85 different HLA-I alleles. Our results revealed clear clustering of HLA-I alleles with distinct peptide length distributions, which enabled us to unravel some of the molecular basis of peptide length distributions and predict peptide length distributions based on HLA-I sequences only. We further took advantage of our collection of curated HLA peptidomics studies to investigate multiple specificity in HLA-I molecules and validated these observations with binding assays. Explicitly modeling peptide length distributions and multiple specificity significantly improved predictions of naturally presented HLA-I ligands, as demonstrated in an independent benchmarking based on ten newly generated HLA peptidomics datasets from meningioma samples.
Project description:Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.
Project description:DNA methyltransferase 3A (DNMT3A) and isocitrate dehydrogenase 1 & 2 (IDH1/2) are genes involved in epigenetic regulation, each mutated in 7-23% of acute myeloid leukemia (AML). Here, we investigated whether hotspot mutations in these genes encode neoantigens that can be targeted by immunotherapy. Five EBV-B cell lines expressing common HLA class I alleles were transduced with a minigene construct containing mutations that often occur in DNMT3A or IDH1/2. Peptides eluted from HLA class I alleles on these EBV-B cell lines were analyzed by tandem mass spec-trometry. We identified an HLA-A*01:01-binding DNMT3AR882H peptide and HLA-B*07:02-binding IDH2R140Q peptide, for which we searched for specific T cells in healthy individuals using pep-tide-HLA tetramers. Various T-cell clones were isolated showing specific reactivity against cell lines transduced with full-length DNMT3AR882H or IDH2R140Q genes, while cell lines transduced with wildtype genes were not recognized. One T-cell clone for DNMT3AR882H also reacted against patient AML cells with the mutation, while AML cells without the mutation were not recognized, thereby validating surface presentation of a DNMT3AR882H neoantigen that can potentially be targeted on HLA-A*01:01 positive AML by immunotherapy.