Large-scale modeling of pHLA complexes for structure-based immunogenicity prediction
Choi, Sae Hee and de Freitas, Martiela Vaz and Belle, Rafael Lorenzo and Devaurs, Didier and Ferreto, Tiago Coelho and Antunes, Dinler Amaral (2025) Large-scale modeling of pHLA complexes for structure-based immunogenicity prediction. Journal for ImmunoTherapy of Cancer, 13 (Suppl ). A1234. ISSN 2051-1426 (https://doi.org/10.1136/jitc-2025-SITC2025.1084)
Preview |
Text.
Filename: Large-scale-modeling-of-pHLA-complexes-for-structure-based-immunogenicity-prediction.pdf
Preprint License:
Download (633kB)| Preview |
Abstract
Background Cancer immunotherapies have become an effective tool by manipulating cellular immune response to fight cancer. A key step of cellular immunity involves peptides binding to the Human Leukocyte Antigen (HLA) molecules and forming a stable peptide-HLA (pHLA) complex. pHLAs are transported to the cell surface where they can be ‘inspected’ by T-cells, through highly specific T-cell receptors (TCRs). Understanding such interactions is central to designing peptide-based vaccines and T-cell-based immunotherapies. Key limitations to widespread use of cancer immunotherapies includes lack of immunogenicity of tumor-associated peptide-antigens, and risk of off-target causing immune-related adverse events. Different approaches are being explored to address and minimize this issue. Therefore, accurate prediction of pHLA immunogenicity is a critical area for advancing the design of cancer immunotherapies. However, current immunogenicity prediction tools are limited to identifying peptide motifs, neglecting critical structural features and TCR-specific recognition. We developed a structure-based machine learning tool that utilizes models of HLA-peptide-TCR complexes to extract features predictive of immunogenicity. Methods A labelled dataset involving over 15 million pHLA sequences with experimentally-determined immunogenicity results was selected from a previously developed sequence-based immunogenicity prediction tool called BigMHC.1 APE-Gen 2.02 and Boltz-23 were used to generate structural models for each pHLA, to be utilized for the extraction of structural features. APE-Gen 2.0 was selected for its tailored, scalable pHLA modeling and docking-based scoring of conformational ensembles.2 Boltz-2, a novel AI-based tool, for outperforming Alphafold2 by incorporating improved biophysical refinement.3 In addition, it also predicts complex binding affinity using a new AI-based approach. Results We modeled a pilot dataset of 77 complexes with binding affinity and immunogenicity labels using Ape-Gen 2.0 and Boltz2. Top scored conformation from Ape-Gen 2.0 ensembles are being analyzed to determine which structural features contribute to immunogenicity. Different properties and featurization approaches will be explored. We are also evaluating accuracy of affinity predictions on Boltz-2, in comparison to Ape-Gen 2.0 and Rosetta. Large-scale modeling of the entire dataset of 15 million complexes is ongoing. Conclusions By integrating structural features of the entire pHLA complex we aim to overcome limitations of current sequence-based approaches and enable more accurate screening of therapeutic peptides, improving the safety of next-generation immunotherapies. Once modeled with both independent approaches (~30 million pHLA complexes), our dataset will be the largest available for machine-learning training. We will then explore how to further improve immunogenicity prediction by leveraging different structural sources, featurization methods and AI models.
ORCID iDs
Choi, Sae Hee, de Freitas, Martiela Vaz, Belle, Rafael Lorenzo, Devaurs, Didier
ORCID: https://orcid.org/0000-0002-3415-9816, Ferreto, Tiago Coelho and Antunes, Dinler Amaral;
-
-
Item type: Article ID code: 94732 Dates: DateEvent4 November 2025Published1 November 2025AcceptedSubjects: Medicine > Internal medicine > Neoplasms. Tumors. Oncology (including Cancer)
Science > Microbiology > ImmunologyDepartment: Faculty of Science > Computer and Information Sciences Depositing user: Pure Administrator Date deposited: 14 Nov 2025 13:54 Last modified: 22 Jan 2026 09:41 Related URLs: URI: https://strathprints.strath.ac.uk/id/eprint/94732
Tools
Tools






