PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited q...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/178506 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-178506 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1785062024-06-25T05:56:47Z PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction Wan, Zhang Lin, Zhuoyi Rashid, Shamima Ng, Shaun Yue Hao Yin, Rui Senthilnath, J. Kwoh, Chee Keong College of Computing and Data Science School of Computer Science and Engineering School of Chemical and Biomedical Engineering 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) Biomedical Informatics Lab Computer and Information Science Medicine, Health and Life Sciences SARS-CoV-2 Paratope-epitope interaction Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited quantity of protein structures. In comparison, deep learning allows automatic substructure characterization and representation without hand-crafted feature engineering. In particular, large language models (LLMs) based method predicts neutralization using Fv sequences of antibody and antigen. Despite LLM's success, incorporating full-length Fv sequences suffers from: 1) inaccurate sequence-level labels in existing datasets, 2) inefficient modeling due to noisy non-contributing motifs, and 3) ignorance of non-bonded interactions that play a key role in facilitating epitope-paratope pairing. In this paper, we propose a novel approach that incorporates only the paratope and epitope for antibody-antigen neutralization prediction while adopting a novel set modeling that regards the paratope and epitope as bags of residues. Specifically, we hand-crafted a dataset containing neutralizing paratope-epitope pairs where epitopes are potentially generalizable to future unseen variants of SARS-CoV-2. Training on such a dataset enables deep learning models to predict neutralizing antibodies for prospective mutated variants of SARS-CoV-2, meanwhile addressing the problem of inaccurate sequence-level labels. A higher modeling efficiency is also achieved by disregarding non-contributing motifs. Furthermore, we also propose paratope-epitope set interaction (PESI), a set modeling model inspired by first principles that learns intra-inter non-covalent interactions through a global attention mechanism. To validate PESI, we perform a 10-fold cross-validation on our dataset. Experimental results show that PESI achieves a more balanced overall performance and a significant improvement on MCC as compared to existing architectures. Ministry of Education (MOE) Submitted/Accepted version This research is supported by the MOE Academic Research Fund Tier 2 (Grant No: MOE2019-T2-2-175) under the project ”Host-pathogen protein-protein interaction approaches for predicting virulence”. 2024-06-25T05:56:47Z 2024-06-25T05:56:47Z 2023 Conference Paper Wan, Z., Lin, Z., Rashid, S., Ng, S. Y. H., Yin, R., Senthilnath, J. & Kwoh, C. K. (2023). PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction. 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 49-56. https://dx.doi.org/10.1109/BIBM58861.2023.10386059 9798350337488 2156-1133 https://hdl.handle.net/10356/178506 10.1109/BIBM58861.2023.10386059 2-s2.0-85184882196 49 56 en MOE2019-T2-2-175 © 2023 IEEE. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1109/BIBM58861.2023.10386059. application/pdf application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Medicine, Health and Life Sciences SARS-CoV-2 Paratope-epitope interaction |
spellingShingle |
Computer and Information Science Medicine, Health and Life Sciences SARS-CoV-2 Paratope-epitope interaction Wan, Zhang Lin, Zhuoyi Rashid, Shamima Ng, Shaun Yue Hao Yin, Rui Senthilnath, J. Kwoh, Chee Keong PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
description |
Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited quantity of protein structures. In comparison, deep learning allows automatic substructure characterization and representation without hand-crafted feature engineering. In particular, large language models (LLMs) based method predicts neutralization using Fv sequences of antibody and antigen. Despite LLM's success, incorporating full-length Fv sequences suffers from: 1) inaccurate sequence-level labels in existing datasets, 2) inefficient modeling due to noisy non-contributing motifs, and 3) ignorance of non-bonded interactions that play a key role in facilitating epitope-paratope pairing. In this paper, we propose a novel approach that incorporates only the paratope and epitope for antibody-antigen neutralization prediction while adopting a novel set modeling that regards the paratope and epitope as bags of residues. Specifically, we hand-crafted a dataset containing neutralizing paratope-epitope pairs where epitopes are potentially generalizable to future unseen variants of SARS-CoV-2. Training on such a dataset enables deep learning models to predict neutralizing antibodies for prospective mutated variants of SARS-CoV-2, meanwhile addressing the problem of inaccurate sequence-level labels. A higher modeling efficiency is also achieved by disregarding non-contributing motifs. Furthermore, we also propose paratope-epitope set interaction (PESI), a set modeling model inspired by first principles that learns intra-inter non-covalent interactions through a global attention mechanism. To validate PESI, we perform a 10-fold cross-validation on our dataset. Experimental results show that PESI achieves a more balanced overall performance and a significant improvement on MCC as compared to existing architectures. |
author2 |
College of Computing and Data Science |
author_facet |
College of Computing and Data Science Wan, Zhang Lin, Zhuoyi Rashid, Shamima Ng, Shaun Yue Hao Yin, Rui Senthilnath, J. Kwoh, Chee Keong |
format |
Conference or Workshop Item |
author |
Wan, Zhang Lin, Zhuoyi Rashid, Shamima Ng, Shaun Yue Hao Yin, Rui Senthilnath, J. Kwoh, Chee Keong |
author_sort |
Wan, Zhang |
title |
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
title_short |
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
title_full |
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
title_fullStr |
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
title_full_unstemmed |
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction |
title_sort |
pesi: paratope-epitope set interaction for sars-cov-2 neutralization prediction |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/178506 |
_version_ |
1814047346462294016 |