PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction

Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited q...

Full description

Saved in:
Bibliographic Details
Main Authors: Wan, Zhang, Lin, Zhuoyi, Rashid, Shamima, Ng, Shaun Yue Hao, Yin, Rui, Senthilnath, J., Kwoh, Chee Keong
Other Authors: College of Computing and Data Science
Format: Conference or Workshop Item
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/178506
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-178506
record_format dspace
spelling sg-ntu-dr.10356-1785062024-06-25T05:56:47Z PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction Wan, Zhang Lin, Zhuoyi Rashid, Shamima Ng, Shaun Yue Hao Yin, Rui Senthilnath, J. Kwoh, Chee Keong College of Computing and Data Science School of Computer Science and Engineering School of Chemical and Biomedical Engineering 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) Biomedical Informatics Lab Computer and Information Science Medicine, Health and Life Sciences SARS-CoV-2 Paratope-epitope interaction Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited quantity of protein structures. In comparison, deep learning allows automatic substructure characterization and representation without hand-crafted feature engineering. In particular, large language models (LLMs) based method predicts neutralization using Fv sequences of antibody and antigen. Despite LLM's success, incorporating full-length Fv sequences suffers from: 1) inaccurate sequence-level labels in existing datasets, 2) inefficient modeling due to noisy non-contributing motifs, and 3) ignorance of non-bonded interactions that play a key role in facilitating epitope-paratope pairing. In this paper, we propose a novel approach that incorporates only the paratope and epitope for antibody-antigen neutralization prediction while adopting a novel set modeling that regards the paratope and epitope as bags of residues. Specifically, we hand-crafted a dataset containing neutralizing paratope-epitope pairs where epitopes are potentially generalizable to future unseen variants of SARS-CoV-2. Training on such a dataset enables deep learning models to predict neutralizing antibodies for prospective mutated variants of SARS-CoV-2, meanwhile addressing the problem of inaccurate sequence-level labels. A higher modeling efficiency is also achieved by disregarding non-contributing motifs. Furthermore, we also propose paratope-epitope set interaction (PESI), a set modeling model inspired by first principles that learns intra-inter non-covalent interactions through a global attention mechanism. To validate PESI, we perform a 10-fold cross-validation on our dataset. Experimental results show that PESI achieves a more balanced overall performance and a significant improvement on MCC as compared to existing architectures. Ministry of Education (MOE) Submitted/Accepted version This research is supported by the MOE Academic Research Fund Tier 2 (Grant No: MOE2019-T2-2-175) under the project ”Host-pathogen protein-protein interaction approaches for predicting virulence”. 2024-06-25T05:56:47Z 2024-06-25T05:56:47Z 2023 Conference Paper Wan, Z., Lin, Z., Rashid, S., Ng, S. Y. H., Yin, R., Senthilnath, J. & Kwoh, C. K. (2023). PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction. 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 49-56. https://dx.doi.org/10.1109/BIBM58861.2023.10386059 9798350337488 2156-1133 https://hdl.handle.net/10356/178506 10.1109/BIBM58861.2023.10386059 2-s2.0-85184882196 49 56 en MOE2019-T2-2-175 © 2023 IEEE. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1109/BIBM58861.2023.10386059. application/pdf application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Medicine, Health and Life Sciences
SARS-CoV-2
Paratope-epitope interaction
spellingShingle Computer and Information Science
Medicine, Health and Life Sciences
SARS-CoV-2
Paratope-epitope interaction
Wan, Zhang
Lin, Zhuoyi
Rashid, Shamima
Ng, Shaun Yue Hao
Yin, Rui
Senthilnath, J.
Kwoh, Chee Keong
PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
description Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited quantity of protein structures. In comparison, deep learning allows automatic substructure characterization and representation without hand-crafted feature engineering. In particular, large language models (LLMs) based method predicts neutralization using Fv sequences of antibody and antigen. Despite LLM's success, incorporating full-length Fv sequences suffers from: 1) inaccurate sequence-level labels in existing datasets, 2) inefficient modeling due to noisy non-contributing motifs, and 3) ignorance of non-bonded interactions that play a key role in facilitating epitope-paratope pairing. In this paper, we propose a novel approach that incorporates only the paratope and epitope for antibody-antigen neutralization prediction while adopting a novel set modeling that regards the paratope and epitope as bags of residues. Specifically, we hand-crafted a dataset containing neutralizing paratope-epitope pairs where epitopes are potentially generalizable to future unseen variants of SARS-CoV-2. Training on such a dataset enables deep learning models to predict neutralizing antibodies for prospective mutated variants of SARS-CoV-2, meanwhile addressing the problem of inaccurate sequence-level labels. A higher modeling efficiency is also achieved by disregarding non-contributing motifs. Furthermore, we also propose paratope-epitope set interaction (PESI), a set modeling model inspired by first principles that learns intra-inter non-covalent interactions through a global attention mechanism. To validate PESI, we perform a 10-fold cross-validation on our dataset. Experimental results show that PESI achieves a more balanced overall performance and a significant improvement on MCC as compared to existing architectures.
author2 College of Computing and Data Science
author_facet College of Computing and Data Science
Wan, Zhang
Lin, Zhuoyi
Rashid, Shamima
Ng, Shaun Yue Hao
Yin, Rui
Senthilnath, J.
Kwoh, Chee Keong
format Conference or Workshop Item
author Wan, Zhang
Lin, Zhuoyi
Rashid, Shamima
Ng, Shaun Yue Hao
Yin, Rui
Senthilnath, J.
Kwoh, Chee Keong
author_sort Wan, Zhang
title PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
title_short PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
title_full PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
title_fullStr PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
title_full_unstemmed PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
title_sort pesi: paratope-epitope set interaction for sars-cov-2 neutralization prediction
publishDate 2024
url https://hdl.handle.net/10356/178506
_version_ 1814047346462294016