Protein language models are performant in structure-free virtual screening
Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This wor...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181379 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181379 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1813792024-12-02T15:32:31Z Protein language models are performant in structure-free virtual screening Lam, Hilbert Yuen In Guan, Jia Sheng Ong, Xing Er Pincket, Robbe Mu, Yuguang School of Biological Sciences MagMol Pte. Ltd. Medicine, Health and Life Sciences Virtual screening Computer-aided drug design Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This work demonstrates that by employing protein language models and molecular graphs as inputs to a novel graph-to-transformer cross-attention mechanism, a screening power comparable to state-of-the-art structure-based models can be achieved. The implications thereof include highly expedited VS due to the greatly reduced compute required to run this model, and the ability to perform early stages of computer-aided drug design in the complete absence of 3D protein structures. Ministry of Education (MOE) Nanyang Technological University National Supercomputing Centre (NSCC) Singapore Published version This work is supported by the Singapore Ministry of Education (MOE) Tier 1 grant RG97/22. The computational work for this article was partially performed on resources of the National Supercomputing Centre, Singapore (https://www.nscc.sg) and the HADLEY supercomputer by Singapore Centre for Environmental Life Sciences Engineering (SCELSE). 2024-11-27T06:11:06Z 2024-11-27T06:11:06Z 2024 Journal Article Lam, H. Y. I., Guan, J. S., Ong, X. E., Pincket, R. & Mu, Y. (2024). Protein language models are performant in structure-free virtual screening. Briefings in Bioinformatics, 25(6). https://dx.doi.org/10.1093/bib/bbae480 1477-4054 https://hdl.handle.net/10356/181379 10.1093/bib/bbae480 39327890 2-s2.0-85205151848 6 25 en RG97/22 Briefings in Bioinformatics © The Author(s) 2024. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Medicine, Health and Life Sciences Virtual screening Computer-aided drug design |
spellingShingle |
Medicine, Health and Life Sciences Virtual screening Computer-aided drug design Lam, Hilbert Yuen In Guan, Jia Sheng Ong, Xing Er Pincket, Robbe Mu, Yuguang Protein language models are performant in structure-free virtual screening |
description |
Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This work demonstrates that by employing protein language models and molecular graphs as inputs to a novel graph-to-transformer cross-attention mechanism, a screening power comparable to state-of-the-art structure-based models can be achieved. The implications thereof include highly expedited VS due to the greatly reduced compute required to run this model, and the ability to perform early stages of computer-aided drug design in the complete absence of 3D protein structures. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Lam, Hilbert Yuen In Guan, Jia Sheng Ong, Xing Er Pincket, Robbe Mu, Yuguang |
format |
Article |
author |
Lam, Hilbert Yuen In Guan, Jia Sheng Ong, Xing Er Pincket, Robbe Mu, Yuguang |
author_sort |
Lam, Hilbert Yuen In |
title |
Protein language models are performant in structure-free virtual screening |
title_short |
Protein language models are performant in structure-free virtual screening |
title_full |
Protein language models are performant in structure-free virtual screening |
title_fullStr |
Protein language models are performant in structure-free virtual screening |
title_full_unstemmed |
Protein language models are performant in structure-free virtual screening |
title_sort |
protein language models are performant in structure-free virtual screening |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181379 |
_version_ |
1819112969705881600 |