Protein language models are performant in structure-free virtual screening

Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This wor...

Full description

Saved in:
Bibliographic Details
Main Authors: Lam, Hilbert Yuen In, Guan, Jia Sheng, Ong, Xing Er, Pincket, Robbe, Mu, Yuguang
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181379
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181379
record_format dspace
spelling sg-ntu-dr.10356-1813792024-12-02T15:32:31Z Protein language models are performant in structure-free virtual screening Lam, Hilbert Yuen In Guan, Jia Sheng Ong, Xing Er Pincket, Robbe Mu, Yuguang School of Biological Sciences MagMol Pte. Ltd. Medicine, Health and Life Sciences Virtual screening Computer-aided drug design Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This work demonstrates that by employing protein language models and molecular graphs as inputs to a novel graph-to-transformer cross-attention mechanism, a screening power comparable to state-of-the-art structure-based models can be achieved. The implications thereof include highly expedited VS due to the greatly reduced compute required to run this model, and the ability to perform early stages of computer-aided drug design in the complete absence of 3D protein structures. Ministry of Education (MOE) Nanyang Technological University National Supercomputing Centre (NSCC) Singapore Published version This work is supported by the Singapore Ministry of Education (MOE) Tier 1 grant RG97/22. The computational work for this article was partially performed on resources of the National Supercomputing Centre, Singapore (https://www.nscc.sg) and the HADLEY supercomputer by Singapore Centre for Environmental Life Sciences Engineering (SCELSE). 2024-11-27T06:11:06Z 2024-11-27T06:11:06Z 2024 Journal Article Lam, H. Y. I., Guan, J. S., Ong, X. E., Pincket, R. & Mu, Y. (2024). Protein language models are performant in structure-free virtual screening. Briefings in Bioinformatics, 25(6). https://dx.doi.org/10.1093/bib/bbae480 1477-4054 https://hdl.handle.net/10356/181379 10.1093/bib/bbae480 39327890 2-s2.0-85205151848 6 25 en RG97/22 Briefings in Bioinformatics © The Author(s) 2024. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Medicine, Health and Life Sciences
Virtual screening
Computer-aided drug design
spellingShingle Medicine, Health and Life Sciences
Virtual screening
Computer-aided drug design
Lam, Hilbert Yuen In
Guan, Jia Sheng
Ong, Xing Er
Pincket, Robbe
Mu, Yuguang
Protein language models are performant in structure-free virtual screening
description Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This work demonstrates that by employing protein language models and molecular graphs as inputs to a novel graph-to-transformer cross-attention mechanism, a screening power comparable to state-of-the-art structure-based models can be achieved. The implications thereof include highly expedited VS due to the greatly reduced compute required to run this model, and the ability to perform early stages of computer-aided drug design in the complete absence of 3D protein structures.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Lam, Hilbert Yuen In
Guan, Jia Sheng
Ong, Xing Er
Pincket, Robbe
Mu, Yuguang
format Article
author Lam, Hilbert Yuen In
Guan, Jia Sheng
Ong, Xing Er
Pincket, Robbe
Mu, Yuguang
author_sort Lam, Hilbert Yuen In
title Protein language models are performant in structure-free virtual screening
title_short Protein language models are performant in structure-free virtual screening
title_full Protein language models are performant in structure-free virtual screening
title_fullStr Protein language models are performant in structure-free virtual screening
title_full_unstemmed Protein language models are performant in structure-free virtual screening
title_sort protein language models are performant in structure-free virtual screening
publishDate 2024
url https://hdl.handle.net/10356/181379
_version_ 1819112969705881600