Utilizing machine learning techniques to predict the blood-brain barrier permeability of compounds detected using LCQTOF-MS in Malaysian Kelulut honey

Current in silico modelling techniques, such as molecular dynamics, typically focus on compounds with the highest concentration from chromatographic analyses for bioactivity screening. Consequently, they reduce the need for labour-intensive in vitro studies but limit the utilization of extensive chr...

Full description

Saved in:
Bibliographic Details
Main Authors: Raihana Zahirah, Edros, Feng, Tan wei, Dong, RuiHai
Format: Article
Language:English
English
Published: Taylor & Francis 2023
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/42715/1/Utilizing%20machine%20learning%20techniques%20to%20predict%20the%20blood%20brain%20barrier%20permeability%20of%20compounds%20detected%20using%20LCQTOF%20MS%20in%20Malaysian%20Kelulut.pdf
http://umpir.ump.edu.my/id/eprint/42715/2/Utilizing%20machine%20learning%20techniques%20to%20predict%20the%20blood%20brain%20barrier%20permeability%20of%20compounds%20detected%20using%20LCQTOF%20MS%20in%20Malaysian%20Kelulut%20honey.pdf
http://umpir.ump.edu.my/id/eprint/42715/
https://doi.org/10.1080/1062936X.2023.2230868
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang Al-Sultan Abdullah
Language: English
English
Description
Summary:Current in silico modelling techniques, such as molecular dynamics, typically focus on compounds with the highest concentration from chromatographic analyses for bioactivity screening. Consequently, they reduce the need for labour-intensive in vitro studies but limit the utilization of extensive chromatographic data and molecular diversity for compound classification. Compound permeability across the blood–brain barrier (BBB) is a key concern in central nervous system (CNS) drug development, and this limitation can be addressed by applying cheminformatics with codeless machine learning (ML). Among the four models developed in this study, the Random Forest (RF) algorithm with the most robust performance in both internal and external validation was selected for model construction, with an accuracy (ACC) of 87.5% and 86.9% and area under the curve (AUC) of 0.907 and 0.726, respectively. The RF model was deployed to classify 285 compounds detected using liquid chromatography quadrupole time-of-flight mass spectrometry (LCQTOF-MS) in Kelulut honey; of which, 140 compounds were screened with 94 descriptors. Seventeen compounds were predicted to permeate the BBB, revealing their potential as drugs for treating neurodegenerative diseases. Our results highlight the importance of employing ML pattern recognition to identify compounds with neuroprotective potential from the entire pool of chromatographic data.