Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning

Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer...

Full description

Saved in:
Bibliographic Details
Main Author: Yu T.
Other Authors: Mahidol University
Format: Article
Published: 2023
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/81655
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.81655
record_format dspace
spelling th-mahidol.816552023-05-19T14:35:41Z Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning Yu T. Mahidol University Biochemistry, Genetics and Molecular Biology Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure–activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure–activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts. 2023-05-19T07:35:41Z 2023-05-19T07:35:41Z 2023-02-01 Article Molecules Vol.28 No.4 (2023) 10.3390/molecules28041679 14203049 36838665 2-s2.0-85149053552 https://repository.li.mahidol.ac.th/handle/123456789/81655 SCOPUS
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Biochemistry, Genetics and Molecular Biology
spellingShingle Biochemistry, Genetics and Molecular Biology
Yu T.
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
description Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure–activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure–activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts.
author2 Mahidol University
author_facet Mahidol University
Yu T.
format Article
author Yu T.
author_sort Yu T.
title Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
title_short Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
title_full Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
title_fullStr Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
title_full_unstemmed Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
title_sort exploring the chemical space of cyp17a1 inhibitors using cheminformatics and machine learning
publishDate 2023
url https://repository.li.mahidol.ac.th/handle/123456789/81655
_version_ 1781414470443073536