Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Article |
Published: |
2023
|
Subjects: | |
Online Access: | https://repository.li.mahidol.ac.th/handle/123456789/81655 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Mahidol University |
id |
th-mahidol.81655 |
---|---|
record_format |
dspace |
spelling |
th-mahidol.816552023-05-19T14:35:41Z Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning Yu T. Mahidol University Biochemistry, Genetics and Molecular Biology Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure–activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure–activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts. 2023-05-19T07:35:41Z 2023-05-19T07:35:41Z 2023-02-01 Article Molecules Vol.28 No.4 (2023) 10.3390/molecules28041679 14203049 36838665 2-s2.0-85149053552 https://repository.li.mahidol.ac.th/handle/123456789/81655 SCOPUS |
institution |
Mahidol University |
building |
Mahidol University Library |
continent |
Asia |
country |
Thailand Thailand |
content_provider |
Mahidol University Library |
collection |
Mahidol University Institutional Repository |
topic |
Biochemistry, Genetics and Molecular Biology |
spellingShingle |
Biochemistry, Genetics and Molecular Biology Yu T. Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
description |
Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure–activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure–activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts. |
author2 |
Mahidol University |
author_facet |
Mahidol University Yu T. |
format |
Article |
author |
Yu T. |
author_sort |
Yu T. |
title |
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
title_short |
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
title_full |
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
title_fullStr |
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
title_full_unstemmed |
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning |
title_sort |
exploring the chemical space of cyp17a1 inhibitors using cheminformatics and machine learning |
publishDate |
2023 |
url |
https://repository.li.mahidol.ac.th/handle/123456789/81655 |
_version_ |
1781414470443073536 |