High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthe...

Full description

Saved in:
Bibliographic Details
Main Authors: Algamal, Z. Y., Lee, M. H., Al-Fakih, A. M., Aziz, M.
Format: Article
Published: Taylor and Francis Ltd. 2016
Subjects:
Online Access:http://eprints.utm.my/id/eprint/72108/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84987643691&doi=10.1080%2f1062936X.2016.1228696&partnerID=40&md5=4d4834740f41f51ed40fd692c7811449
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.72108
record_format eprints
spelling my.utm.721082017-11-23T06:19:24Z http://eprints.utm.my/id/eprint/72108/ High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm Algamal, Z. Y. Lee, M. H. Al-Fakih, A. M. Aziz, M. QD Chemistry In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds. Taylor and Francis Ltd. 2016 Article PeerReviewed Algamal, Z. Y. and Lee, M. H. and Al-Fakih, A. M. and Aziz, M. (2016) High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm. SAR and QSAR in Environmental Research, 27 (9). pp. 703-719. ISSN 1062-936X https://www.scopus.com/inward/record.uri?eid=2-s2.0-84987643691&doi=10.1080%2f1062936X.2016.1228696&partnerID=40&md5=4d4834740f41f51ed40fd692c7811449
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QD Chemistry
spellingShingle QD Chemistry
Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
description In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.
format Article
author Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
author_facet Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
author_sort Algamal, Z. Y.
title High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_short High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_full High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_fullStr High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_full_unstemmed High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_sort high-dimensional qsar modelling using penalized linear regression model with l1/2-norm
publisher Taylor and Francis Ltd.
publishDate 2016
url http://eprints.utm.my/id/eprint/72108/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84987643691&doi=10.1080%2f1062936X.2016.1228696&partnerID=40&md5=4d4834740f41f51ed40fd692c7811449
_version_ 1643656358804848640