Topological feature engineering for machine learning based halide perovskite materials design

Accelerated materials development with machine learning (ML) assisted screening and high throughput experimentation for new photovoltaic materials holds the key to addressing our grand energy challenges. Data-driven ML is envisaged as a decisive enabler for new perovskite materials discovery. Howeve...

Full description

Saved in:
Bibliographic Details
Main Authors: Anand, D. Vijay, Xu, Qiang, Wee, Junjie, Xia, Kelin, Sum, Tze Chien
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/165413
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-165413
record_format dspace
spelling sg-ntu-dr.10356-1654132023-03-29T09:36:37Z Topological feature engineering for machine learning based halide perovskite materials design Anand, D. Vijay Xu, Qiang Wee, Junjie Xia, Kelin Sum, Tze Chien School of Physical and Mathematical Sciences Science::Physics Feature Engineering Halide Perovskites Accelerated materials development with machine learning (ML) assisted screening and high throughput experimentation for new photovoltaic materials holds the key to addressing our grand energy challenges. Data-driven ML is envisaged as a decisive enabler for new perovskite materials discovery. However, its full potential can be severely curtailed by poorly represented molecular descriptors (or fingerprints). Optimal descriptors are essential for establishing effective mathematical representations of quantitative structure-property relationships. Here we reveal that our persistent functions (PFs) based learning models offer significant accuracy advantages over traditional descriptor based models in organic-inorganic halide perovskite (OIHP) materials design and have similar performance as deep learning models. Our multiscale simplicial complex approach not only provides a more precise representation for OIHP structures and underlying interactions, but also has better transferability to ML models. Our results demonstrate that advanced geometrical and topological invariants are highly efficient feature engineering approaches that can markedly improve the performance of learning models for molecular data analysis. Further, new structure-property relationships can be established between our invariants and bandgaps. We anticipate that our molecular representations and featurization models will transcend the limitations of conventional approaches and lead to breakthroughs in perovskite materials design and discovery. Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) Published version This work was supported in part by Nanyang Technological University Startup Grant M4081842.110, Singapore Ministry of Education Academic Research fund Tier 1 grant RG109/19 and Tier 2 grants MOE-T2EP50120-0004 and MOE-T2EP20120-0013, as well as the National Research Foundation (NRF), Singapore under its NRF Investigatorship (NRF-NRFI2018-04). 2023-03-27T03:06:03Z 2023-03-27T03:06:03Z 2022 Journal Article Anand, D. V., Xu, Q., Wee, J., Xia, K. & Sum, T. C. (2022). Topological feature engineering for machine learning based halide perovskite materials design. Npj Computational Materials, 8(1). https://dx.doi.org/10.1038/s41524-022-00883-8 2057-3960 https://hdl.handle.net/10356/165413 10.1038/s41524-022-00883-8 2-s2.0-85138709678 1 8 en M4081842.110 RG109/19 MOE-T2EP50120-0004 MOE-T2EP20120-0013 NRF-NRFI-2018-04 npj Computational Materials 10.21979/N9/CVJWZ9 © The Author(s) 2022. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http:// creativecommons.org/licenses/by/4.0/. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Physics
Feature Engineering
Halide Perovskites
spellingShingle Science::Physics
Feature Engineering
Halide Perovskites
Anand, D. Vijay
Xu, Qiang
Wee, Junjie
Xia, Kelin
Sum, Tze Chien
Topological feature engineering for machine learning based halide perovskite materials design
description Accelerated materials development with machine learning (ML) assisted screening and high throughput experimentation for new photovoltaic materials holds the key to addressing our grand energy challenges. Data-driven ML is envisaged as a decisive enabler for new perovskite materials discovery. However, its full potential can be severely curtailed by poorly represented molecular descriptors (or fingerprints). Optimal descriptors are essential for establishing effective mathematical representations of quantitative structure-property relationships. Here we reveal that our persistent functions (PFs) based learning models offer significant accuracy advantages over traditional descriptor based models in organic-inorganic halide perovskite (OIHP) materials design and have similar performance as deep learning models. Our multiscale simplicial complex approach not only provides a more precise representation for OIHP structures and underlying interactions, but also has better transferability to ML models. Our results demonstrate that advanced geometrical and topological invariants are highly efficient feature engineering approaches that can markedly improve the performance of learning models for molecular data analysis. Further, new structure-property relationships can be established between our invariants and bandgaps. We anticipate that our molecular representations and featurization models will transcend the limitations of conventional approaches and lead to breakthroughs in perovskite materials design and discovery.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Anand, D. Vijay
Xu, Qiang
Wee, Junjie
Xia, Kelin
Sum, Tze Chien
format Article
author Anand, D. Vijay
Xu, Qiang
Wee, Junjie
Xia, Kelin
Sum, Tze Chien
author_sort Anand, D. Vijay
title Topological feature engineering for machine learning based halide perovskite materials design
title_short Topological feature engineering for machine learning based halide perovskite materials design
title_full Topological feature engineering for machine learning based halide perovskite materials design
title_fullStr Topological feature engineering for machine learning based halide perovskite materials design
title_full_unstemmed Topological feature engineering for machine learning based halide perovskite materials design
title_sort topological feature engineering for machine learning based halide perovskite materials design
publishDate 2023
url https://hdl.handle.net/10356/165413
_version_ 1762031110624116736