Understanding single-protein fouling in micro- and ultrafiltration systems via machine-learning-based models

Protein fouling is a complex mechanism. To enhance understanding on protein fouling of membranes, the target of this study is twofold: (i) to determine the relative influences of parameters via the Random Forest (RF) model and (ii) to evaluate the predictive capability of the Neural Network (NN) mod...

Full description

Saved in:
Bibliographic Details
Main Authors: Tanudjaja, Henry Jonathan, Ng, Angie Qi Qi, Chew, Jia Wei
Other Authors: School of Chemistry, Chemical Engineering and Biotechnology
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170058
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Protein fouling is a complex mechanism. To enhance understanding on protein fouling of membranes, the target of this study is twofold: (i) to determine the relative influences of parameters via the Random Forest (RF) model and (ii) to evaluate the predictive capability of the Neural Network (NN) model. Membrane pore size is the most dominant influence on fouling followed by transmembrane pressure (TMP), while membrane configuration (i.e., flat-sheet, hollow fiber, or tubular) is the least dominant. The NN model gives modest predictive capability despite inconsistencies and variabilities of the experimental setups and protocols, which invariably affects the important parameters in the database compiled from past publications. The database was divided into microfiltration (MF) and ultrafiltration (UF) subsets based on the membrane pore size values. It was found that the dominant parameters for permeate flux are different, with membrane pore size and protein concentration being dominant for MF and UF, respectively, while TMP is dominant for protein rejection for both cases. For permeate flux, membrane material is the most dominant parameter for the non-BSA database, while membrane pore size remains the most dominant parameter for protein rejection regardless of the protein used. Results show that such data-driven RF and NN models can enhance the understanding on the relative dominance of the parameters on different phenomena and provide adequate prediction of protein fouling, in the absence of any governing equations.