Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering

With the increasing digitalization of financial transactions and the rise of cybercrime, combating money laundering has become increasingly complex. Graph-based machine learning techniques have emerged as promising tools for Anti-Money Laundering (AML) detection, capable of capturing intricate relat...

Full description

Saved in:
Bibliographic Details
Main Author: Effendi, Fabrianne
Other Authors: Anupam Chattopadhyay
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175347
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175347
record_format dspace
spelling sg-ntu-dr.10356-1753472024-06-03T01:39:30Z Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering Effendi, Fabrianne Anupam Chattopadhyay School of Computer Science and Engineering anupam@ntu.edu.sg Computer and Information Science Privacy-preserving Graph-based machine learning Fully homomorphic encryption Anti-money laundering With the increasing digitalization of financial transactions and the rise of cybercrime, combating money laundering has become increasingly complex. Graph-based machine learning techniques have emerged as promising tools for Anti-Money Laundering (AML) detection, capable of capturing intricate relationships within money laundering networks. However, the effectiveness of AML solutions is hindered by the challenge of data silos within financial institutions, limiting collaboration and reducing overall efficacy. To address these challenges, this research presents a novel privacy-preserving approach for collaborative AML machine learning, facilitating secure data sharing across institutions and borders while preserving data privacy and regulatory compliance. Leveraging Fully Homomorphic Encryption (FHE), computations can be performed on encrypted data without decryption, ensuring sensitive financial data remains confidential. Notably, this research explores the integration of Fully Homomorphic Encryption over the Torus (TFHE) with graph-based machine learning techniques, marking a pioneering effort in this field. The research contributes to the development of an extensible Graph Neural Network (GNN) pipeline, integrating TFHE using Concrete ML. Although progress has been made in implementing techniques such as quantization and pruning to render the GNN FHE-compatible, challenges persist in compiling the pipeline due to the complexity of integrating GNN with Concrete ML. Nonetheless, the insights gained from this development process lay the groundwork for future research in this area. In parallel, a privacy-preserving graph-based gradient boosting pipeline was successfully developed, leveraging Graph Feature Preprocessor (GFP) to enhance XGBoost model performance on AML datasets. Through a series of experiments, the trade-offs between model performance and privacy were evaluated, highlighting the potential of the pipeline in balancing between the two aspects. This work lays the foundation for innovative approaches in safeguarding financial systems against illicit activities, paving the way for future endeavors in privacy-preserving machine learning in AML detection. Bachelor's degree 2024-04-22T08:12:40Z 2024-04-22T08:12:40Z 2024 Final Year Project (FYP) Effendi, F. (2024). Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175347 https://hdl.handle.net/10356/175347 en SCSE23-0246 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Privacy-preserving
Graph-based machine learning
Fully homomorphic encryption
Anti-money laundering
spellingShingle Computer and Information Science
Privacy-preserving
Graph-based machine learning
Fully homomorphic encryption
Anti-money laundering
Effendi, Fabrianne
Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
description With the increasing digitalization of financial transactions and the rise of cybercrime, combating money laundering has become increasingly complex. Graph-based machine learning techniques have emerged as promising tools for Anti-Money Laundering (AML) detection, capable of capturing intricate relationships within money laundering networks. However, the effectiveness of AML solutions is hindered by the challenge of data silos within financial institutions, limiting collaboration and reducing overall efficacy. To address these challenges, this research presents a novel privacy-preserving approach for collaborative AML machine learning, facilitating secure data sharing across institutions and borders while preserving data privacy and regulatory compliance. Leveraging Fully Homomorphic Encryption (FHE), computations can be performed on encrypted data without decryption, ensuring sensitive financial data remains confidential. Notably, this research explores the integration of Fully Homomorphic Encryption over the Torus (TFHE) with graph-based machine learning techniques, marking a pioneering effort in this field. The research contributes to the development of an extensible Graph Neural Network (GNN) pipeline, integrating TFHE using Concrete ML. Although progress has been made in implementing techniques such as quantization and pruning to render the GNN FHE-compatible, challenges persist in compiling the pipeline due to the complexity of integrating GNN with Concrete ML. Nonetheless, the insights gained from this development process lay the groundwork for future research in this area. In parallel, a privacy-preserving graph-based gradient boosting pipeline was successfully developed, leveraging Graph Feature Preprocessor (GFP) to enhance XGBoost model performance on AML datasets. Through a series of experiments, the trade-offs between model performance and privacy were evaluated, highlighting the potential of the pipeline in balancing between the two aspects. This work lays the foundation for innovative approaches in safeguarding financial systems against illicit activities, paving the way for future endeavors in privacy-preserving machine learning in AML detection.
author2 Anupam Chattopadhyay
author_facet Anupam Chattopadhyay
Effendi, Fabrianne
format Final Year Project
author Effendi, Fabrianne
author_sort Effendi, Fabrianne
title Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
title_short Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
title_full Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
title_fullStr Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
title_full_unstemmed Privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
title_sort privacy-preserving graph-based machine learning with fully homomorphic encryption for collaborative anti-money laundering
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175347
_version_ 1800916185252888576