VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH

In recent years, models have been developed to reconstruct visual stimulus images based on EEG signals. A popular approach currently used is contrastive learning, as it enables training with unlabeled data. However, several datasets utilized in existing studies were collected using the block desi...

Full description

Saved in:

Bibliographic Details
Main Author:	Rizqullah Ecaldy, Rheza
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/87995
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:87995
spelling	id-itb.:879952025-02-05T13:56:06ZVISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH Rizqullah Ecaldy, Rheza Indonesia Final Project EEG, Block design, Contrastive learning, Stable diffusion INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/87995 In recent years, models have been developed to reconstruct visual stimulus images based on EEG signals. A popular approach currently used is contrastive learning, as it enables training with unlabeled data. However, several datasets utilized in existing studies were collected using the block design approach, which affects the validity of the research outcomes. Therefore, there is a need to develop models with contrastive learning using datasets collected under improved design conditions. To achieve this goal, a model was developed using the Alljoined1 dataset, employing contrastive learning and stable diffusion model to generate visual stimulus images. First, the encoder model was trained using contrastive learning to learn the latent space of image embeddings produced by the pretrained CLIP model from EEG signal inputs. The EEG embeddings were then aligned using a diffusion prior. Finally, these EEG embeddings were used as input to generate images through the pretrained stable diffusion model. This model successfully generated visual reconstruction images with semantic evaluation metrics, achieving two-way identification scores of 0.5094 for the CLIP model, 0.4892 for the Inception model, 0.4570 for the AlexNet(2) model, 0.5239 for the AlexNet(5) model, and a SwAV distance score of 0.6812. Additionally, analyses were conducted to observe the effects of subject-specific EEG variability and frequency bands on model performance. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	In recent years, models have been developed to reconstruct visual stimulus images based on EEG signals. A popular approach currently used is contrastive learning, as it enables training with unlabeled data. However, several datasets utilized in existing studies were collected using the block design approach, which affects the validity of the research outcomes. Therefore, there is a need to develop models with contrastive learning using datasets collected under improved design conditions. To achieve this goal, a model was developed using the Alljoined1 dataset, employing contrastive learning and stable diffusion model to generate visual stimulus images. First, the encoder model was trained using contrastive learning to learn the latent space of image embeddings produced by the pretrained CLIP model from EEG signal inputs. The EEG embeddings were then aligned using a diffusion prior. Finally, these EEG embeddings were used as input to generate images through the pretrained stable diffusion model. This model successfully generated visual reconstruction images with semantic evaluation metrics, achieving two-way identification scores of 0.5094 for the CLIP model, 0.4892 for the Inception model, 0.4570 for the AlexNet(2) model, 0.5239 for the AlexNet(5) model, and a SwAV distance score of 0.6812. Additionally, analyses were conducted to observe the effects of subject-specific EEG variability and frequency bands on model performance.
format	Final Project
author	Rizqullah Ecaldy, Rheza
spellingShingle	Rizqullah Ecaldy, Rheza VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
author_facet	Rizqullah Ecaldy, Rheza
author_sort	Rizqullah Ecaldy, Rheza
title	VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
title_short	VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
title_full	VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
title_fullStr	VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
title_full_unstemmed	VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH
title_sort	visual stimulus reconstruction based on eeg signals using stable diffusion model with contrastive learning approach
url	https://digilib.itb.ac.id/gdl/view/87995
_version_	1823658411515969536

VISUAL STIMULUS RECONSTRUCTION BASED ON EEG SIGNALS USING STABLE DIFFUSION MODEL WITH CONTRASTIVE LEARNING APPROACH

Similar Items