PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA

The significant increase in data quantity in the modern era presents challenges for mathematical models, which become increasingly complex as data grows. This study proposes parametric Gaussian process regression as a solution, utilizing induction points or centroids from data clusters to reduce...

Full description

Saved in:
Bibliographic Details
Main Author: Rizky Kosasih, Kahfi
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/81467
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:81467
spelling id-itb.:814672024-06-27T09:36:52ZPARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA Rizky Kosasih, Kahfi Indonesia Final Project Gaussian process, parametric gaussian process regression, induction points, k-means clustering, big data, average temperature of java island. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/81467 The significant increase in data quantity in the modern era presents challenges for mathematical models, which become increasingly complex as data grows. This study proposes parametric Gaussian process regression as a solution, utilizing induction points or centroids from data clusters to reduce data quantity without sacrificing performance. The study aims to explore the potential of using centroids as observational data representations, validate the model through simulation, and implement it on synthetic and real data, specifically the average temperature from 12 weather stations on Java Island. The results indicate that the proposed model significantly outperforms standard resampling methods by up to 30% in terms of consistency and accuracy, using standardized mean squared error (SMSE), and effectively captures external factors through kernel combinations. In conclusion, employing induction points in Gaussian process regression transforms it into a parametric model with stable computational efficiency despite increasing observational data, and further exploration is recommended for various kernels and other observational data. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description The significant increase in data quantity in the modern era presents challenges for mathematical models, which become increasingly complex as data grows. This study proposes parametric Gaussian process regression as a solution, utilizing induction points or centroids from data clusters to reduce data quantity without sacrificing performance. The study aims to explore the potential of using centroids as observational data representations, validate the model through simulation, and implement it on synthetic and real data, specifically the average temperature from 12 weather stations on Java Island. The results indicate that the proposed model significantly outperforms standard resampling methods by up to 30% in terms of consistency and accuracy, using standardized mean squared error (SMSE), and effectively captures external factors through kernel combinations. In conclusion, employing induction points in Gaussian process regression transforms it into a parametric model with stable computational efficiency despite increasing observational data, and further exploration is recommended for various kernels and other observational data.
format Final Project
author Rizky Kosasih, Kahfi
spellingShingle Rizky Kosasih, Kahfi
PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
author_facet Rizky Kosasih, Kahfi
author_sort Rizky Kosasih, Kahfi
title PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
title_short PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
title_full PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
title_fullStr PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
title_full_unstemmed PARAMETRIC GAUSSIAN PROCESS REGRESSION FOR BIG DATA
title_sort parametric gaussian process regression for big data
url https://digilib.itb.ac.id/gdl/view/81467
_version_ 1822997330918375424