FASSSTER Data Pipeline and DevOps

In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important...

Full description

Saved in:

Bibliographic Details
Main Authors:	Tamayo, Lenard Paulo V, Pulmano, Christian E, Santos, Romel John, Buhain, Jay-Arr, Ico, Raven
Format:	text
Published:	Archīum Ateneo 2023
Subjects:	Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health
Online Access:	https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Ateneo De Manila University

id	ph-ateneo-arc.discs-faculty-pubs-1415
record_format	eprints
spelling	ph-ateneo-arc.discs-faculty-pubs-14152024-04-29T08:16:10Z FASSSTER Data Pipeline and DevOps Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques. 2023-08-08T07:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health
institution	Ateneo De Manila University
building	Ateneo De Manila University Library
continent	Asia
country	Philippines Philippines
content_provider	Ateneo De Manila University Library
collection	archium.Ateneo Institutional Repository
topic	Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health
spellingShingle	Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven FASSSTER Data Pipeline and DevOps
description	In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques.
format	text
author	Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven
author_facet	Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven
author_sort	Tamayo, Lenard Paulo V
title	FASSSTER Data Pipeline and DevOps
title_short	FASSSTER Data Pipeline and DevOps
title_full	FASSSTER Data Pipeline and DevOps
title_fullStr	FASSSTER Data Pipeline and DevOps
title_full_unstemmed	FASSSTER Data Pipeline and DevOps
title_sort	fassster data pipeline and devops
publisher	Archīum Ateneo
publishDate	2023
url	https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3
_version_	1800919426736848896

FASSSTER Data Pipeline and DevOps

Similar Items