FASSSTER Data Pipeline and DevOps

In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important...

Full description

Saved in:
Bibliographic Details
Main Authors: Tamayo, Lenard Paulo V, Pulmano, Christian E, Santos, Romel John, Buhain, Jay-Arr, Ico, Raven
Format: text
Published: Archīum Ateneo 2023
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/415
https://doi.org/10.1007/978-981-99-3153-8_3
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1415
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-14152024-04-29T08:16:10Z FASSSTER Data Pipeline and DevOps Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques. 2023-08-08T07:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Data pipeline
Data analytics
Data visualization
Analytical, Diagnostic and Therapeutic Techniques and Equipment
Data Science
Epidemiology
Medicine and Health Sciences
Physical Sciences and Mathematics
Public Health
spellingShingle Data pipeline
Data analytics
Data visualization
Analytical, Diagnostic and Therapeutic Techniques and Equipment
Data Science
Epidemiology
Medicine and Health Sciences
Physical Sciences and Mathematics
Public Health
Tamayo, Lenard Paulo V
Pulmano, Christian E
Santos, Romel John
Buhain, Jay-Arr
Ico, Raven
FASSSTER Data Pipeline and DevOps
description In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques.
format text
author Tamayo, Lenard Paulo V
Pulmano, Christian E
Santos, Romel John
Buhain, Jay-Arr
Ico, Raven
author_facet Tamayo, Lenard Paulo V
Pulmano, Christian E
Santos, Romel John
Buhain, Jay-Arr
Ico, Raven
author_sort Tamayo, Lenard Paulo V
title FASSSTER Data Pipeline and DevOps
title_short FASSSTER Data Pipeline and DevOps
title_full FASSSTER Data Pipeline and DevOps
title_fullStr FASSSTER Data Pipeline and DevOps
title_full_unstemmed FASSSTER Data Pipeline and DevOps
title_sort fassster data pipeline and devops
publisher Archīum Ateneo
publishDate 2023
url https://archium.ateneo.edu/discs-faculty-pubs/415
https://doi.org/10.1007/978-981-99-3153-8_3
_version_ 1800919426736848896