FASSSTER Data Pipeline and DevOps
In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2023
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1415 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-14152024-04-29T08:16:10Z FASSSTER Data Pipeline and DevOps Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques. 2023-08-08T07:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health |
spellingShingle |
Data pipeline Data analytics Data visualization Analytical, Diagnostic and Therapeutic Techniques and Equipment Data Science Epidemiology Medicine and Health Sciences Physical Sciences and Mathematics Public Health Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven FASSSTER Data Pipeline and DevOps |
description |
In data science; the data pipeline serves as a methodological and potentially architectural framework for setting up systems that require near real-time monitoring through dashboards and visualization. The collection; aggregation; and analysis of data related to COVID-19 cases proved to be important in providing the community with the right information at the right time. In the beginning of the pandemic; the data used for interpretation came from different data sources. Some datasets were made available to the public by the Department of Health (DOH) by publishing a Google Drive that contained the datasets in spreadsheet format (http://bit.ly/DataDropPH). Eventually; DOH provided access to a BigQuery database to select groups where data can be automatically extracted on a daily basis. These datasets are extracted and ingested to a data warehouse for further analysis. Various data analysis and modeling techniques are applied to the data. As such; data analysis scripts are written using two popular programming languages; R and Python; to facilitate the processing and transformation of data. The stakeholders then view model outputs in a web-based visualization platform. This chapter describes the FASSSTER data pipeline; from extraction; preprocessing; and processing to produce outputs generated by analytics and models and corresponding data visualization techniques. |
format |
text |
author |
Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven |
author_facet |
Tamayo, Lenard Paulo V Pulmano, Christian E Santos, Romel John Buhain, Jay-Arr Ico, Raven |
author_sort |
Tamayo, Lenard Paulo V |
title |
FASSSTER Data Pipeline and DevOps |
title_short |
FASSSTER Data Pipeline and DevOps |
title_full |
FASSSTER Data Pipeline and DevOps |
title_fullStr |
FASSSTER Data Pipeline and DevOps |
title_full_unstemmed |
FASSSTER Data Pipeline and DevOps |
title_sort |
fassster data pipeline and devops |
publisher |
Archīum Ateneo |
publishDate |
2023 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/415 https://doi.org/10.1007/978-981-99-3153-8_3 |
_version_ |
1800919426736848896 |