Integrating apache spark and R for big data analytics on solving geographic problems

With the advent of digital technology and smart devices, a flood of digital data is being generated every day. This huge amount of data not only records historical activities but also provides valuable future information for organizations and businesses. However, the true value of these data will not...

Bibliographic Details
Main Authors: ZHANG, Mengqi, KAM, Tin Seong
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2017
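Subjects: Big Data Analytics; SparkR; Spatial Interaction Model; Geospatial Analytics; R Shiny; Categorical Data Analysis; Databases and Information Systems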
Online Access: https://ink.library.smu.edu.sg/sis_research/3830
https://ink.library.smu.edu.sg/context/sis_research/article/4832/viewcontent/Presentation_SparkR_Zhangmengqi.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-4832
record_format dspace
spelling sg-smu-ink.sis_research-4832 2017-11-17T01:20:57Z 2017-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3830 https://ink.library.smu.edu.sg/context/sis_research/article/4832/viewcontent/Presentation_SparkR_Zhangmengqi.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Big Data Analytics
SparkR
Spatial Interaction Model
Geospatial Analytics
R Shiny
Categorical Data Analysis
Databases and Information Systems
description With the advent of digital technology and smart devices, a flood of digital data is being generated every day. This huge amount of data not only records historical activities but also provides valuable future information for organizations and businesses. However, the true value of these data will not be fully appreciated until they have been processed and analyzed, and the analysis results have been communicated to decision makers in a business-friendly manner. In view of this need, big data has been one of the major research focuses in the academic research community, especially in the field of computer science, and among software vendors and big data service providers. However, the majority of current academic research and practical development efforts tend to focus on the technological aspect of big data. To fill the current research gap in big data, especially big data analytics, this study investigates a possible approach to designing and implementing a big data analytics application by integrating an open source big data processing framework and an open source data analysis environment. The presentation aims to share our findings and the experience gained through working on this study. It consists of five sections. First, the motivation, objective and scope of the study will be presented. This is followed by a review of related literature on big data and big data analytics. In Section 3, the case scenario and data used in the study will be introduced. A detailed discussion of the data preparation will be presented too. Next, we will introduce the analysis and modelling method used in this study. In Section 5, the user interface of the application designed for this study will be discussed. A use case will then be used to demonstrate and evaluate the performance and analysis of the application. Finally, the overall conclusion, lessons learned and directions for future research will be presented.
format text
author ZHANG, Mengqi
KAM, Tin Seong
title Integrating apache spark and R for big data analytics on solving geographic problems