Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression

K-means clustering algorithm is a commonly-used clustering algorithm with many advantages, such as simple understanding, realizing quickly, and processing large datasets conveniently. Count data often applies to many fields, such as medicine, sociology, and psychology. It is an essential statistical...

Full description

Saved in:
Bibliographic Details
Main Author: An, Fengyi
Format: text
Language:English
Published: Animo Repository 2022
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etdm_math/6
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1005&context=etdm_math
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etdm_math-1005
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etdm_math-10052022-07-05T02:59:38Z Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression An, Fengyi K-means clustering algorithm is a commonly-used clustering algorithm with many advantages, such as simple understanding, realizing quickly, and processing large datasets conveniently. Count data often applies to many fields, such as medicine, sociology, and psychology. It is an essential statistical data type. Count data is analyzed using some frequently-used models, such as the Poisson regression and the negative binomial regression models. The negative binomial regression model has the phenomenon of overdispersion, wherein the variance is greater than the mean, that exists in the count data. As a consequence, overdispersion data analysis has become a crucial statistical issue. This thesis focused on studying the application of K-means clustering and the negative binomial regression model in an overdispersed inbound tourism data of the Philippines from 2009 to 2018. The K-means method was used to cluster 58 countries or regions by purpose of travel in the Philippines. The negative binomial regression model was performed for each cluster to identify the determinants of foreign tourist arrivals in the Philippines. Results showed that only the pattern of the number of tourist arrivals for holiday purpose had a trend stationarity. The number of tourists for holiday purpose was expected to improve the development of tourism. In addition, influencing factors were found to vary among the different clusters. 2022-01-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdm_math/6 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1005&context=etdm_math Mathematics and Statistics Master's Theses English Animo Repository Negative binomial distribution Statistics and Probability
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Negative binomial distribution
Statistics and Probability
spellingShingle Negative binomial distribution
Statistics and Probability
An, Fengyi
Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
description K-means clustering algorithm is a commonly-used clustering algorithm with many advantages, such as simple understanding, realizing quickly, and processing large datasets conveniently. Count data often applies to many fields, such as medicine, sociology, and psychology. It is an essential statistical data type. Count data is analyzed using some frequently-used models, such as the Poisson regression and the negative binomial regression models. The negative binomial regression model has the phenomenon of overdispersion, wherein the variance is greater than the mean, that exists in the count data. As a consequence, overdispersion data analysis has become a crucial statistical issue. This thesis focused on studying the application of K-means clustering and the negative binomial regression model in an overdispersed inbound tourism data of the Philippines from 2009 to 2018. The K-means method was used to cluster 58 countries or regions by purpose of travel in the Philippines. The negative binomial regression model was performed for each cluster to identify the determinants of foreign tourist arrivals in the Philippines. Results showed that only the pattern of the number of tourist arrivals for holiday purpose had a trend stationarity. The number of tourists for holiday purpose was expected to improve the development of tourism. In addition, influencing factors were found to vary among the different clusters.
format text
author An, Fengyi
author_facet An, Fengyi
author_sort An, Fengyi
title Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
title_short Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
title_full Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
title_fullStr Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
title_full_unstemmed Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression
title_sort exploring influencing factors on international tourist arrivals in the philippines using clustering methods and negative binomial regression
publisher Animo Repository
publishDate 2022
url https://animorepository.dlsu.edu.ph/etdm_math/6
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1005&context=etdm_math
_version_ 1738854789889916928