Exploring influencing factors on international tourist arrivals in the Philippines using clustering methods and negative binomial regression

K-means clustering algorithm is a commonly-used clustering algorithm with many advantages, such as simple understanding, realizing quickly, and processing large datasets conveniently. Count data often applies to many fields, such as medicine, sociology, and psychology. It is an essential statistical...

Full description

Saved in:
Bibliographic Details
Main Author: An, Fengyi
Format: text
Language:English
Published: Animo Repository 2022
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etdm_math/6
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1005&context=etdm_math
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
Description
Summary:K-means clustering algorithm is a commonly-used clustering algorithm with many advantages, such as simple understanding, realizing quickly, and processing large datasets conveniently. Count data often applies to many fields, such as medicine, sociology, and psychology. It is an essential statistical data type. Count data is analyzed using some frequently-used models, such as the Poisson regression and the negative binomial regression models. The negative binomial regression model has the phenomenon of overdispersion, wherein the variance is greater than the mean, that exists in the count data. As a consequence, overdispersion data analysis has become a crucial statistical issue. This thesis focused on studying the application of K-means clustering and the negative binomial regression model in an overdispersed inbound tourism data of the Philippines from 2009 to 2018. The K-means method was used to cluster 58 countries or regions by purpose of travel in the Philippines. The negative binomial regression model was performed for each cluster to identify the determinants of foreign tourist arrivals in the Philippines. Results showed that only the pattern of the number of tourist arrivals for holiday purpose had a trend stationarity. The number of tourists for holiday purpose was expected to improve the development of tourism. In addition, influencing factors were found to vary among the different clusters.