K-means clustering analysis and multiple linear regression model on household income in Malaysia
Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumsta...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | http://eprints.uthm.edu.my/10353/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf http://eprints.uthm.edu.my/10353/ https://doi.org/10.11591/ijai.v12.i2.pp731-738 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Tun Hussein Onn Malaysia |
Language: | English |
id |
my.uthm.eprints.10353 |
---|---|
record_format |
eprints |
spelling |
my.uthm.eprints.103532023-10-30T07:32:08Z http://eprints.uthm.edu.my/10353/ K-means clustering analysis and multiple linear regression model on household income in Malaysia Gan Pei Yee, Gan Pei Yee Rusiman, Mohd Saifullah Ismail, Shuhaida Suparman, Suparman Mohamad Hamzah, Firdaus Muhammad Ammar Shaf, Muhammad Ammar Shaf T Technology (General) Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumstances continue to be characterized by income disparity. Therefore, this shortcoming can be addressed by analyzing the household income survey (HIS) conducted by Department of Statistics Malaysia (DoSM). In this study, the hybrid model is proposed where K-means and multiple linear regression (MLR) for clustering and predicting household income in Malaysia. Based on the experimental results, the K-means clustering analysis in conjunction with the MLR model outperformed the MLR model without clustering with a smaller mean square error. As a result, clustering analysis results in a more accurate estimate of household income because it reduces the variation between households. It is important that household income information reflect the concern of policymakers about the impact of universal and targeted interventions on different socioeconomic groups. 2023 Article PeerReviewed text en http://eprints.uthm.edu.my/10353/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf Gan Pei Yee, Gan Pei Yee and Rusiman, Mohd Saifullah and Ismail, Shuhaida and Suparman, Suparman and Mohamad Hamzah, Firdaus and Muhammad Ammar Shaf, Muhammad Ammar Shaf (2023) K-means clustering analysis and multiple linear regression model on household income in Malaysia. IAES International Journal of Artificial Intelligence, 12 (2). pp. 731-738. ISSN 2252-8938 https://doi.org/10.11591/ijai.v12.i2.pp731-738 |
institution |
Universiti Tun Hussein Onn Malaysia |
building |
UTHM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tun Hussein Onn Malaysia |
content_source |
UTHM Institutional Repository |
url_provider |
http://eprints.uthm.edu.my/ |
language |
English |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Gan Pei Yee, Gan Pei Yee Rusiman, Mohd Saifullah Ismail, Shuhaida Suparman, Suparman Mohamad Hamzah, Firdaus Muhammad Ammar Shaf, Muhammad Ammar Shaf K-means clustering analysis and multiple linear regression model on household income in Malaysia |
description |
Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumstances continue to be characterized by income disparity. Therefore,
this shortcoming can be addressed by analyzing the household income survey (HIS) conducted by Department of Statistics Malaysia (DoSM). In this study, the hybrid model is proposed where K-means and multiple linear regression (MLR) for clustering and predicting household income in
Malaysia. Based on the experimental results, the K-means clustering analysis in conjunction with the MLR model outperformed the MLR model without clustering with a smaller mean square error. As a result, clustering analysis results in a more accurate estimate of household income because it reduces the variation between households. It is important that household income information reflect the concern of policymakers about the impact of universal and targeted interventions on different socioeconomic groups. |
format |
Article |
author |
Gan Pei Yee, Gan Pei Yee Rusiman, Mohd Saifullah Ismail, Shuhaida Suparman, Suparman Mohamad Hamzah, Firdaus Muhammad Ammar Shaf, Muhammad Ammar Shaf |
author_facet |
Gan Pei Yee, Gan Pei Yee Rusiman, Mohd Saifullah Ismail, Shuhaida Suparman, Suparman Mohamad Hamzah, Firdaus Muhammad Ammar Shaf, Muhammad Ammar Shaf |
author_sort |
Gan Pei Yee, Gan Pei Yee |
title |
K-means clustering analysis and multiple linear regression
model on household income in Malaysia |
title_short |
K-means clustering analysis and multiple linear regression
model on household income in Malaysia |
title_full |
K-means clustering analysis and multiple linear regression
model on household income in Malaysia |
title_fullStr |
K-means clustering analysis and multiple linear regression
model on household income in Malaysia |
title_full_unstemmed |
K-means clustering analysis and multiple linear regression
model on household income in Malaysia |
title_sort |
k-means clustering analysis and multiple linear regression
model on household income in malaysia |
publishDate |
2023 |
url |
http://eprints.uthm.edu.my/10353/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf http://eprints.uthm.edu.my/10353/ https://doi.org/10.11591/ijai.v12.i2.pp731-738 |
_version_ |
1781707469958938624 |