Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system
The rating matrix of a personalized recommendation system contains a high percentage of unknown rating scores which lowers the quality of the prediction. Besides, during data streaming into memory, some rating scores are misplaced from its appropriate cell in the rating matrix which also decrease th...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Science Publications
2016
|
Online Access: | http://psasir.upm.edu.my/id/eprint/54173/1/Ensemble%20divide%20and%20conquer%20approach%20to%20solve%20the%20rating%20scores%E2%80%99%20deviation%20in%20recommendation%20system.pdf http://psasir.upm.edu.my/id/eprint/54173/ http://thescipub.com/abstract/10.3844/jcssp.2016.265.275 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English |
id |
my.upm.eprints.54173 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.541732018-03-02T02:14:12Z http://psasir.upm.edu.my/id/eprint/54173/ Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system Al-Hadi, Ismail Ahmed Al-Qasem Mohd Sharef, Nurfadhlina Sulaiman, Md Nasir Mustapha, Norwati The rating matrix of a personalized recommendation system contains a high percentage of unknown rating scores which lowers the quality of the prediction. Besides, during data streaming into memory, some rating scores are misplaced from its appropriate cell in the rating matrix which also decrease the quality of the prediction. The singular value decomposition algorithm predicts the unknown rating scores based on the relation between the implicit feedback of both users and items, but exploiting neither the user similarity nor item similarity which leads to low accuracy predictions. There are several factorization methods used in improving the prediction performance of the collaborative filtering technique such as baseline, matrix factorization, neighbour-base. However, the prediction performance of the collaborative filtering using factorization methods is still low while baseline and neighbours-base have limitations in terms of over fitting. Therefore, this paper proposes Ensemble Divide and Conquer (EDC) approach for solving 2 main problems which are the data sparsity and the rating scores’ deviation (misplace). The EDC approach is founded by the Singular Value Decomposition (SVD) algorithm which extracts the relationship between the latent feedback of users and the latent feedback of the items. Furthermore, this paper addresses the scale of rating scores as a sub problem which effect on the rank approximation among the users’ features. The latent feedback of the users and items are also SVD factors. The results using the EDC approach are more accurate than collaborative filtering and existing methods of matrix factorization namely SVD, baseline, matrix factorization and neighbours-base. This indicates the significance of the latent feedback of both users and items against the different factorization features in improving the prediction accuracy of the collaborative filtering technique. Science Publications 2016 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/54173/1/Ensemble%20divide%20and%20conquer%20approach%20to%20solve%20the%20rating%20scores%E2%80%99%20deviation%20in%20recommendation%20system.pdf Al-Hadi, Ismail Ahmed Al-Qasem and Mohd Sharef, Nurfadhlina and Sulaiman, Md Nasir and Mustapha, Norwati (2016) Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system. Journal of Computer Science, 12 (6). pp. 265-275. ISSN 1549-3636; ESSN: 1552-6607 http://thescipub.com/abstract/10.3844/jcssp.2016.265.275 10.3844/jcssp.2016.265.275 |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
The rating matrix of a personalized recommendation system contains a high percentage of unknown rating scores which lowers the quality of the prediction. Besides, during data streaming into memory, some rating scores are misplaced from its appropriate cell in the rating matrix which also decrease the quality of the prediction. The singular value decomposition algorithm predicts the unknown rating scores based on the relation between the implicit feedback of both users and items, but exploiting neither the user similarity nor item similarity which leads to low accuracy predictions. There are several factorization methods used in improving the prediction performance of the collaborative filtering technique such as baseline, matrix factorization, neighbour-base. However, the prediction performance of the collaborative filtering using factorization methods is still low while baseline and neighbours-base have limitations in terms of over fitting. Therefore, this paper proposes Ensemble Divide and Conquer (EDC) approach for solving 2 main problems which are the data sparsity and the rating scores’ deviation (misplace). The EDC approach is founded by the Singular Value Decomposition (SVD) algorithm which extracts the relationship between the latent feedback of users and the latent feedback of the items. Furthermore, this paper addresses the scale of rating scores as a sub problem which effect on the rank approximation among the users’ features. The latent feedback of the users and items are also SVD factors. The results using the EDC approach are more accurate than collaborative filtering and existing methods of matrix factorization namely SVD, baseline, matrix factorization and neighbours-base. This indicates the significance of the latent feedback of both users and items against the different factorization features in improving the prediction accuracy of the collaborative filtering technique. |
format |
Article |
author |
Al-Hadi, Ismail Ahmed Al-Qasem Mohd Sharef, Nurfadhlina Sulaiman, Md Nasir Mustapha, Norwati |
spellingShingle |
Al-Hadi, Ismail Ahmed Al-Qasem Mohd Sharef, Nurfadhlina Sulaiman, Md Nasir Mustapha, Norwati Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
author_facet |
Al-Hadi, Ismail Ahmed Al-Qasem Mohd Sharef, Nurfadhlina Sulaiman, Md Nasir Mustapha, Norwati |
author_sort |
Al-Hadi, Ismail Ahmed Al-Qasem |
title |
Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
title_short |
Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
title_full |
Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
title_fullStr |
Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
title_full_unstemmed |
Ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
title_sort |
ensemble divide and conquer approach to solve the rating scores’ deviation in recommendation system |
publisher |
Science Publications |
publishDate |
2016 |
url |
http://psasir.upm.edu.my/id/eprint/54173/1/Ensemble%20divide%20and%20conquer%20approach%20to%20solve%20the%20rating%20scores%E2%80%99%20deviation%20in%20recommendation%20system.pdf http://psasir.upm.edu.my/id/eprint/54173/ http://thescipub.com/abstract/10.3844/jcssp.2016.265.275 |
_version_ |
1643835580443787264 |