Compositional coding for collaborative filtering

Efficiency is crucial to online recommender systems, especially those that need to deal with tens of millions of users and items. Because representing users and items as binary vectors for Collaborative Filtering (CF) can achieve fast user-item affinity computation in the Hamming space,...

Bibliographic Details
Main Authors: LIU, Chenghao, LU, Tao, WANG, Xin, CHENG, Zhiyong, SUN, Jianling, HOI, Steven C. H.
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2019
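Subjects: Collaborative Filtering; Discrete Hashing; Recommendation; Databases and Information Systems; Software Engineering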
Online Access: https://ink.library.smu.edu.sg/sis_research/5100
https://ink.library.smu.edu.sg/context/sis_research/article/6103/viewcontent/Compositional_coding_collaborative_filtering_pv_oa.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-6103
record_format dspace
spelling sg-smu-ink.sis_research-6103 2020-04-17T06:06:55Z Compositional coding for collaborative filtering LIU, Chenghao LU, Tao WANG, Xin CHENG, Zhiyong SUN, Jianling HOI, Steven C. H. 2019-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5100 info:doi/10.1145/3331184.3331206 https://ink.library.smu.edu.sg/context/sis_research/article/6103/viewcontent/Compositional_coding_collaborative_filtering_pv_oa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Collaborative Filtering Discrete Hashing Recommendation Databases and Information Systems Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Collaborative Filtering
Discrete Hashing
Recommendation
Databases and Information Systems
Software Engineering
description Efficiency is crucial to online recommender systems, especially those that need to deal with tens of millions of users and items. Because representing users and items as binary vectors for Collaborative Filtering (CF) enables fast user-item affinity computation in the Hamming space, recent years have seen an emerging research effort to exploit binary hashing techniques for CF methods. However, CF with binary codes naturally suffers from low accuracy due to the limited representation capability of each bit, which prevents it from modeling the complex structure of the data. In this work, we attempt to improve efficiency without hurting model performance by utilizing both the accuracy of real-valued vectors and the efficiency of binary codes to represent users/items. In particular, we propose the Compositional Coding for Collaborative Filtering (CCCF) framework, which not only gains better recommendation efficiency than the state-of-the-art binarized CF approaches but also achieves even higher accuracy than the real-valued CF method. Specifically, CCCF represents each user/item with a set of binary vectors, which are associated with a sparse real-valued weight vector. Each value of the weight vector encodes the importance of the corresponding binary vector to the user/item. The continuous weight vectors greatly enhance the representation capability of binary codes, and their sparsity guarantees the processing speed. Furthermore, an integer weight approximation scheme is proposed to further accelerate computation. Based on the CCCF framework, we design an efficient discrete optimization algorithm to learn its parameters. Extensive experiments on three real-world datasets show that our method outperforms the state-of-the-art binarized CF methods (and even achieves better performance than the real-valued CF method) by a large margin in terms of both recommendation accuracy and efficiency. We publish our project at https://github.com/3140102441/CCCF.
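The representation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the scoring form (a sum over components of the product of user weight, item weight, and the inner product of the matching binary codes) and the names `hamming_inner_product` and `compositional_score` are assumptions made for illustration only. The one fact relied on is standard: for two r-bit codes read as +1/-1 vectors, the inner product equals r minus twice the Hamming distance, so it can be computed with XOR and a bit count.

```python
def hamming_inner_product(b: int, d: int, r: int) -> int:
    """Inner product of two r-bit codes interpreted as +1/-1 vectors.

    Uses the identity <b, d> = r - 2 * Hamming(b, d), where the Hamming
    distance is the number of set bits in b XOR d.
    """
    mask = (1 << r) - 1
    hamming = bin((b ^ d) & mask).count("1")
    return r - 2 * hamming


def compositional_score(user_codes, user_weights, item_codes, item_weights, r):
    """Hypothetical affinity score: weighted sum of per-component code products.

    Assumed form (not taken from the record): for each component k, multiply
    the user weight, the item weight, and the inner product of the k-th user
    and item codes. Components with a zero weight are skipped, which is where
    sparsity in the weight vectors saves work.
    """
    score = 0.0
    for k, (wu, wi) in enumerate(zip(user_weights, item_weights)):
        if wu == 0.0 or wi == 0.0:
            continue  # sparse weight: this component contributes nothing
        score += wu * wi * hamming_inner_product(user_codes[k], item_codes[k], r)
    return score


if __name__ == "__main__":
    # Three components, each a 4-bit code stored as an integer.
    u_codes, u_w = [0b1010, 0b1111, 0b0000], [0.5, 0.0, 2.0]
    i_codes, i_w = [0b1010, 0b1110, 0b1111], [1.0, 3.0, 0.5]
    print(compositional_score(u_codes, u_w, i_codes, i_w, r=4))
```

In this toy setup the second component is skipped because the user's weight for it is zero; replacing the real-valued weights with small integers would be one way to picture the integer weight approximation the abstract mentions, although the record gives no details of that scheme.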
format text
author LIU, Chenghao
LU, Tao
WANG, Xin
CHENG, Zhiyong
SUN, Jianling
HOI, Steven C. H.
title Compositional coding for collaborative filtering
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/5100
https://ink.library.smu.edu.sg/context/sis_research/article/6103/viewcontent/Compositional_coding_collaborative_filtering_pv_oa.pdf
_version_ 1770575220133855232