Variational learning from implicit bandit feedback
Recommendations are prevalent in Web applications (e.g., search ranking, item recommendation, advertisement placement). Learning from bandit feedback is challenging due to the sparsity of feedback limited to system-provided actions. In this work, we focus on batch learning from logs of recommender s...
Saved in:
Main Authors: | TRUONG, Quoc Tuan, LAUW, Hady W. |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6431 https://ink.library.smu.edu.sg/context/sis_research/article/7434/viewcontent/ml21.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Dynamic Clustering of Contextual Multi-Armed Bandits
by: NGUYEN, Trong T., et al.
Published: (2014) -
Bilateral variational autoencoder for collaborative filtering
by: TRUONG, Quoc Tuan, et al.
Published: (2021) -
Multi-representation Variational Autoencoder via iterative latent attention and implicit differentiation
by: TRAN, Nhu Thuat, et al.
Published: (2023) -
Learning index policies for restless bandits with application to maternal healthcare
by: BISWAS, Arpita, et al.
Published: (2021) -
Multi-armed linear bandits with latent biases
by: Kang, Qiyu, et al.
Published: (2024)