Pricing problems with Thompson sampling

In 1933, William R. Thompson proposed an algorithm, now known as Thompson sampling, to maximise cumulative payoff in a multi-armed bandit (MAB) problem. MAB problems have frequently been used to model real-life decision-making scenarios. This paper explores the extension of Thompson sampl...

Bibliographic Details
Main Author:	Lee, Samuel Wai Leong
Other Authors:	Yan Zhenzhen
Format:	Final Year Project
Language:	English
Published:	2019
Subjects:	DRNTU::Science::Mathematics::Statistics
Online Access:	http://hdl.handle.net/10356/77144
Institution:	Nanyang Technological University

Similar Items