Pricing problems with Thompson sampling

In 1933, William R. Thompson proposed an algorithm known as Thompson sampling in order to maximise culmulative payo in a multi-armed bandit (MAB) problem. MAB problems have been fre- quently used to model real-life decision making scenarios. This pa- per explores the extension of Thompson sampl...

Full description

Saved in:
Bibliographic Details
Main Author: Lee, Samuel Wai Leong
Other Authors: Yan Zhenzhen
Format: Final Year Project
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10356/77144
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In 1933, William R. Thompson proposed an algorithm known as Thompson sampling in order to maximise culmulative payo in a multi-armed bandit (MAB) problem. MAB problems have been fre- quently used to model real-life decision making scenarios. This pa- per explores the extension of Thompson sampling to other problems beyond the MAB setting. More speci cally, Thompson sampling is applied to product sales using data from a real dataset in a dynamic pricing setting as part of the multi-product pricing problem.