Pricing problems with Thompson sampling

In 1933, William R. Thompson proposed an algorithm known as Thompson sampling in order to maximise culmulative payo in a multi-armed bandit (MAB) problem. MAB problems have been fre- quently used to model real-life decision making scenarios. This pa- per explores the extension of Thompson sampl...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Lee, Samuel Wai Leong
مؤلفون آخرون:	Yan Zhenzhen
التنسيق:	Final Year Project
اللغة:	English
منشور في:	2019
الموضوعات:	DRNTU::Science::Mathematics::Statistics
الوصول للمادة أونلاين:	http://hdl.handle.net/10356/77144
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

الانترنت

http://hdl.handle.net/10356/77144

Pricing problems with Thompson sampling

الانترنت

مواد مشابهة