Multi-armed linear bandits with latent biases

Multi-armed linear bandits with latent biases

In a linear stochastic bandit model, each arm corresponds to a vector in Euclidean space, and the expected return observed at each time step is determined by an unknown linear function of the selected arm. This paper addresses the challenge of identifying the optimal arm in a linear stochastic bandi...

Saved in:

書目詳細資料
Main Authors:	Kang, Qiyu, Tay, Wee Peng, She, Rui, Wang, Sijie, Liu, Xiaoqian, Yang, Yuan-Rui
其他作者:	School of Electrical and Electronic Engineering
格式:	Article
語言:	English
出版:	2024
主題:	Computer and Information Science Linear bandit Multi-armed bandit
在線閱讀:	https://hdl.handle.net/10356/175416
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!

相似書籍

Multi-arm bandit-led clustering in federated learning
由: Zhao, Joe Chen Xuan
出版: (2024)

PERFORMANCE GUARANTEES FOR ONLINE LEARNING: CASCADING BANDITS AND ADVERSARIAL CORRUPTIONS
由: ZHONG ZIXIN
出版: (2021)

THE CONFIDENCE BOUND METHOD FOR THE MULTI-ARMED BANDIT PROBLEM WITH LARGE ARM SIZE
由: HU SHOURI
出版: (2020)

Dynamic Clustering of Contextual Multi-Armed Bandits
由: NGUYEN, Trong T., et al.
出版: (2014)

Efficient resource allocation with fairness constraints in restless multi-armed bandits
由: LI, Dexun, et al.
出版: (2022)

Combinatorial multi-armed bandit problem with probabilistically triggered arms: A case with bounded regret
由: SARITAC, Omer, et al.
出版: (2017)

REVISED APPROACH FOR RISK-AVERSE MULTI-ARMED BANDITS UNDER CVAR CRITERIA
由: NAJAKORN KHAJONCHOTPANYA
出版: (2020)

Avoiding starvation of arms in restless multi-armed bandit
由: LI, Dexun, et al.
出版: (2023)

DECISION MODELS FOR CONTEXT-AWARE IOT APPLICATIONS
由: NIRANDIKA WANIGASEKARA
出版: (2020)

Avoiding starvation of arms in restless multi-armed bandit
由: LI, Dexun, et al.
出版: (2023)

Learning index policies for restless bandits with application to maternal healthcare
由: BISWAS, Arpita, et al.
出版: (2021)

Tuning an underwater communication link
由: SATISH SHANKAR
出版: (2014)

On power-efficient planning in dynamic small cell networks
由: Maghsudi, Setareh, et al.
出版: (2020)

Efficient resource allocation with fairness constraints in restless multi-armed bandits
由: LI, Dexun, et al.
出版: (2022)

Distributed bandit online convex optimization with time-varying coupled inequality constraints
由: Yi, Xinlei, et al.
出版: (2022)

LEARNING TO MAKE DECISIONS WITH INCOMPLETE INFORMATION: REINFORCEMENT LEARNING, INFORMATION GEOMETRY, AND REAL-LIFE APPLICATIONS
由: DEBABROTA BASU
出版: (2018)

BANDIT-STYLE ALGORITHMS FOR WIRELESS NETWORK SELECTION
由: ANUJA MEETOO APPAVOO
出版: (2021)

ONLINE RESOURCE ALLOCATION AND ITS APPLICATIONS
由: ZHU QIUYU
出版: (2022)

Variational learning from implicit bandit feedback
由: TRUONG, Quoc Tuan, et al.
出版: (2021)

A hybrid bandit framework for diversified recommendation
由: Ding, Qinxu, et al.
出版: (2021)

Adapting underwater physical link parameters using data driven algorithms
由: D. MELANI JAYASURIYA
出版: (2011)

Optimal stopping for Brownian motion with applications to sequential analysis and option pricing
由: Lai, T.L., et al.
出版: (2014)

Bounding regret in empirical games
由: JECMEN, Steven, et al.
出版: (2020)

Decision-theoretic designs for dose-finding clinical trials with multiple outcomes
由: Fan, S.K., et al.
出版: (2014)

Optimal strategies for a class of sequential control problems with precedence relations
由: Chan, H.P., et al.
出版: (2014)

Privacy-preserving user recruitment with sensing quality evaluation in mobile crowdsensing
由: AN, Jieying, et al.
出版: (2024)

Hedging the Drift: Learning to Optimize under Non-Stationarity
由: CHEUNG WANG CHI, et al.
出版: (2020)

Self‐regulating action exploration in reinforcement learning
由: TENG, Teck-Hou, et al.
出版: (2012)

Self-regulating action exploration in reinforcement learning
由: TENG, Teck-Hou, et al.
出版: (2012)

Learning and adaptation under uncertainty and ambiguity
由: ZHENG, Lei
出版: (2020)

PRFusion: toward effective and robust multi-modal place recognition with image and point cloud fusion
由: Wang, Sijie, et al.
出版: (2025)

A FAMILY OF ADAPTIVE DESIGNS FOR MULTI-ARM CLINICAL TRIALS
由: HU JINJIN
出版: (2019)

A FAMILY OF ADAPTIVE DESIGNS FOR MULTI-ARM CLINICAL TRIALS
由: HU JINJIN
出版: (2019)

Online contextual influence maximization with costly observations
由: SARITAC, Omer, et al.
出版: (2019)

The effective coverage of homogeneous teams with radial attenuation models
由: Yang, Yuan-Rui, et al.
出版: (2023)

The development of military factions in the Armed Forces of the Philippines
由: Garcia, John Edward
出版: (1989)

Assessing Latent Inhibition Deficits in Youth At-risk of Conversion to Psychosis
由: JAMIE THONG YU JIN
出版: (2012)

Burst-induced Multi-Armed Bandit for learning recommendation
由: ALVES, Rodrigo, et al.
出版: (2021)

Arms control and conflict resolution as games : NATO-Russia security cooperation from 1990-2000
由: Gan, Edito C., Jr.
出版: (2022)

Comparison of bipolar sub-modules for the alternate arm converter
由: Wickramasinghe, Harith R., et al.
出版: (2017)