(Private) Kernelized Bandits with Distributed Biased Feedback

Li, Fengjiao; Zhou, Xingyu; Ji, Bo

dc.contributor.author	Li, Fengjiao	en
dc.contributor.author	Zhou, Xingyu	en
dc.contributor.author	Ji, Bo	en
dc.date.accessioned	2023-07-11T13:47:17Z	en
dc.date.available	2023-07-11T13:47:17Z	en
dc.date.issued	2023-06-19	en
dc.date.updated	2023-07-01T08:02:54Z	en
dc.description.abstract	We study kernelized bandits with distributed biased feedback. This problem is motivated by several real-world applications (such as dynamic pricing, cellular network configuration, and policy making), where users from a large population contribute to the reward of the action chosen by a central entity, but it is difficult to collect feedback from all users. Instead, only biased feedback (due to user heterogeneity) from a subset of users may be available. In addition to such biased feedback, we are also faced with two practical challenges due to communication cost and computation complexity. To tackle these challenges, we carefully design a new distributed phase-then-batch-based elimination (DPBE) algorithm, which samples users in phases for collecting feedback to reduce the bias and employs maximum variance reduction to select actions in batches within each phase. By properly choosing the phase length, the batch size, and the confidence width used for eliminating suboptimal actions, we show that DPBE achieves a sublinear regret of $\tilde{O}(T^{1-\alpha/2} + \sqrt{\gamma_T T})$, where $\alpha \in (0, 1)$ is the user-sampling parameter one can tune. Moreover, DPBE can significantly reduce both communication cost and computation complexity in distributed kernelized bandits, compared to some variants of the state-of-the-art algorithms (originally developed for standard kernelized bandits). Furthermore, by incorporating various differential privacy models, we generalize DPBE to provide privacy guarantees for users participating in the distributed learning process. The algorithm design, analyses, and numerical experiments are provided in the full version of this paper [4].	en
dc.description.version	Published version	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.1145/3578338.3593565	en
dc.identifier.uri	http://hdl.handle.net/10919/115729	en
dc.language.iso	en	en
dc.publisher	ACM	en
dc.rights	In Copyright	en
dc.rights.holder	The author(s)	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.title	(Private) Kernelized Bandits with Distributed Biased Feedback	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en

Files

Original bundle

Name: 3578338.3593565.pdf
Size: 919.89 KB
Format: Adobe Portable Document Format
Description: Published version

Collections

Journal Articles, Association for Computing Machinery (ACM)
Scholarly Works, Computer Science