Distributed Linear Bandits with Differential Privacy

dc.contributor.authorLi, Fengjiaoen
dc.contributor.authorZhou, Xingyuen
dc.contributor.authorJi, Boen
dc.date.accessioned2024-02-19T14:21:17Zen
dc.date.available2024-02-19T14:21:17Zen
dc.date.issued2024-02-06en
dc.description.abstractIn this paper, we study the problem of global reward maximization with only partial distributed feedback. This problem is motivated by several real-world applications (e.g., cellular network configuration, dynamic pricing, and policy selection) where an action taken by a central entity influences a large population that contributes to the global reward. However, collecting such reward feedback from the entire population not only incurs a prohibitively high cost, but often leads to privacy concerns. To tackle this problem, we consider distributed linear bandits with differential privacy, where a subset of users from the population are selected (called clients) to participate in the learning process and the central server learns the global model from such partial feedback by iteratively aggregating these clients’ local feedback in a differentially private fashion. We then propose a unified algorithmic learning framework, called differentially private distributed phased elimination (DP-DPE), which can be naturally integrated with popular differential privacy (DP) models (including central DP, local DP, and shuffle DP). Furthermore, we show that DP-DPE achieves both sublinear regret and sublinear communication cost. Interestingly, DP-DPE also achieves privacy protection “for free” in the sense that the additional cost due to privacy guarantees is a lower-order additive term. In addition, as a by-product of our techniques, the same results of “free” privacy can also be achieved for the standard differentially private linear bandits. Finally, we conduct simulations to corroborate our theoretical results and demonstrate the effectiveness of DP-DPE.en
dc.description.versionAccepted versionen
dc.format.extentPages 1-13en
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1109/tnse.2024.3362978en
dc.identifier.eissn2327-4697en
dc.identifier.issn2334-329Xen
dc.identifier.issue99en
dc.identifier.orcidJi, Bo [0000-0003-0149-7509]en
dc.identifier.urihttps://hdl.handle.net/10919/118016en
dc.identifier.volumePPen
dc.language.isoenen
dc.publisherIEEEen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleDistributed Linear Bandits with Differential Privacyen
dc.title.serialIEEE Transactions on Network Science and Engineeringen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Engineeringen
pubs.organisational-group/Virginia Tech/Engineering/Computer Scienceen
pubs.organisational-group/Virginia Tech/All T&R Facultyen
pubs.organisational-group/Virginia Tech/Engineering/COE T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DLB_TNSE_final.pdf
Size:
2.22 MB
Format:
Adobe Portable Document Format
Description:
Accepted version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Plain Text
Description: