Vulnerabilities Caused by Metric-based Policies in Reinforcement Learning Based Covert Communication Under Steering Attack
| dc.contributor.author | Jones, Alyse M. | en |
| dc.contributor.author | Costa, Maice | en |
| dc.date.accessioned | 2025-08-04T18:19:57Z | en |
| dc.date.available | 2025-08-04T18:19:57Z | en |
| dc.date.issued | 2025-06-30 | en |
| dc.date.updated | 2025-08-01T07:52:00Z | en |
| dc.description.abstract | This paper explores the concept of timeliness in covert communications when faced with eavesdropping and jamming. We consider a transmitter-receiver pair communicating over a wireless channel where the choice of a resource block (frequency, time) to transmit is the result of a Reinforcement Learning policy. The eavesdropper aims to detect a transmission to perform a steering attack. Using two multi-armed bandit systems, we investigate the problem of minimizing the Age of Information (AoI) regret at the legitimate receiver, while maximizing the AoI regret at the adversary. We present an upper bound for regret and demonstrate through simulations the validity of the bound and the vulnerabilities introduced by the use of metric-guided policies such as age-aware policies. | en |
| dc.description.version | Published version | en |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.doi | https://doi.org/10.1145/3733965.3733973 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/136952 | en |
| dc.language.iso | en | en |
| dc.publisher | ACM | en |
| dc.rights | Creative Commons Attribution 4.0 International | en |
| dc.rights.holder | The author(s) | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
| dc.title | Vulnerabilities Caused by Metric-based Policies in Reinforcement Learning Based Covert Communication Under Steering Attack | en |
| dc.type | Article - Refereed | en |
| dc.type.dcmitype | Text | en |