Multi-player Multi-armed Bandits with Delayed Feedback
Authors: Jingqi Fan, Zilong Wang, Shuai Li, Linghe Kong
IJCAI 2025 | Venue PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Numerical experiments on both synthetic and real-world datasets validate the effectiveness of our algorithm. |
| Researcher Affiliation | Academia | Jingqi Fan1 , Zilong Wang2 , Shuai Li2 , Linghe Kong2 1Northeastern University, China 2Shanghai Jiao Tong University EMAIL, EMAIL |
| Pseudocode | Yes | Algorithm 1 DDSE (Leader with j = M) Algorithm 2 Communication (Leader with j = M) Algorithm 3 Communication (Follower j) |
| Open Source Code | No | The paper does not provide any explicit statements about releasing code for their methodology, nor does it include links to source code repositories. The only link provided is for a dataset. |
| Open Datasets | Yes | We evaluate our algorithms using real-world spectrum data collected in Finland by the 5G-Xcast project2. ... The full dataset used in this experiment is publicly available at https://zenodo.org/records/1293283. |
| Dataset Splits | No | The paper mentions running experiments for T = 300,000 rounds and averaging over 20 trials, and using real-world spectrum data. However, it does not specify any dataset splits like training/test/validation percentages or counts, or predefined splits for the real-world or synthetic data. |
| Hardware Specification | No | The paper does not provide any specific details about the hardware used to run the experiments, such as GPU or CPU models. |
| Software Dependencies | No | The paper does not specify any software dependencies with version numbers, such as programming languages, libraries, or frameworks used for implementation. |
| Experiment Setup | Yes | Each experiment runs for T = 300,000 rounds and is averaged over 20 trials. Default parameters are K = 20, M = 10, E[d] = 200, σd = 100, and = 0.05. |