Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Authors: Yuchen Xiao, Weihao Tan, Christopher Amato
NeurIPS 2022
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Empirical results (in simulation and hardware) in a variety of realistic domains demonstrate the superiority of our approaches in large multi-agent problems and validate the effectiveness of our algorithms for learning high-quality and asynchronous solutions. |
| Researcher Affiliation | Academia | Yuchen Xiao, Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, EMAIL; Weihao Tan, Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, EMAIL; Christopher Amato, Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, EMAIL |
| Pseudocode | Yes | The pseudocode and detailed trajectory squeezing process for each proposed method are presented in Appendix C. |
| Open Source Code | Yes | In supplementary materials, we include the code and a README.txt file to reproduce the main experimental results. |
| Open Datasets | Yes | We investigate the performance of our algorithms over a variety of multi-agent problems with macro-actions (Fig. 1): Box Pushing [Xiao et al., 2019], Overcooked [Wu et al., 2021b], and a larger Warehouse Tool Delivery [Xiao et al., 2019] domain. |
| Dataset Splits | No | The paper refers to 'training trials' and 'testing episodes' for evaluation but does not specify explicit dataset splits (e.g., percentages or counts for training, validation, and test sets); as a reinforcement learning work, it generates experience through environment interaction rather than using a fixed dataset. |
| Hardware Specification | Yes | The details of used computational resources are mentioned in Appendix E. |
| Software Dependencies | No | The provided text does not explicitly list software dependencies with specific version numbers. |
| Experiment Setup | Yes | All the training details including hyperparameters are in Appendix E. |