reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Subgame Solving in Adversarial Team Games

Authors: Brian Zhang, Luca Carminati, Federico Cacciamani, Gabriele Farina, Pierriccardo Olivieri, Nicola Gatti, Tuomas Sandholm

NeurIPS 2022 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We apply our method to a standard test suite, and we empirically show the performance improvement of the strategies thanks to subgame solving.
Researcher Affiliation	Collaboration	Brian Hu Zhang Computer Science Department Carnegie Mellon University EMAIL Luca Carminati DEIB, Politecnico di Milano EMAIL Federico Cacciamani DEIB, Politecnico di Milano EMAIL Gabriele Farina Computer Science Department Carnegie Mellon University EMAIL Pierriccardo Olivieri DEIB, Politecnico di Milano EMAIL Nicola Gatti DEIB, Politecnico di Milano EMAIL Tuomas Sandholm Computer Science Department, CMU Strategic Machine, Inc. Strategy Robot, Inc. Optimized Markets, Inc. EMAIL
Pseudocode	Yes	Algorithm 1 Maxmargin subgame solving with column generation, at public state P
Open Source Code	No	The paper states in its self-assessment checklist that code is included, but the provided text does not contain a direct link to a code repository, nor an explicit statement in the main body or appendices (other than results in Appendix C) indicating where the code for the described methodology can be accessed.
Open Datasets	No	The paper uses "parametric versions of the ATG instances customarily adopted in the literature" such as Kuhn poker, Leduc poker, Liar's Dice, and Tricks, citing papers [11, 18, 13, 21] for their rules. These are game definitions/frameworks, not publicly available datasets in the traditional sense with specific access information (link, DOI, repository) for data files.
Dataset Splits	No	The paper does not specify traditional training/validation/test dataset splits. It describes using game instances and parameters for those games, rather than data splits.
Hardware Specification	Yes	Each experiment was allocated 32 CPU cores and 256 GB RAM on a cluster machine.
Software Dependencies	Yes	Integer and linear programs were solved with Gurobi 9.5.
Experiment Setup	Yes	More precisely, the blueprint computation is stopped once 10 minutes have elapsed or column generation has achieved a Nash gap of /10, where is the difference between maximum and minimum team s payoffs, whichever comes first. We use a range of time limits for the strategy refinement, defined as the average time needed by a single iteration of the CG algorithm at the root of the whole game multiplied by a number α {0, 1, ..., 10}.