reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Provably Efficient Model-Free Constrained RL with Linear Function Approximation

Authors: Arnob Ghosh, Xingyu Zhou, Ness Shroff

NeurIPS 2022 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We evaluate Algorithm 1 on a simulated model for job scheduling to validate our theoretical results.
Researcher Affiliation	Academia	Arnob Ghosh Electrical and Computer Engineering The Ohio State University Columbus, OH, USA EMAIL Xingyu Zhou Electrical and Computer Engineering Wayne State University Detroit, MI, USA EMAIL Ness Shroff Electrical and Computer Engineering The Ohio State University Columbus, OH, USA EMAIL
Pseudocode	Yes	Algorithm 1 Model Free Primal-Dual Algorithm for Linear Function Approximation
Open Source Code	No	The paper states in a checklist that code is included, but does not provide a URL or explicit mention within the main body of the text for public access.
Open Datasets	No	The paper uses a 'simulated model for job scheduling' and does not provide concrete access information or citations for a publicly available dataset.
Dataset Splits	No	The paper uses a simulated model for experiments and does not specify any train/validation/test dataset splits or their methodology.
Hardware Specification	No	The paper does not provide any specific hardware details for running the experiments, and the authors state 'N/A' for compute resources in the checklist.
Software Dependencies	No	The paper does not list specific software dependencies with version numbers.
Experiment Setup	Yes	The parameters we used are the followings: α = K/(1 + 2H/γ + H), η = 2H/(γ KH2). We set γ = 1. We have also set ϵ = 0.1 in order to ensure that the violation goes towards 0 as K increases.