reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

A Unified View of Piecewise Linear Neural Network Verification

Authors: Rudy R. Bunel, Ilker Turkaslan, Philip Torr, Pushmeet Kohli, Pawan K. Mudigonda

NeurIPS 2018 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We use the benchmark to provide the ﬁrst experimental comparison of existing algorithms and identify the factors impacting the hardness of veriﬁcation problems.
Researcher Affiliation	Collaboration	Rudy Bunel University of Oxford EMAIL Ilker Turkaslan University of Oxford EMAIL Philip H.S. Torr University of Oxford EMAIL Pushmeet Kohli Deepmind EMAIL M. Pawan Kumar University of Oxford Alan Turing Institute EMAIL
Pseudocode	Yes	Algorithm 1 Branch and Bound
Open Source Code	Yes	All code and data necessary to replicate our analysis are released.
Open Datasets	Yes	The Collision Detection data set [6] attempts to predict whether two vehicles with parameterized trajectories are going to collide.
Dataset Splits	No	The paper refers to using datasets like Collision Detection, ACAS, and PCAMNIST, but does not provide specific details on how these datasets were split into training, validation, or test sets (e.g., percentages, sample counts, or explicit references to standard splits used).
Hardware Specification	Yes	We attempt to verify each property with a timeout of two hours, and a maximum allowed memory usage of 20GB, on a single core of a machine with an i7-5930K CPU.
Software Dependencies	No	The paper mentions software like 'Python', 'Gurobi', and 'GLPK library' but does not provide specific version numbers for these software dependencies, which are necessary for full reproducibility.
Experiment Setup	Yes	The base network has 10 inputs and 4 layers of 25 hidden units, and the property to prove is True with a margin of 1000. Each of the plot correspond to a variation of one of this parameters.