Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Project and Forget: Solving Large-Scale Metric Constrained Problems
Authors: Rishi Sonthalia, Anna C. Gilbert
JMLR 2022 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We demonstrate that using our method, we can solve large problem instances of general weighted correlation clustering, metric nearness, information theoretic metric learning and quadratically regularized optimal transport; in each case, out-performing the state of the art methods with respect to CPU times and problem sizes. Keywords: Large Scale Convex Optimization, Metric Constrained Optimization, Metric Learning, Correlation Clustering |
| Researcher Affiliation | Academia | Rishi Sonthalia EMAIL Department of Mathematics University of Michigan Ann Arbor, Michigan, 48104 Anna C. Gilbert EMAIL Departments of Mathematics and Statistics & Data Science Yale University New Haven, Connecticut, 06510 |
| Pseudocode | Yes | Algorithm 1 General Algorithm. 1: function Project and Forget(f convex function) ... Algorithm 2 Finding Metric Violations. 1: function Metric Violations(d) ... Algorithm 3 Project and Forget algorithms. 1: function Project(x, z, L) |
| Open Source Code | Yes | All implementations and experiments can be found at https://github.com/rsonthal/ProjectAndForget. |
| Open Datasets | Yes | For dense graphs, we use four graphs from the Stanford sparse network repository. Then, following Veldt et al. (2019), we use the method from Wang et al. (2013) to convert these graphs into instances of weighted correlation clustering on the complete graph. |
| Dataset Splits | Yes | For each data set, we uniformly at random choose 80% of the data points to be the training set and the remaining to be the test set. |
| Hardware Specification | Yes | All experiments were run on a Computer with 52 GB of memory. All computations were done on a machine with 4 physical cores, each with 13 GB of RAM. |
| Software Dependencies | Yes | The solvers CPLEX, Mosek, OSQP, SCS, COSMO, Ipopt, ProxSDP, ECOS, and MASS all were accessed via Julia's JuMP library (Dunning et al., 2017). SLSQP was interfaced with using SciPy. |
| Experiment Setup | Yes | The hyper-parameters were set as follows: γ = 1, u = 1, l = 10. |
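For reference, the dataset-split procedure quoted above (80% of the data points chosen uniformly at random for training, the remainder for testing) can be sketched as follows. This is a minimal illustration; the function name, seed, and use of Python's standard library are assumptions, not taken from the paper's implementation.

```python
import random

def random_split(points, train_frac=0.8, seed=0):
    """Illustrative sketch: uniformly at random assign train_frac of the
    points to the training set and the remaining points to the test set."""
    rng = random.Random(seed)  # fixed seed only for reproducibility of this sketch
    indices = list(range(len(points)))
    rng.shuffle(indices)
    cut = int(train_frac * len(points))
    train = [points[i] for i in indices[:cut]]
    test = [points[i] for i in indices[cut:]]
    return train, test

train, test = random_split(list(range(100)))
```

With 100 points and `train_frac=0.8`, this yields 80 training points and 20 test points, together covering the full dataset.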