reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

What Causes the Test Error? Going Beyond Bias-Variance via ANOVA

Authors: Licong Lin, Edgar Dobriban

JMLR 2021 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We verify our results in numerical simulations and on empirical data examples. Keywords: Test Error, ANOVA, Double Descent, Ridge Regression, Random Matrix Theory... In this section, we perform several numerical experiments, to check the correctness of our theoretical results... We also show some experiments on empirical data, speciﬁcally on the superconductivity data set (Hamidieh, 2018), where we test our predictions for two-layer orthogonal nets.
Researcher Affiliation	Academia	Licong Lin EMAIL School of Mathematical Sciences Peking University 5 Yiheyuan Road, Beijing, China Edgar Dobriban EMAIL Departments of Statistics & Computer and Information Science University of Pennsylvania Philadelphia, PA, 19104-6340, USA
Pseudocode	No	The paper describes mathematical derivations and theoretical results. It does not include any explicitly labeled "Pseudocode" or "Algorithm" blocks, nor does it present structured steps in a code-like format in the main text.
Open Source Code	Yes	Code associated with the paper is available at https://github.com/licong-lin/Variance Decomposition.
Open Datasets	Yes	We use the Superconductivity Data Set (Hamidieh, 2018) retrieved from the UC Irvine Machine Learning Repository in our data analysis.
Dataset Splits	Yes	We then separate the data set into a training set containing the ﬁrst 90% of the samples, and a test set containing the rest.
Hardware Specification	No	The paper does not explicitly mention any specific hardware (like GPU or CPU models, memory, or cloud instances) used for running the experiments or numerical simulations.
Software Dependencies	No	The paper does not provide specific version numbers for any software libraries, programming languages, or tools used in the numerical simulations or empirical data analysis.
Experiment Setup	Yes	Parameters: signal strength α = 1, noise level σ = 0.3, regularization parameter λ = 0.01, parametrization level π = 0.8. See Sections 2.2, 4.2 for details. ... Parameters: α = 1, σ = 0.3, π = 0.8, n = 150, d = nδ , p = dπ . ... For simplicity, we subtract the test point label noise σ2 from the MSE formula. We randomly generate k = 400 i.i.d. tuples of random variables (xi, θi, εi, Xi, Wi), 1 i k, from their assumed distributions (we assume X and x have i.i.d. N(0, 1) entries in numerical simulations), and estimate the MSE by calculating:...