reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Exponential Tail Local Rademacher Complexity Risk Bounds Without the Bernstein Condition

Authors: Varun Kanade, Patrick Rebeschini, Tomas Vaskevicius

JMLR 2024 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	Our main result is an exponential-tail oﬀset Rademacher complexity excess risk upper bound that yields results at least as sharp as those obtainable via the classical theory. However, our bound applies under an estimator-dependent geometric condition (the oﬀset condition ) instead of the estimator-independent (but, in general, distribution-dependent) Bernstein condition on which the classical theory relies. Our results apply to improper prediction regimes not directly covered by the classical theory, such as optimal model selection aggregation for arbitrary classes (including inﬁnite and non-convex classes), and early-stopping/iterative regularization; the Bernstein condition does not hold in both examples.
Researcher Affiliation	Academia	Varun Kanade Department of Computer Science, University of Oxford Patrick Rebeschini Department of Statistics, University of Oxford Tomas Vaˇskeviˇcius
Pseudocode	No	The paper defines concepts and procedures mathematically and descriptively, but does not include any clearly labeled 'Algorithm' or 'Pseudocode' blocks with structured, step-by-step instructions. For example, Audibert's Star Algorithm and the Midpoint Estimator are described verbally or with mathematical formulas, not as pseudocode.
Open Source Code	No	The paper does not contain any explicit statements about releasing source code, nor does it provide links to code repositories. The license information provided for the paper itself does not refer to code.
Open Datasets	No	The paper is theoretical and does not conduct experiments on specific datasets. It discusses abstract samples (e.g., 'Let Sn = (Xi, Yi)n i=1 denote an i.i.d. sample of input-output pairs (Xi, Yi) X Y distributed according to some unknown distribution P.'), but no concrete datasets with access information are mentioned.
Dataset Splits	No	The paper is theoretical and does not conduct experiments requiring dataset splits. Therefore, no information about training, test, or validation splits is provided.
Hardware Specification	No	This paper is theoretical and does not describe any experimental procedures that would require specific hardware. Consequently, no hardware specifications (e.g., GPU/CPU models, memory) are mentioned.
Software Dependencies	No	The paper is theoretical and does not describe any experimental setup that would require specific software dependencies with version numbers. Therefore, no such information is provided.
Experiment Setup	No	This paper is theoretical and does not present experimental results. As such, there is no description of an experimental setup, including hyperparameters, training configurations, or system-level settings.