Distribution Learning via Neural Differential Equations: A Nonparametric Statistical Perspective

Authors: Youssef Marzouk, Zhi (Robert) Ren, Sven Wang, Jakob Zech

JMLR 2024

Reproducibility Variable | Result | LLM Response
Research Type | Theoretical | This work establishes the first general nonparametric statistical convergence analysis for distribution learning via ODE models trained through likelihood maximization. We first prove a convergence theorem applicable to arbitrary velocity field classes F satisfying certain simple boundary constraints. This general result captures the trade-off between the approximation error and the complexity of the ODE model. We show that the latter can be quantified via the C^1-metric entropy of the class F. We then apply this general framework to the setting of C^k-smooth target densities, and establish nearly minimax-optimal convergence rates for two relevant velocity field classes F: C^k functions and neural networks.
Researcher Affiliation | Academia | Youssef Marzouk EMAIL Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Zhi Ren EMAIL Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Sven Wang EMAIL Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Jakob Zech EMAIL Interdisciplinary Center for Scientific Computing, Heidelberg University, Heidelberg 69120, Germany
Pseudocode | No | The paper does not contain any clearly labeled pseudocode or algorithm blocks. It is a theoretical paper focused on mathematical proofs and convergence rates.
Open Source Code | No | The paper does not explicitly state that source code for the described methodology is being released, nor does it provide a link to a code repository. It mentions a license for the paper itself, not for code.
Open Datasets | No | The paper discusses nonparametric density estimation based on a theoretical sampling model: a finite collection of independent and identically distributed (iid) samples Z_1, ..., Z_n ~ P_0 is given. It does not refer to any specific public or open dataset used for empirical experiments.
Dataset Splits | No | The paper is theoretical and does not describe experiments on specific datasets; therefore, no information regarding training/test/validation splits is provided.
Hardware Specification | No | The paper does not describe any experimental hardware, as it is a theoretical work focused on mathematical analysis and convergence rates rather than empirical evaluation.
Software Dependencies | No | The paper does not list any specific software dependencies with version numbers, consistent with its theoretical nature.
Experiment Setup | No | The paper does not provide details on experimental setup such as hyperparameters or system-level training settings, as it is a theoretical paper and does not involve empirical experiments.
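The "Research Type" row above concerns maximum-likelihood training of ODE transport models, where a velocity field from a class F defines a flow that pushes a simple base density onto the target. The sketch below is not from the paper: it is a minimal 1-D illustration, under assumed simplifications (a hand-picked linear velocity field v(x, t) = a*x standing in for a learned member of F, and explicit Euler integration), of how such a model's log-likelihood is evaluated via the instantaneous change-of-variables formula.

```python
import numpy as np

def velocity(x, t, a):
    # Illustrative linear velocity field v(x, t) = a * x.
    # (A stand-in for a learned element of the class F; chosen so the
    # flow has a closed form against which the ODE solver can be checked.)
    return a * x

def divergence(x, t, a):
    # For v(x) = a * x in one dimension, div v = a everywhere.
    return a

def log_likelihood(z, a, n_steps=1000, T=1.0):
    """Log-density of data z under the ODE model: integrate the flow
    backward from t = T to t = 0 with explicit Euler steps, accumulate
    the divergence integral, and evaluate the N(0, 1) base density at
    the recovered initial point:
        log p_T(z) = log p_0(x(0)) - \int_0^T div v(x(t), t) dt
    """
    dt = T / n_steps
    x = np.asarray(z, dtype=float).copy()
    logdet = np.zeros_like(x)
    t = T
    for _ in range(n_steps):
        x = x - dt * velocity(x, t, a)          # backward-in-time Euler step
        logdet = logdet + dt * divergence(x, t, a)
        t -= dt
    log_base = -0.5 * x**2 - 0.5 * np.log(2.0 * np.pi)  # log N(0, 1)
    return log_base - logdet
```

For this linear field the flow map is x(T) = e^{aT} x(0), so the pushforward of N(0, 1) is N(0, e^{2aT}) and the numerical log-likelihood can be checked against that closed form; likelihood maximization as analyzed in the paper would fit the parameters of v to samples by maximizing the sum of these log-likelihood values.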