reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

A noise-corrected Langevin algorithm and sampling by half-denoising

Authors: Aapo Hyvarinen

TMLR 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	4 Experiments Data models We use two different scenarios. The first is Gaussian mixture model in two dimensions. ... MCMC algorithms The starting point is that we only know the noisy-data score function (i.e. the score function for noisy data), and for one single noise level. ... Results 1: GMM bias removal Basic visualization of the results is given in Figs 1 and 2, which we will not comment any further. A comprehensive quantitative comparison is given in Fig. 3.
Researcher Affiliation	Academia	Aapo Hyvärinen EMAIL Department of Computer Science University of Helsinki, Finland
Pseudocode	No	The paper describes the proposed algorithm using mathematical equations (Eq. 6, 7, 15, 16) and textual descriptions of the iteration steps, but it does not include a clearly labeled 'Pseudocode' or 'Algorithm' block with structured, code-like formatting.
Open Source Code	No	The paper does not contain any explicit statement about the release of source code for the methodology described, nor does it provide a link to a code repository.
Open Datasets	No	Data models We use two different scenarios. The first is Gaussian mixture model in two dimensions. ... The second scenario is Gaussian model in higher dimensions. The Gaussian data was white, and the dimension took the values 5,10,100. This describes synthetic data generated for the experiments and does not refer to a publicly available dataset with concrete access information.
Dataset Splits	No	In the main results (bias removal), the last 30% of the data was analyzed as samples of the distribution; it is assumed here that this 30% represents the final distribution and all the algorithms have converged. This refers to the analysis of the generated MCMC samples, not a split of an input dataset for training, validation, or testing.
Hardware Specification	No	The paper does not contain any specific details about the hardware used to run the experiments, such as GPU models, CPU types, or memory specifications.
Software Dependencies	No	The paper does not specify any software libraries, frameworks, or their version numbers used in the implementation or for conducting experiments.
Experiment Setup	Yes	Step sizes are chosen as follows. According to the theory above, the step size for the proposed method is implicitly given based on the noise level as in (14). For the baseline Langevin methods, we use that same step size if nothing is mentioned, but conduct additional simulations with another step size which is 1/4 of the above. ... Each algorithm was run for 1,000,000 steps. ... The number of kernels (components) is varied from 1 to 4; ... We consider two different noise levels in the score function, a higher one (σ2 = 0.3) and a lower one (σ2 = 0.1). ... the dimension took the values 5,10,100.