Data-centric Machine Learning Research (DMLR)

Venue URL:

The Percentage of Empirical Papers Documenting Each Reproducibility Variable

Venue Year Papers
Repro. Score Reproducibility Score based on Gundersen et al. (2025)
Doc. Mean Global mean is the average score over the seven reproducibility variables for empirical research papers.
Doc. Median Global median is the median score over the seven reproducibility variables for empirical research papers.
Dataset Doc. Documentation mean is the average score over the Open Datasets and Dataset Splits reproducibility variables for empirical research papers.
Code Doc. Documentation mean is the average score over the Open Source Code reproducibility variables for empirical research papers.
Other Doc. Documentation mean is the average score over the Pseudocode, Hardware Specification, Software Dependencies, and Experiment Setup reproducibility variables for empirical research papers.
% Empirical Percentage of papers that are empirical research vs theoretical research
% Industry Percentage of empirical research papers with at least one author from Industry
Website
DMLR 2025 13 0.76 4.55 5.0 1.82 0.82 1.91 84.62% 18.18%
DMLR 2024 27 0.71 4.4 5.0 1.76 0.76 1.88 92.59% 56.0%