Data-centric Machine Learning Research (DMLR)
The Percentage of Empirical Papers Documenting Each Reproducibility Variable
Column definitions:

- Repro. Score: reproducibility score based on Gundersen et al. (2025).
- Doc. Mean: the average score over the seven reproducibility variables for empirical research papers.
- Doc. Median: the median score over the seven reproducibility variables for empirical research papers.
- Dataset Doc.: the average score over the Open Datasets and Dataset Splits reproducibility variables for empirical research papers.
- Code Doc.: the average score over the Open Source Code reproducibility variable for empirical research papers.
- Other Doc.: the average score over the Pseudocode, Hardware Specification, Software Dependencies, and Experiment Setup reproducibility variables for empirical research papers.
- % Empirical: percentage of papers that are empirical rather than theoretical research.
- % Industry: percentage of empirical research papers with at least one author from industry.

| Venue | Year | Papers | Repro. Score | Doc. Mean | Doc. Median | Dataset Doc. | Code Doc. | Other Doc. | % Empirical | % Industry | Website |
|---|---|---|---|---|---|---|---|---|---|---|---|
| DMLR | 2025 | 13 | 0.76 | 4.55 | 5.0 | 1.82 | 0.82 | 1.91 | 84.62% | 18.18% | |
| DMLR | 2024 | 27 | 0.71 | 4.4 | 5.0 | 1.76 | 0.76 | 1.88 | 92.59% | 56.0% | |
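
The column definitions can be checked against the two rows with a little arithmetic. The sketch below is only a consistency check under stated assumptions, not the site's own computation: it assumes that Dataset Doc., Code Doc., and Other Doc. partition the seven variables so that they add up to Doc. Mean, and that the number of empirical and industry-affiliated papers can be recovered by rounding the percentages. The dictionary keys and the `check_row` helper are hypothetical names chosen for illustration.

```python
# Values copied from the two table rows; key names are illustrative only.
rows = [
    {"year": 2025, "papers": 13, "doc_mean": 4.55,
     "dataset_doc": 1.82, "code_doc": 0.82, "other_doc": 1.91,
     "pct_empirical": 84.62, "pct_industry": 18.18},
    {"year": 2024, "papers": 27, "doc_mean": 4.4,
     "dataset_doc": 1.76, "code_doc": 0.76, "other_doc": 1.88,
     "pct_empirical": 92.59, "pct_industry": 56.0},
]


def check_row(row, tol=0.015):
    """Compare the group scores with Doc. Mean and infer paper counts.

    Assumption: the three group scores sum to Doc. Mean, up to the
    rounding of the published two-decimal values (hence the tolerance).
    """
    group_sum = row["dataset_doc"] + row["code_doc"] + row["other_doc"]
    # Inferred counts: the table reports percentages, not raw counts.
    n_empirical = round(row["papers"] * row["pct_empirical"] / 100)
    n_industry = round(n_empirical * row["pct_industry"] / 100)
    return {
        "year": row["year"],
        "group_sum": round(group_sum, 2),
        "sum_matches_doc_mean": abs(group_sum - row["doc_mean"]) <= tol,
        "empirical_papers": n_empirical,
        "industry_papers": n_industry,
    }


results = [check_row(r) for r in rows]
for result in results:
    print(result)
```

For both rows the three group scores add up to the reported Doc. Mean, which suggests these columns count documented variables per paper (out of 2, 1, and 4 respectively) rather than fractions. The inferred paper counts are a reading of the percentages, not figures stated in the table.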