reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Constrained Generative Modeling with Manually Bridged Diffusion Models

Authors: Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg, Frank Wood

AAAI 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	5 Experiments We demonstrate MBM on a simple 2D synthetic dataset and a traffic scenario generation experiment with collision and offroad avoidance. Additionally, we include an image watermarking experiment in the appendix. We release the source code implementing MBM together with the 2D synthetic and image watermarking experiments.
Researcher Affiliation	Collaboration	1Department of Computer Science, University of British Columbia, Vancouver, Canada 2Alberta Machine Intelligence Institute (Amii), Edmonton, Canada 3Inverted AI, Vancouver, Canada
Pseudocode	No	The paper describes the methodology using mathematical formulations and textual explanations but does not include any explicitly labeled pseudocode or algorithm blocks.
Open Source Code	Yes	Code and extended version github.com/plai-group/manually-bridged-models. We release the source code implementing MBM together with the 2D synthetic and image watermarking experiments.
Open Datasets	No	The paper describes the use of a "simple 2D synthetic dataset" and data from a "traffic scenario generation experiment" but does not provide any specific links, DOIs, repositories, or formal citations for public access to these datasets.
Dataset Splits	No	Our dataset consists of 1,000 samples from this data distribution (Fig. 3b). The problem in this experiment is to generate up to 25 vehicles on a bird s-eye view image of a road from one of 70 locations in the dataset. The paper mentions dataset sizes and origins but does not provide specific details on how the data was split into training, validation, or test sets.
Hardware Specification	No	This research was enabled in part by technical support and computational resources provided by the Digital Research Alliance of Canada Compute Canada (alliancecan.ca), the Advanced Research Computing at the University of British Columbia (arc.ubc.ca), and Amazon. The paper mentions general computing resources but does not specify particular hardware components such as GPU or CPU models, or memory specifications.
Software Dependencies	No	The paper does not provide specific ancillary software details, such as library names with version numbers (e.g., Python, PyTorch, CUDA versions) needed to replicate the experiment.
Experiment Setup	No	The γ functions for collision and offroad are respectively γc(t) = 1 / (10σ2(t)) and γo(t) = 1 / (100σ2(t)). It takes the DB-arch models around 250k iterations to achieve their maximum validation ELBO while the other models achieve the maximum at around 20k-30k iterations. While specific iteration counts and bridge function definitions are given, the paper lacks other explicit details on experimental setup such as learning rates, batch sizes, optimizers, or model initialization strategies.