reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Slicing Unbalanced Optimal Transport

Authors: Clément Bonet, Kimia Nadjahi, Thibault Séjourné, Kilian Fatras, Nicolas Courty

TMLR 2025

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We finally conduct an empirical analysis of our loss functions and methodology on both synthetic and real datasets, to illustrate their computational efficiency, relevance and applicability to real-world scenarios including geophysical data.
Researcher Affiliation	Academia	Clément Bonet (CREST, ENSAE, IP Paris, Palaiseau, France); Kimia Nadjahi (CNRS, ENS, Paris, France); Thibault Séjourné (LTS4, EPFL, Lausanne, Switzerland); Kilian Fatras (Mila, McGill University, Montreal, Canada); Nicolas Courty (IRISA, Université Bretagne-Sud, Vannes, France)
Pseudocode	Yes	Algorithm 1: Norm(α, β, f, g, ρ1, ρ2); Algorithm 2: SUOT; Algorithm 3: USOT; Algorithm 4: FWStep(f, g, r, s, γ); Algorithm 5: SlicedOTLoss(α, β, {θ}, p); Algorithm 6: SlicedOTPotentialsBackprop(α, β, {θ}, p); Algorithm 7: Barycenter((αb)b, (ωb)b, ρ1, ρ2, lr)
Open Source Code	Yes	The code is available at https://github.com/clbonet/Slicing_Unbalanced_Optimal_Transport.
Open Datasets	Yes	We consider the BBCSport dataset (Kusner et al., 2015), a standard benchmark with small documents for which OT can be used effectively, and the Goodreads dataset (Maharjan et al., 2017) on two tasks (genre and likability predictions)... We use the ClimateNet dataset (Prabhat et al., 2021), and more specifically the TMQ (precipitable water) indicator. The ClimateNet dataset is a human-expert-labeled curated dataset which captures tropical cyclones (TCs), among other things.
Dataset Splits	Yes	We consider the BBCSport dataset (Kusner et al., 2015)... We average over the same 5 train/test splits as (Kusner et al., 2015). The movie reviews dataset (Pang et al., 2002)... We take five different random 75/25 train/test splits. This dataset, proposed in (Maharjan et al., 2017)... The five train/test splits are randomly drawn with 75/25 proportions.
Hardware Specification	Yes	All the benchmark methods are computed using the Python OT library (Flamary et al., 2021) on an NVIDIA Tesla V100 GPU. Computations have been performed with an NVIDIA Tesla V100 GPU.
Software Dependencies	No	The paper mentions using the Python OT library, PyTorch, and Torchvision, but does not provide version numbers for any of them.
Experiment Setup	Yes	For sliced methods, we average over 3 computations of the loss matrix and report the standard deviation in Table 1. The number of neighbors was selected via cross validation. The results for UOT, Sinkh UOT, SUOT and USOT are reported for ρ yielding the best accuracy among a grid (see Appendix C.1 for more details), and we display an ablation of this parameter on the BBCSport dataset in Figure 2. For every result, we iterate the gradient flow for 100 iterations, and the learning rate γ is set to 10⁻². We run the barycenter algorithm for 500 iterations (with K = 64 projections)...
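
The split protocol quoted in the table (five random 75/25 train/test splits, with results averaged and a standard deviation reported) can be sketched as follows. This is a minimal illustration, not the authors' code: the function names `random_splits` and `mean_and_std`, the seed, and the sample count of 100 are hypothetical, and the quoted text does not say how the paper draws its splits beyond the proportions.

```python
import numpy as np


def random_splits(n_samples, n_splits=5, train_fraction=0.75, seed=0):
    """Yield (train_idx, test_idx) index arrays for repeated random splits.

    Hypothetical helper: the seed and the rounding rule are assumptions.
    """
    rng = np.random.default_rng(seed)
    n_train = int(round(train_fraction * n_samples))
    for _ in range(n_splits):
        # A fresh permutation per split gives independent random partitions.
        perm = rng.permutation(n_samples)
        yield perm[:n_train], perm[n_train:]


def mean_and_std(scores):
    """Return the mean and (population) standard deviation of per-split scores."""
    scores = np.asarray(scores, dtype=float)
    return scores.mean(), scores.std()


# Example: index splits for a hypothetical dataset of 100 documents.
splits = list(random_splits(100))
```

Each element of `splits` is a pair of disjoint index arrays covering all samples, so a classifier can be fit on the first and scored on the second; passing the five resulting accuracies to `mean_and_std` gives the summary statistics of the kind reported per dataset.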