reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Learning Gaussian DAG Models without Condition Number Bounds

Authors: Constantinos Costis Daskalakis, Anthimos Vardis Kandiros, Rui Yao

ICML 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We complement our theoretical findings with simulations on synthetic datasets that confirm our predictions. ... To validate our findings, we run Algorithm 1, Algorithm 2, and the algorithm in (Gao et al., 2022) on synthetic datasets generated by DAGs) and compare their performance.
Researcher Affiliation	Academia	1Department of EECS, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States. Correspondence to: Constantinos Daskalakis <EMAIL>, Vardis Kandiros <EMAIL>, Rui Yao <EMAIL>.
Pseudocode	Yes	Algorithm 1 Estimating the DAG Information Theoretically ... Algorithm 2 Efficient algorithm for recovering the topology ... Algorithm 3 Efficient algorithm for recovering the topology
Open Source Code	Yes	More simulations and code, including comparisons with the PC algorithm, can be found in Supplementary Materials E.2, F, and G.
Open Datasets	Yes	To validate our findings, we run Algorithm 1, Algorithm 2, and the algorithm in (Gao et al., 2022) on synthetic datasets generated by DAGs) and compare their performance. The samples are generated i.i.d. from a randomly generated DAG G, which is constructed as follows: (1) Draw a random permutation of {0, 1, . . . , n 1} as σ(0), . . . , σ(n 1). (2) For each node σ(j), choose σ(0), . . . , σ(j 1) as a parent of σ(j) i.i.d. with probability min( d j+2, 1 2). Then, truncate the parent set to at most d nodes. (3) For each edge i j, we assign bi,j i.i.d. uniformly sampled from [ bmax, bmin] [bmin, bmax] where 0 < bmin < bmax are tunable parameters.
Dataset Splits	Yes	For each combination of parameters n, d, m, the accuracy is tested by generating 45 random graphs, running all three algorithms on them, and calculating the fraction of successful recoveries of the graph. ... We set 100 trials and the figure shows how much portion of graphs are correctly identified. All the graphs are random graphs.
Hardware Specification	Yes	The experiments were run using Python with numpy and scikit-learn package on an 11th Gen Intel Core i711800H 2.30 GHz CPU with 16GB of memory.
Software Dependencies	No	The experiments were run using Python with numpy and scikit-learn package on an 11th Gen Intel Core i711800H 2.30 GHz CPU with 16GB of memory. The versions of Python, numpy, and scikit-learn are not specified.
Experiment Setup	Yes	Listing 6. Paremeter settings and function to do the experiments ... n = 5 # Choose any n you want d = 2 # Choose any d you want b_min = 0.5 b_max = 1 # Choose any b_max you want sigma = 1 gamma = b_min**2/2 # As Gao et al 2022 repeat = 45 # Choose the number of trials you want