reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Seeded Graph Matching for Correlated Erdos-Renyi Graphs

Authors: Vince Lyzinski, Donniell E. Fishkind, Carey E. Priebe

JMLR 2014 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We conclude this paper with simulations and a real-data example from human connectomics. These simulations and experiments illuminate the relationship between seeded graph matching via Frank-Wolfe and restricted-focus graph matching via the Hungarian algorithm. We demonstrate that Frank-Wolfe methodology is often superior to restrictedfocus graph matching, an unsurprising result as the Frank-Wolfe methodology merges restricted-focus graph matching with seedless Frank-Wolfe methodology.
Researcher Affiliation	Academia	Vince Lyzinski EMAIL Human Language Technology Center of Excellence Johns Hopkins University Baltimore, MD, 21218, USA Donniell E. Fishkind EMAIL Carey E. Priebe EMAIL Department of Applied Mathematics and Statistics Johns Hopkins University Baltimore, MD, 21218, USA
Pseudocode	No	The paper describes the algorithms (Frank-Wolfe methodology, SGM algorithm, RGM) in prose, detailing the mathematical formulations and steps involved. However, it does not present any formal pseudocode blocks or algorithms labeled as such.
Open Source Code	No	The paper does not explicitly state that source code for the described methodology is being released, nor does it provide any direct links to a code repository. It mentions data availability, but not code.
Open Datasets	Yes	Our data set consists of 45 graphs, each on 70 vertices, these graphs constructed respectively from diffusion tensor (DT) MRI scans of 45 distinct healthy patients. We have 21 scans from the Kennedy Krieger Institute (KKI), with raw data available at http://www.nitrc.org/projects/multimodal/, and 24 scans from the Nathan Kline Institute (NKI), with a description of the raw data available at http://fcon_1000.projects.nitrc.org/indi/pro/e NKI_RS_TRT/Front Page.html. All graphs can be found at http://openconnecto.me/data/public/.MR/ MIGRAINE/.
Dataset Splits	No	For the real-data experiment, the paper describes matching pairs of graphs from two different institutes (NKI and KKI) and performing Monte Carlo replicates for different seed levels, but it does not specify traditional training/validation/test splits. For simulations, it states running '100 simulations' for each parameter combination, which describes experimental runs rather than a dataset split strategy.
Hardware Specification	No	The paper does not provide specific details about the hardware (e.g., CPU, GPU models, memory) used to run the simulations or real-data experiments. It only mentions the source of the data (MRI scans) and the general processing pipeline.
Software Dependencies	No	The paper discusses various algorithms and methodologies like Frank-Wolfe, Hungarian algorithm, SGM, and FAQ. However, it does not specify any software names or versions for these, or any other ancillary software components used for implementation or analysis.
Experiment Setup	Yes	We explore the above further in Figure 1. There we compare the performance of SGM against solving RGM for correlated Erd os-R enyi graphs with n = 300 vertices, p = 0.5, seeding levels ranging from s = 0 to 275, and correlation ranging from ϱ = 0.1 to 1. For each value of ϱ and s, we ran 100 simulations and plotted the fraction of nonseeded vertices correctly matched across the graphs, with corresponding error bars of 2 s.e. Moreover, if 100 \|V (G1)\| and G1 is selected with a discrete-uniform distribution (i.e., all possible graphs on V (G1) are equally likely) and G2 is an isomorphic copy of G1 with V (G2) being a discrete-uniform random permutation of V (G1), then the probability that FAQ (with, say, 20 Frank-Wolfe iterations allowed) yields the correct isomorphism is empirically observed to be very nearly 1.