reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Weisfeiler and Leman go Machine Learning: The Story so far

Authors: Christopher Morris, Yaron Lipman, Haggai Maron, Bastian Rieck, Nils M. Kriege, Martin Grohe, Matthias Fey, Karsten Borgwardt

JMLR 2023 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	Here, we give a comprehensive overview of the algorithm’s use in a machine-learning setting, focusing on the supervised regime. We discuss the theoretical background, show how to use it for supervised graph and node representation learning, discuss recent extensions, and outline the algorithm’s connection to (permutation-)equivariant neural architectures. Moreover, we give an overview of current applications and future directions to stimulate further research.
Researcher Affiliation	Collaboration	Christopher Morris EMAIL Department of Computer Science RWTH Aachen University Aachen, Germany Yaron Lipman EMAIL Meta AI Research Department of Computer Science and Applied Mathematics Weizmann Institute of Science Rehovot, Israel Haggai Maron EMAIL NVIDIA Research Tel Aviv, Israel Bastian Rieck EMAIL AIDOS Lab, Institute of AI for Health Helmholtz Zentrum M unchen and Technical University of Munich Munich, Germany Nils M. Kriege EMAIL Faculty of Computer Science, University of Vienna, Vienna, Austria Research Network Data Science, University of Vienna, Vienna, Austria Martin Grohe EMAIL Department of Computer Science RWTH Aachen University Aachen, Germany Matthias Fey EMAIL Kumo.AI Mountain View, CA Karsten Borgwardt EMAIL Machine Learning & Computational Biology Lab Department of Biosystems Science and Engineering ETH Z urich, Basel, Switzerland
Pseudocode	No	The paper describes algorithms such as the Weisfeiler Leman algorithm and GNN architectures using mathematical equations and textual descriptions, but does not include any explicitly labeled pseudocode or algorithm blocks.
Open Source Code	No	The paper mentions existing open-source and commercial software libraries such as RDKit and Open Eye Graph Sim TK (Section 1.2.2) that implement certain functionalities. However, it does not provide any statement or link for the source code of the methodology described in this survey paper.
Open Datasets	Yes	For the comparison of machine learning methods, several standard benchmark data sets were introduced (Hu et al., 2020; Morris et al., 2020a), which contain graphs representing various objects and concepts such as small molecules, proteins, and protein-protein interactions, citation networks, as well as letters, ﬁngerprints, and cuneiform signs.
Dataset Splits	No	This paper is a survey and does not present its own experimental results. Therefore, it does not specify any training/test/validation dataset splits.
Hardware Specification	No	This paper is a survey and does not present its own experimental results. Therefore, it does not specify any hardware used for running experiments.
Software Dependencies	No	The paper is a survey and primarily discusses existing methodologies. It does not describe new software requiring specific dependencies with version numbers for its own work. While it mentions RDKit and Open Eye Graph Sim TK as tools, these are not dependencies for the paper's own methodology.
Experiment Setup	No	This paper is a survey and does not present its own experimental results. Therefore, it does not describe a specific experimental setup or hyperparameter values.