reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees

Authors: Johanna Vielhaben, Stefan Bluecher, Nils Strodthoff

TMLR 2023 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We empirically demonstrate the superiority of MCD against more constrained concept deﬁnitions. We carry out our experiments on Image Net (Deng et al., 2009). First, we provide a concrete example for an MCD explanation and showcase its completeness relation introduced in Section 2.3 (beneﬁt 3). To this end, Figure 3 shows an MCD-SSC explanation of a Res Net50v2 prediction for a sample of the police van class in Image Net. We compare MCD with sparse subspace clustering (MCD-SSC), MCD with alternative clustering, and previous methods listed in Table 1 in terms of (1) faithfulness (beneﬁt 1) and (2) conciseness (beneﬁt 2) of the explanations.
Researcher Affiliation	Academia	Johanna Vielhaben EMAIL Explainable Artiﬁcial Intelligence Group Fraunhofer Heinrich-Hertz-Institute Stefan Blüecher EMAIL Machine Learning Group TU Berlin Nils Strodthoﬀ nils.strodthoﬀ@uol.de Division AI4Health Oldenburg University
Pseudocode	No	The paper describes the methodology and algorithms in prose and mathematical equations (e.g., Section 2 and Appendix A). There are no distinct sections or figures explicitly labeled as 'Pseudocode' or 'Algorithm' blocks.
Open Source Code	Yes	Code to reproduce our experiments is publicly available at https://github.com/jvielhaben/MCD-XAI.
Open Datasets	Yes	We carry out our experiments on Image Net (Deng et al., 2009). We base all our experiments on images from a diverse selection of ten Image Net classes, which roughly align with CIFAR10 classes
Dataset Splits	Yes	We carry out our experiments on Image Net (Deng et al., 2009). We base all our experiments on images from a diverse selection of ten Image Net classes, which roughly align with CIFAR10 classes. for each concept l, we sort test set samples by the maximum activation maxxy\|φβ,l x,y\| and choose the top-k samples as concept prototypes.
Hardware Specification	No	The paper mentions model architectures such as Res Net models and Swin Vision Transformer, but does not provide specific details about the CPU, GPU, or other hardware used to run the experiments.
Software Dependencies	No	The paper mentions using 'torchvision' and 'timm' for weights of Res Net and Swin-T models, but does not specify version numbers for these software components or any other libraries.
Experiment Setup	Yes	For all methods within the MCD framework, we ﬁx the number of concepts in a class-dependent way such that we reach a completeness score of η = 0.5. In all our experiments, we ﬁx the hyperparameter γ, which balances sparsity vs. robustness, to γ = 10.