Modular Structures and Atomic Decomposition in Ontologies
Authors: Chiara Del Vescovo, Matthew Horridge, Bijan Parsia, Uli Sattler, Thomas Schneider, Haoruo Zhao
JAIR 2020
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | In this section, we complement the conceptual and theoretical contributions from the previous sections with an empirical study. For this purpose, we use an implementation of the decomposition algorithm described in Algorithm 1 using syntactic locality-based modules (⊥, ⊤, and ⊤⊥*), run it on a suitable corpus of ontologies, and evaluate the feasibility of this algorithm as well as the structure and granularity of the resulting ADs. |
| Researcher Affiliation | Collaboration | Chiara Del Vescovo is affiliated with British Broadcasting Corporation (industry). Matthew Horridge is affiliated with Stanford University (academia). Bijan Parsia, Uli Sattler, and Haoruo Zhao are affiliated with University of Manchester (academia). Thomas Schneider is affiliated with Universität Bremen (academia). Since there is a mix of industry and academic affiliations, the paper is classified as a collaboration. |
| Pseudocode | Yes | In Algorithm 1 we give the pseudocode of an algorithm for the computation of the AD that interleaves the above two points by using so-called key axioms for which to record modules, and by ensuring that, for each genuine module/each atom, exactly one key axiom is identified. |
| Open Source Code | No | The paper states: "We evaluate the implementation of Algorithm 1 that is part of the OWL API, namely the one available via Maven Central (maven.org) with an artifact Id of owlapi-tools." This indicates the authors *used* an existing open-source tool for their evaluation, rather than providing their own specific source code for the methodology described in their paper. |
| Open Datasets | Yes | To build our corpus, we started with the snapshot of the NCBO BioPortal ontology repository (footnote 18) by Matentzoglu and Parsia (2017), which contains 438 ontologies. [Footnote 18]: https://bioportal.bioontology.org/ |
| Dataset Splits | No | The paper describes how the corpus was curated (removing ABox axioms, empty ontologies, and non-OWL 2 DL ontologies) and analyzes the characteristics of this corpus (size, length, OWL profiles). However, it does not mention any training, test, or validation splits, as the experiments evaluate the performance of an algorithm on a set of ontologies rather than training a machine learning model requiring such splits. |
| Hardware Specification | Yes | All experiments have been performed on an Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz with 8 GB RAM, running Java version 1.8 JDK with an initial heap size of 1 GB and a maximal heap size of 8 GB. Time is measured in CPU time. |
| Software Dependencies | Yes | We use OWL API Version 5.1.11 to compute ⊥-, ⊤-, and ⊤⊥*-AD. All experiments have been performed on an Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz with 8 GB RAM, running Java version 1.8 JDK with an initial heap size of 1 GB and a maximal heap size of 8 GB. |
| Experiment Setup | Yes | We set 6 hours as the timeout for computing each kind of AD, i.e., we used a total timeout of 18 hours for all three kinds of AD. If the ontology cannot be decomposed within 6 hours for at least one kind of AD, we say that this ontology could not be decomposed due to a timeout. ... Time is measured in CPU time. |
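Since the evaluation uses the `owlapi-tools` artifact from Maven Central, a dependency declaration along these lines would pull it in. The `groupId` shown is the OWL API's usual group on Maven Central but is our assumption, not stated in the text; the version matches the 5.1.11 reported above.

```xml
<!-- Sketch of a Maven dependency for the artifact named in the paper.
     groupId is assumed; verify on Maven Central before use. -->
<dependency>
  <groupId>net.sourceforge.owlapi</groupId>
  <artifactId>owlapi-tools</artifactId>
  <version>5.1.11</version>
</dependency>
```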
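The pseudocode row above refers to the paper's Algorithm 1, which computes the AD via key axioms. The sketch below is not that algorithm; it is a minimal Python illustration of the standard characterisation behind it, namely that two axioms belong to the same atom iff each occurs in the locality-based module of the other. The `TOY_MODULES` table is invented for illustration and stands in for a real syntactic locality-based module extractor.

```python
from typing import Callable, Dict, FrozenSet, Set

Axiom = str

def atomic_decomposition(axioms: Set[Axiom],
                         module_of: Callable[[Axiom], Set[Axiom]]
                         ) -> Set[FrozenSet[Axiom]]:
    """Group axioms into atoms using the characterisation that two
    axioms share an atom iff each lies in the module of the other."""
    mods: Dict[Axiom, FrozenSet[Axiom]] = {
        a: frozenset(module_of(a)) for a in axioms
    }
    return {frozenset(b for b in mods[a] if a in mods[b]) for a in axioms}

# Hypothetical per-axiom modules standing in for a real locality-based
# module extractor (invented purely for illustration):
TOY_MODULES = {
    "ax1": {"ax1", "ax2", "ax3"},  # ax1's module pulls in ax2 and ax3
    "ax2": {"ax2", "ax3"},         # ax2 and ax3 occur in each other's modules
    "ax3": {"ax2", "ax3"},
}

atoms = atomic_decomposition(set(TOY_MODULES), TOY_MODULES.__getitem__)
# ax2 and ax3 form one atom; ax1 forms an atom of its own.
```

A real implementation would replace `TOY_MODULES` with calls to a syntactic locality module extractor such as the one in the OWL API used by the paper.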