reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Module Extraction in Expressive Ontology Languages via Datalog Reasoning

Authors: Ana Armas Romero, Mark Kaminski, Bernardo Cuenca Grau, Ian Horrocks

JAIR 2016 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Our evaluation on a wide range of ontologies conﬁrms the feasibility and beneﬁts of our approach in practice.
Researcher Affiliation	Academia	Ana Armas Romero EMAIL Mark Kaminski EMAIL Bernardo Cuenca Grau EMAIL Ian Horrocks EMAIL Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford, OX1 3QD, UK
Pseudocode	No	The paper describes its methodology in detail in Sections 4, 5, 6, and 7, but does not present any structured pseudocode or algorithm blocks.
Open Source Code	Yes	We have implemented a prototype system for module extraction in Java, called Pris M, which bundles RDFox as a black-box datalog reasoner (Motik et al., 2014). Pris M is available online under academic license.4 4 http://www.cs.ox.ac.uk/isg/tools/Pris M/
Open Datasets	Yes	We have evaluated our system on a set of test ontologies identiﬁed in the work of Glimm, Horrocks, Motik, Stoilos, and Wang (2014) as non-trivial for reasoning. 5 The ontologies used in our experiments are available for download at https://krr-nas.cs.ox.ac.uk/2015/jair/Pris M/test Ontologies.zip.
Dataset Splits	Yes	Unlike the work of Del Vescovo et al. (2013), who deﬁned random signatures simply as random subsets of the ontology signature, we extracted such signatures using a randomised graph sampling algorithm. We ﬁrst represented the syntactic dependencies between symbols in the (normalised) ontology as a graph, and then traversed the graph in a randomised way until we visited a set number n of nodes. The symbols corresponding to the visited nodes were then taken as a random signature.7 The advantage of this approach is that it yields signatures that are semantically connected , which we believe is likely to be the case in practical applications. The number n was chosen by default as 0.1% of the total graph and then increased by up to two orders of magnitude in cases where the resulting signatures typically contained less than 15 predicates and thus were too small to provide additional information w.r.t. genuine signatures.
Hardware Specification	Yes	All experiments were performed on a server with 2 Intel Xeon E5-2670 2.60GHz processors, each of which has 8 physical cores that serve 2 virtual cores each, making a total of 32 virtual cores. In our experiments we allocated 90GB of RAM, and RDFox was always run on 16 threads.
Software Dependencies	No	We have implemented a prototype system for module extraction in Java, called Pris M, which bundles RDFox as a black-box datalog reasoner (Motik et al., 2014).
Experiment Setup	Yes	For each kind of signature and each ontology, we have considered a sample of 400 runs and averaged module sizes and module extraction times. ... RDFox was always run on 16 threads.