reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

On the Satisfiability Problem for SPARQL Patterns

Authors: Xiaowang Zhang, Jan Van den Bussche, François Picalausa

JAIR 2016 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	6. Experimental Evaluation We want to evaluate experimentally the positive results presented so far:...Our experiments follow up on those reported earlier by Picalausa and Vansummeren (2011). As test datasets of real-life SPARQL queries, we use logs of the SPARQL endpoint for DBpedia. This data source contains the query dumps from the year 2012, divided into 14 logﬁles. Out of these we chose the three logs 20120913, 20120929 and 20121031 to obtain a span of roughly three months; we then took a sample of 100 000 queries from each of them.
Researcher Affiliation	Academia	Xiaowang Zhang EMAIL School of Computer Science and Technology, Tianjin University, China Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin, China Jan Van den Bussche EMAIL Hasselt University, Belgium Fran cois Picalausa EMAIL
Pseudocode	No	The paper describes methods verbally and formally using mathematical notation, but it does not contain any structured pseudocode or algorithm blocks.
Open Source Code	No	The paper mentions implementation details and experiments but does not provide any explicit statement about making the source code available or a link to a repository.
Open Datasets	Yes	As test datasets of real-life SPARQL queries, we use logs of the SPARQL endpoint for DBpedia.5 This data source contains the query dumps from the year 2012, divided into 14 logﬁles. Out of these we chose the three logs 20120913, 20120929 and 20121031 to obtain a span of roughly three months; we then took a sample of 100 000 queries from each of them. 5. ftp://download.openlinksw.com/support/dbpedia/
Dataset Splits	No	The paper describes sampling of queries from log files for experimental evaluation (e.g., 'sample of 100 000 queries'), but it does not specify training, test, or validation dataset splits typically used for model evaluation.
Hardware Specification	Yes	The implementation of the tests was done in Java 7 under Windows 7, on an Intel Core 2 Duo SU94000 processor (1.40GHz, 800MHz, 3MB) with 3GB of memory (SDRAM DDR3 at 1067MHz).
Software Dependencies	Yes	The implementation of the tests was done in Java 7 under Windows 7, on an Intel Core 2 Duo SU94000 processor (1.40GHz, 800MHz, 3MB) with 3GB of memory (SDRAM DDR3 at 1067MHz).
Experiment Setup	Yes	Our tests measure the time needed to perform the analyses of SPARQL queries presented above. The timings are averaged over all queries in a log, and each experiment is repeated ﬁve times to smooth out accidental quirks of the operating system. Although we give absolute timings, the main emphasis is on the percentage of the time needed to analyse a query, with respect to the time needed simply to read and parse that query.