On the Satisfiability Problem for SPARQL Patterns

Authors: Xiaowang Zhang, Jan Van den Bussche, François Picalausa

JAIR 2016 | Venue PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental 6. Experimental Evaluation We want to evaluate experimentally the positive results presented so far:...Our experiments follow up on those reported earlier by Picalausa and Vansummeren (2011). As test datasets of real-life SPARQL queries, we use logs of the SPARQL endpoint for DBpedia. This data source contains the query dumps from the year 2012, divided into 14 logfiles. Out of these we chose the three logs 20120913, 20120929 and 20121031 to obtain a span of roughly three months; we then took a sample of 100 000 queries from each of them.
Researcher Affiliation Academia Xiaowang Zhang EMAIL School of Computer Science and Technology, Tianjin University, China Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin, China Jan Van den Bussche EMAIL Hasselt University, Belgium Fran cois Picalausa EMAIL
Pseudocode No The paper describes methods verbally and formally using mathematical notation, but it does not contain any structured pseudocode or algorithm blocks.
Open Source Code No The paper mentions implementation details and experiments but does not provide any explicit statement about making the source code available or a link to a repository.
Open Datasets Yes As test datasets of real-life SPARQL queries, we use logs of the SPARQL endpoint for DBpedia.5 This data source contains the query dumps from the year 2012, divided into 14 logfiles. Out of these we chose the three logs 20120913, 20120929 and 20121031 to obtain a span of roughly three months; we then took a sample of 100 000 queries from each of them. 5. ftp://download.openlinksw.com/support/dbpedia/
Dataset Splits No The paper describes sampling of queries from log files for experimental evaluation (e.g., 'sample of 100 000 queries'), but it does not specify training, test, or validation dataset splits typically used for model evaluation.
Hardware Specification Yes The implementation of the tests was done in Java 7 under Windows 7, on an Intel Core 2 Duo SU94000 processor (1.40GHz, 800MHz, 3MB) with 3GB of memory (SDRAM DDR3 at 1067MHz).
Software Dependencies Yes The implementation of the tests was done in Java 7 under Windows 7, on an Intel Core 2 Duo SU94000 processor (1.40GHz, 800MHz, 3MB) with 3GB of memory (SDRAM DDR3 at 1067MHz).
Experiment Setup Yes Our tests measure the time needed to perform the analyses of SPARQL queries presented above. The timings are averaged over all queries in a log, and each experiment is repeated five times to smooth out accidental quirks of the operating system. Although we give absolute timings, the main emphasis is on the percentage of the time needed to analyse a query, with respect to the time needed simply to read and parse that query.