Anytime Large-Scale Analytics of Linked Open Data

Abstract : Analytical queries are queries with numerical aggregators: computing the average number of objects per property, identifying the most frequent subjects, etc. Such queries are essential to monitor the quality and the content of the Linked Open Data (LOD) cloud. Many analytical queries cannot be executed directly on the SPARQL endpoints, because the fair use policy cuts off expensive queries. In this paper, we show how to rewrite such queries into a set of queries that each satisfy the fair use policy. We then show how to execute these queries in such a way that the result provably converges to the exact query answer. Our algorithm is an anytime algorithm, meaning that it can give intermediate approximate results at any time point. Our experiments show that the approach converges rapidly towards the exact solution, and that it can compute even complex indicators at the scale of the LOD cloud.
Document type :
Conference papers
Complete list of metadatas

Cited literature [35 references]  Display  Hide  Download

https://hal.telecom-paristech.fr/hal-02302747
Contributor : Fabian Suchanek <>
Submitted on : Tuesday, October 1, 2019 - 10:04:10 PM
Last modification on : Thursday, October 17, 2019 - 12:36:55 PM

File

iswc-2019.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02302747, version 1

Citation

Arnaud Soulet, Fabian Suchanek. Anytime Large-Scale Analytics of Linked Open Data. ISWC, 2019, Auckland, New Zealand. ⟨hal-02302747⟩

Share

Metrics

Record views

12

Files downloads

30