SR4LD 2015
a project with a background image
Tutorial on Stream Reasoning for Linked Data at ISWC 2015
October 11th, 2015
**Bethlehem, Pennsylvania, US**
Co-located with the 14th International Semantic Web Conference (ISWC 2015)
Abstract Motivation Program Prerequisite knowledge The team of presenters References
Abstract
The goal of the Stream Reasoning for Linked Data tutorial is twofold: to (1) introduce scalable reasoning and querying techniques to SW researchers as powerful tool to make use of linked data and large-scale ontologies, and to (2) present interesting research problems for SW that arise in reasoning with highly dynamic data streams [DCvF09,RPZ10a]. The tutorial consists of five parts. It will begin with an introduction of linked data streams, as well as reasoning using the Semantic Web standard ontology language OWL 2. The second part will introduce semantic processing of data streams explained using C-SPARQL, a continuous extension of SPARQL for querying RDF streams and RDF graphs. The third part will provide an overview of ontology-based access to data streams through query rewriting to Stream Processing Engines and using stream-to-ontology mappings. The fourth part of the tutorial is a hands-on session on tools and systems related to the previous parts. The fith part of the tutorial will present other stream reasoning techniques for RDFS and OWL2-RL. The last part will wrap up the tutorial and present an overview of the open challenges.
Motivation
Nowadays, more and more dynamic information is becoming available to decision makers in the form of continuous data streams. These data streams occur in a variety of modern applications, such as network monitoring, traffic engineering, sensor networks, RFID tags, microposts, telecom records, Web logs, click-streams, etc. Processing these continuous flows of information and reasoning taking into account ontological knowledge is certainly one of the key challenges for semantics in the future Internet. While reasoners scale up in the classical, static domain of ontological knowledge, reasoning upon rapidly changing information has received attention only very recently. The combination of reasoning techniques with data streams gives rise to Stream Reasoning, a high impact research area that has already started to produce results that are relevant for both the semantic and data processing communities. This tutorial aims at introducing different existing approaches for reasoning and querying over data streams, and providing the audience with an overview of techniques and tools that can be used for this purpose. The contents of this tutorial can be relevant for ISWC attendees as it focuses in two of the main tasks in semantic data processing, reasoning and querying, in the context of streaming data that is ubiquitous in a large number of applications on the Web.
Program
First part: Introduction to RDF Stream Processing and Reasoning - 9.00 - 10.30
Introduction to Stream Reasoning (30 min) [slides]
The first session gives an overview of the Stream Reasoning research area, covering:
- Challenges, and how existing systems (DSMS/CEP, Semantic Web) address them
- Scope of Stream Reasoning research area
- Existing Systems (quick introduction and high-level comparison)
RSP extensions for RDF and SPARQL (60 min) [slides]
This session covers:
- RDF and SPARQL extensions to manage streaming data
- Overview of RDF model extensions (single timestamped RDF, double-timestamped RDF, etc.)
- Quick recap on SPARQL and SPARQL continuous extensions (e.g. windows, S2R operators, sequencing, followed-by operator)
- Overview of existing systems w.r.t. models presented above
- Practical examples of semantic sensor network querying using SPARQLStream [CCG10,CJCA12]
Coffee Break: 10:30 - 11:00
Second part: Stream Reasoning with Answer Set Programming [slides] - 11:00 - 12:30
This session covers:
- Formal semantics of Stream Reasoning with Answer Set Programming (ASP) [GGK + 13]
- ASP solver for exploratory search dynamic problems encoded as incremental ASP rules
- Stream Reasoning with Probabilistic ASP [NM14]
- Approximate stream reasoning via Inductive Logic Programming
Lunch break 12:30 - 14:00
Third part: Stream Reasoning with IMaRS - 14.00 - 15.30
Stream Reasoning: Naive Approaches (30 min) [slides]
This session covers:
- Problem definition and challenges
- Full goal drive approaches on each snapshot
- Materialization of each snapshot in a stream
- Query rewriting based approaches
- the DReD approach for incremental maintenance of materialisations
IMaRS: Incremental Materialization for RDF Streams (30m) [slides]
This session presents IMaRS, a variation of DRed for the incremental maintenance of the window materializations. This session covers:
- Optimization techniques for incrementally maintaining materializations when changes are caused by streaming data
- Practical examples of continuous social media analysis
Fourth part: The Stream Reasoning Realm
Other Stream Reasoning approaches (30 min) [slides]
This session covers:
- Sparkwave: Continuous Schema-Enhanced Pattern Matching over RDF Data Streams [KCF12]
- DynamiTE: Parallel Materialization of Dynamic RDF Data [UMJ + 13]
- Efficient RDF stream reasoning with GPUs [LUQ14]
Coffee Break: 15:30 - 16:00
Fourth part: The Stream Reasoning Realm - 16:00 - 17:30
Other Stream Reasoning approaches (60 min) [slides]
This session covers:
- Stream reasoning and complex event processing in ETALIS [ARFS12]
- EP-SPARQL [ARFS12]
- Ontology Stream Reasoning with Truth Maintenance Systems [RPZ10a,RPZ10b]
- LARS [BDTE + 15]
- STARQL [OMN14]
Wrap-up and conclusions (30 min) [slides]
This session covers:
- Achievements of existing approaches w.r.t. Stream Reasoning Challenges
- Open problems and a revised Stream Reasoning research agenda
- Open Q/A
Prerequisite knowledge
Basic knowledge in Semantic Web may allow better following the tutorial and gaining more benefits from it.
The team of presenters
- Muhammad Intizar Ali (Insight Centre for Data Analytics)
- Jean-Paul Calbimonte (École Polytechnique Fédérale de Lausanne)
- Daniele Dell’Aglio (Politecnico di Milano)
- Emanuele Della Valle (Politecnico di Milano)
- Alessandra Mileo (Insight Centre for Data Analytics)
References
- [BBC + 10a] Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Yi Huang, Volker Tresp, Achim Rettinger, Hendrik Wermser: Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics. IEEE Intelligent Systems 25(6): 32-41 (2010)
- [BBC + 10b] Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Michael Grossniklaus: C-SPARQL: a Continuous Query Language for RDF Data Streams. Int. J. Semantic Computing 4(1): 3-25 (2010)
- [BBC + 10c] Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Michael Grossniklaus: Incremental Reasoning on Streams and Rich Background Knowledge. ESWC (1) 2010: 1-15
- [BBC + 10d] Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Michael Grossniklaus: Querying RDF streams with C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
- [CCG10] Jean-Paul Calbimonte, Oscar Corcho, and Alasdair J. G. Gray. Enabling ontology-based access to streaming data sources. In Proc. of the International Semantic Web Conference ISWC 2010, pages 96–111, 2010.
- [CJCA12] Jean-Paul Calbimonte, Hoyoung Jeung, Oscar Corcho, and Karl Aberer. Enabling query technologies for the semantic sensor web. International Journal On Semantic Web and Information Systems (IJSWIS), 8(1):43–63, 2012.
- [DCvF09] Emanuele Della Valle, Stefano Ceri, Frank van Harmelen, Dieter Fensel: It’s a Streaming World! Reasoning upon Rapidly Changing Information. IEEE Intelligent Systems 24(6): 83-89 (2009)
- [PT07] Jeff Z. Pan and Edward Thomas. Approximating OWL-DL Ontologies. In Proc. of the 22nd AAAI Conference on Artificial Intelligence (AAAI-07). 1434-1439. 2007.
- Yuan Ren, Jeff Z. Pan and Yuting Zhao. Towards Scalable Reasoning on Ontology Streams via Syntactic Approximation. In Proc. of the ISWC2010 Workshop on Ontology Dynamics (IWOD2010). 2010.
- [RPZ10a] Yuan Ren, Jeff Z. Pan, and Yuting Zhao. Soundness Preserving Approximation for TBox Reasoning. In the Proc. of the 25th AAAI Conference Conference (AAAI2010), 2010.
- [VSM05] Raphael Volz, Steffen Staab, and Boris Motik. Incrementally maintaining materializations of ontologies stored in logic databases. J. Data Semantics, 2:1–34, 2005.
- [LUQ14] Chang Liu, Jacopo Urbani, Guilin Qi: Efficient RDF stream reasoning with graphics processingunits (GPUs). WWW (Companion Volume) 2014: 343-344
- [ARFS12] Darko Anicic, Sebastian Rudolph, Paul Fodor, Nenad Stojanovic: Stream reasoning and complex event processing in ETALIS. Semantic Web 3(4): 397-407 (2012)
- [KCF12] Srdjan Komazec, Davide Cerri, Dieter Fensel: Sparkwave: continuous schema-enhanced pattern matching over RDF data streams. DEBS 2012: 58-68