Tutorial on RDF Stream Processing
The tutorial provides a comprehensive view of the RDF-Stream Processing (RSP) research area. It consists of four parts. The first one introduces the RSP basic concepts: RDF streams to represent temporally-ordered sequence of data items; continuous SPARQL extensions to query RDF streams, and RSP engines to execute continuous query answering over RDF streams. The second part presents the available RSP engine implementations. It starts with an overview on the existing RSP engines, highlighting similarities and differences among them. Next, two existing implementations are analysed in depth: C-SPARQL and SPARQLstream. The third part is a hands-on session where the attendees learn how to (1) use the three presented RSP engines presented above and (2) let the systems interact among them. Finally, the fourth part of the tutorial provides an overview on RSP-related topics: RSP engine benchmarking, stream reasoning and real-world deployments. The tutorial closes with a discussion on the open challenges and the research problems of this research field.
Nowadays, more and more dynamic information is becoming available to decision makers in the form of continuous data streams. These data streams occur in a variety of modern applications, such as network monitoring, traffic engineering, sensor networks, RFID tags, micro posts, telecom records, Web logs, click streams, etc. Processing these continuous flows of information and reasoning taking into account ontological knowledge is certainly one of the key challenges for semantics in the future Internet. The online and continuous query answering over highly dynamic and heterogeneous data has received attention only
very recently. The combination of query answering techniques with data streams gives rise to RDF-Stream Processing, a high impact research area that has already produced results that are relevant for both the semantic and data processing communities. This tutorial aims at introducing different existing approaches for querying over RDF data streams, and providing the audience with an overview of techniques and tools that can be used for this purpose. The contents of this tutorial are relevant for ESWC as it focuses on the query answering in the context of streaming data that is ubiquitous in a large number of applications on the Web. Attendees will learn the main concepts of RSP (e.g., what is a window and how it works), and they will learn how to write and execute continuous queries over RDF streams.
The learning outcomes are:
- an overview on the basic concepts in RDF stream processing: extensions to RDF and SPARQL to model data streams and query them in a continuous fashion;
- an overview (and a comparison ) of the most relevant available implementations, with a particular focus on C-SPARQL and SPARQLstream (that will be used during the hands-on sessions).
The tutorial targets researchers with a basic knowledge of Semantic Web (i.e., RDF, query answering system, SPARQL). In the hands-on session we will use graphical user interfaces, to allow everyone (even who does not have Jave or an IDE installed) to try the systems.
When and Where
The tutorial is scheduled for May 25th, 2014 in conjunction with ESWC 2014.
14.30 – 14.45 Introduction to RDF-stream processing (15m)
14.45 – 16.00 RDF stream models, RDF stream processing models and implementations (90 min) [slide]
16.00 – 16.30 Coffee break
16.30 – 17.45 Hands-on Session (90m) [c-sparql][c-sparql2][sparqlstream1][sparqlstream2]
17.45 – 18.00 (Open research problems and) wrap-up (15m)
Jean-Paul Calbimonte, École Polytechnique Fédérale de Lausanne
Daniele Dell’Aglio, Dipartimento di Elettronica, Informatica e Bioingegneria, Politecnico di Milano
Marco Balduini, Dipartimento di Elettronica, Informatica e Bioingegneria, Politecnico di Milano