A parallel RDF stream reasoner using the S4 platform



Title: A parallel RDF stream reasoner using the S4 platform
Status: finished
Master: Knowledge Technology and Intelligent Internet Applications
Student name: Jesper Hoeksema
Dates
Start: 2011/02/22
End: 2011/08/31
Supervision
Supervisor: Spyros Kotoulas
Second reader: Peter Boncz
Poster: Media:Posternaam.pdf


Abstract

1st KIM

Data streams are increasingly common in modern technologies such as the Internet. For example, click streams from particular IP addresses, search requests, or traffic information can be used to adapt advertisements, websites, or cars in real time. RDF streams and C-SPARQL aim to integrate these data streams with the Semantic Web by providing a data format and a query language for operating on streams of semantic data.

While several query processors for continuous RDF streams have been implemented, a system that performs real-time reasoning and querying over a stream of triples itself, rather than over portions of a stream inserted into an RDF repository, has not yet been created. In my thesis I will explore the possibility of building such a system: a distributed OWL2-RL reasoner and C-SPARQL query processor for RDF streams, built on the Yahoo S4 framework. In this presentation I will briefly describe the basic principles of the S4 framework and then outline the specific modules that need to be created to build a scalable stream reasoner.

2nd KIM

Data streams are increasingly common in modern technologies such as the Internet. For example, click streams from particular IP addresses, search requests, or traffic information can be used to adapt advertisements, websites, or cars in real time. RDF streams and C-SPARQL aim to integrate these data streams with the Semantic Web by providing a data format and a query language for operating on streams of semantic data.

While several RDF stream reasoners and query processors for continuous RDF streams have been implemented, a distributed reasoner that scales near-linearly and performs real-time reasoning and querying over streams of semantic data has not yet been created. In my thesis I have explored the possibility of building such a distributed system, capable of RDFS and OWL2-RL reasoning and of processing C-SPARQL queries, using the Yahoo S4 framework. In this final presentation, I will describe the resulting system, its components, and its performance, after briefly describing the background and basic principles of the technologies and frameworks used.