On demand access to Big Data through Semantic Technologies

Dealing with Big Data involves a number of different challenges, including increasing volume (amount of data), velocity (speed of data), and variety (range of data types, sources).

Most Big Data solutions today focus on volume, in particular supporting vertical scalability. Yet the Big Data problem is not fully solved by vertical scale technologies alone.

A huge problem is that of horizontal scale. Consider the wealth of data that is published in open data initiatives:

We are faced with a massive number of data sources, with a high degree of variety and heterogeneity in coverage, data models, and structure. Solving these problems and enabling users to tap into this wealth of data for on demand analytics bears enormous potentials and economic opportunities.

In this talk we present solutions that enable on demand access to heterogeneous, distributed Big Data, in particular applying semantic technologies and the Linked Data paradigm. We demonstrate the use of these technologies in the Information Workbench, a platform for self-service analytics. Following a simple self-service process, the platform supports end users in

the discovery of relevant data sources, tapping into the Linked Open Data cloud and other open data sources,
the automated integration and interlinking of sources, and
on demand and interactive exploration and analysis of data.

We will also present concrete examples and customer case studies, and lessons learned from applying Linked Data in a variety of domains, including the Life Sciences and Publishing & Media.

providing a semantic end-to-end connection between users and data sources
enabling users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations
seamlessly integrating data spread across multiple distributed data sources

Peter Haase is working as a lead architect at fluid Operations, where he is leading the research and development activities at the interface of semantic technologies and cloud computing. Previously, Peter was at the Institute of Applied Informatics and Formal Description Methods ([AIFB) at the University of Karlsruhe, where he obtained his PhD in 2006. Before joining the AIFB, he worked in the Silicon Valley Labs of IBM in the development of DB2 until 2003. His research interests include ontology management and evolution, decentralized information systems and Semantic Web. At the AIFB, he previously worked in the EU IST project SWAP (Semantic Web and Peer-to-Peer) and SEKT (Semantically Enabled Knowledge Technologies) and was working as a project leader for the EU IST project NeOn (Lifecycle Support for Networked Ontologies).