|
|
Thursday, August 23, 2012
11:00 AM - 11:30 AM
Level: | Technical - Intermediate
|
With the arrival of a new armada of NoSQL databases, chances are constantly increasing that you will be needing to integrate data from one of them with data from other sources.
In this talk, I'll be going over crazy data integration between various relational and NoSQL databases. You will learn how to use a visual interface to accomplish things like: - Loading Cassandra, MongoDB, HBase and Hadoop from text files and relational sources;
- Reading data from Cassandra, MondoDB, HBase and Hadoop, and post-processing the data to be able to sort and filter it, even when the NoSQL source doesn't support those functions;
- Interactive reporting directly against data in Cassandra, MongoDB, HBase and Hadoop without staging the data;
- Merging data from NoSQL, Hadoop and relational sources, and loading it into a data warehouse/mart
I'll also be giving a short introduction to Pentaho Kettle, the open source graphical programming and design tool involved, as well as a short overview of the NoSQL landscape so that you can get an idea of what's going on in the space.
Ian Fyfe is responsible for driving adoption of Pentaho's business analytics technologies, focusing on Pentaho's customer base and community to ensure their needs are being met and exceeded, and providing input on high-level product strategy and roadmap development. Ian brings extensive experience in the business analytics and data warehouse industry including Jaspersoft, PeopleSoft, Epiphany, Informix, and Business Objects.
|
|
|