JBoss Data Virtualization: Integrating with Impala on Cloudera

Cloudera Impala is a tool for rapidly querying Hadoop data stored in HBase or HDFS using SQL syntax. You can use Red Hat JBoss Data Virtualization to query that same data through Impala, taking advantage of Impala's optimizations, and you can also combine it with other data sources in real time. The goal of this guide is to import data from a Cloudera Impala instance, manipulate it, and then expose it as a data service. The guide covers accessing a repository with example scripts, creating a custom base and view model, exposing it as a data service, and finally consuming that data via REST. This is a companion article to Unlock Your Cloudera Data with Red Hat JBoss Data Virtualization.



JBoss Data Virtualization on OpenShift: Integrating a Remote SQL Server Database

This example shows how to use a custom database driver on OpenShift to connect to an external database through a Virtual Database (VDB). We will use a Microsoft SQL Server database (believe it or not, running in a Linux container) and the latest SQL Server JDBC driver.

