JBoss Data Virtualization: Integrating with Impala on Cloudera

Cloudera Impala is a tool to rapidly query Hadoop data in HBase or HDFS using SQL syntax.  You can use Red Hat JBoss Data Virtualization to query that same data via Impala to take advantage of its optimization. You can also combine that data with other data sources in real time.  The goal of this guide is to import data from a Cloudera Impala instance, manipulate it, and then expose that data as a data service.  This guide includes access to a repository with example scripts, creating a custom base and view model, exposing it as a data service, and finally consuming that data via REST. This is a peer article to Unlock Your Cloudera Data with Red Hat JBoss Data Virtualization.

Continue reading “JBoss Data Virtualization: Integrating with Impala on Cloudera”

Share

Unlock Your Cloudera Data with Red Hat JBoss Data Virtualization

After Unlock your Hadoop data with Hortonworks and Red Hat JBoss Data Virtualization episode, let’s continue the journey with another “Apache Hadoop” episode of the series: “Unlock your [….] data with Red Hat JBoss Data Virtualization.” Through this blog series, we will look at how to connect Red Hat JBoss Data Virtualization (JDV) to different and heterogeneous data sources.

Continue reading “Unlock Your Cloudera Data with Red Hat JBoss Data Virtualization”

Share