Cloudera Impala is a tool to rapidly query Hadoop data in HBase or HDFS using SQL syntax. You can use Red Hat JBoss Data Virtualization to query that same data via Impala to take advantage of its optimization. You can also combine that data with other data sources in real time. The goal of this guide is to import data from a Cloudera Impala instance, manipulate it, and then expose that data as a data service. This guide includes access to a repository with example scripts, creating a custom base and view model, exposing it as a data service, and finally consuming that data via REST. This is a peer article to Unlock Your Cloudera Data with Red Hat JBoss Data Virtualization.
Continue reading “JBoss Data Virtualization: Integrating with Impala on Cloudera”