What's new in Red Hat JBoss Data Virtualization 6.3

We are happy to announce the availability of Red Hat JBoss Data Virtualization (JDV) 6.3 GA. JDV 6.3 release focuses on three areas:

  1. In-memory technologies for Big Data processing
  2. Development and deployment productivity
  3. Big Data and Cloud data sources

The following new features and data source integration were added in support of these themes:

In-memory technologies for Big Data processing

Apache Spark – Apache Spark is an open source big data processing framework built around speed, ease of use, and sophisticated analytics that scales to large volumes of data. Apache Spark primarily achieves this speed via resilient distributed datasets (RDDs) that are stored in memory while being computed upon, thus eliminating expensive intermediate disk writes. In JDV 6.3, Spark is supported both as a data source and also as consumer of JDV. There are many benefits that JDV provides when integrated with Spark including:  providing centralized security, optimizing performance with pushdown queries to heterogeneous data sources, increased productivity with GUI modeling tools, and the ability to expose data through various standard interfaces.

JDG as materialization target for JDV – JBoss Data Grid (JDG) is designed for elastic scalability and fast access to large volumes of data. In addition to supporting JDG as data source as we did in previous releases, with JDV 6.3, we now also offer support for JDG as a materialization target.  JDG provides scalable, fast and consistent performance for externally materialized views in JDV. Note that the user experience of this integration will become even easier with more improvements coming to Teiid Designer shortly following the JDV 6.3 release.

SAP HANA – SAP HANA is a widely used in-memory, column-oriented, relational database management system. With JDV 6.3, SAP HANA can be integrated in three ways: as a materialization target, as a data source, and as a data consumer of JDV. When deployed together, these technologies complement each other to produce a highly performant, scalable and agile architecture for IOT and Big Data.

HPE Vertica – HPE Vertica Analytics Platform is a column-oriented platform designed to manage large, fast-growing large volumes of data and provide very fast query performance.  It has a hybrid in-memory approach that enables fast data loading through memory with near-real-time access to both on-disk and in-memory data.

Development and Deployment Productivity

OData V4 – OData (Open Data Protocol) is an OASIS standard that defines a definition for building and consuming RESTful APIs.  Some may consider OData to be like a SQL query language for REST.  The OData standard is backed by many companies including Microsoft, IBM, SAP and Red Hat.  With JDV 6.3, we are very excited to announce support for the most recent release of OData, version 4.  With v4, OData now has a much more compact JSON payload.  JDV 6.3supports exposing virtual databases as OData endpoints out of box with no code required. Note that in addition to OData v4, JDV has supported OData v2 for several releases and will continue to do so with JDV 6.3 for backward compatibility.

Kerberos on REST Interfaces – With JDV 6.2 we expanded on our Kerberos capabilities by adding the ability to propagate an existing Kerberos token from the client application stack all the way to the relational data sources.  With JDV 6.3 we add the ability to support Kerberos delegation to OData/REST sources, as well.

OAuth using RH-SSO – With JDV 6.3 we introduce the ability to use OAuth based security on the OData interface when using Red Hat Security (RH-SSO) as identity provider. The OAuth credentials can also be passed through to REST web service sources, when the source service also using RH-SSO

Tooling Improvements
JDV 6.3 is released with JBoss Developer Studio Integration Stack (JBDSIS) 9.0.1 and JBDS 9.1 (Eclipse Mars)

For JDV 6.3, we improved the usability of Teiid Designer in several areas including:

  • Full support for importing/exporting XML-based VDB definitions.  This supports team collaboration by allowing JDV users to share and version VDB files in their source control systems.
  • Improved framework for Preview Data features by adding support for generated query or ad-hoc user queries

VDB Builder CLI (TP) – The VDB Builder is a command line tool for creating, editing and managing Virtual Databases (VDBs) and other Teiid runtime artifacts. This tool is a result of our initiative to develop a non-Eclipse, extensible framework to support automation and team collaboration around VDB creation and maintenance.  VDB Builder will be released as a Technical Preview with JDV 6.3 and be fully supported post-release.

New Getting Started Materials
A series of new Getting Started videos with automated installation and configuration scripts that populate three heterogenous out of box data sources is now available on http://developers.redhat.com/products/datavirt/get-started.

Big Data and Cloud data sources

Amazon RedShift – Amazon RedShift is a popular hosted data warehouse product, which is part of the larger cloud computing platform Amazon Web Services. It is built on top of technology from the massive parallel processing (MPP) data warehouse ParAccel by Actian.  We offer support for the cloud hosted RedShift as a data source with JDV 6.3.

Apache Accumulo – Apache Accumulo is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval that originated at the National Security Agency.Accumulo has several interesting features such as cell-based access control. Accumulo is a newly supported data source with JDV 6.3.

Geospatial support – GeoServer is a Java-based software server that allows users to view and edit geospatial data. Using open standards set forth by the Open Geospatial Consortium (OGC), GeoServer allows for great flexibility in map creation and data sharing.  With JDV 6.3, JDV can be a source for GeoServer and much of the functionality described in the OpenGIS/PostGIS specification is supported.

Apache Cassandra – Apache Cassandra is a distributed database for managing large amounts of structured data across many commodity servers, while providing highly available service and no single point of failure. It was supported as a Technical Preview with JDV 6.2 and is fully supported with JDV 6.3.

Apache HBase – Apache HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and written in Java. In JDV 6.2, we released a resource adapter for Apache HBase as a Technical Preview and it is now officially certified following with JDV 6.3.

Coming Soon…

OpenShift – JDV 6.3 is in the process of being supported on OpenShift and will be available to customers this coming Fall.  The release on OpenShift will allow customers to easily expose multiple heterogenous data sources, both on-premise and in the cloud, as data services with RESTful OData API’s using a simplified Web UI.

OSIsoft PI – OSIsoft PI is a real-time data historian application with a highly efficient time-series database. Customers are using PI systems to record, analyze, and monitor real-time information such as manufacturing processes and product geneology from raw material to final product. OSIsoft PI will be certified post-JDV 6.3 GA.

Trying it out

The new version of Red Hat JBoss Data Virtualization 6.3 is available from http://developers.redhat.com/products/datavirt

 

Share