Pivotal HD 2.0 is a commercial distribution of Apache Hadoop 2.2. Along with that it brings an in-memory, SQL database to Hadoop through seamless integration, Pivotal GemFire XD – a SQL compliant, in-memory database designed for real-time analytics for Big Data applications. GemFire facilitates real-time analytics on Hadoop and enables real-time Big Data analytics but is explicitly designed for data environments with high demands for scalability and availability. Pivotal HD 2.0 also expands analytic use cases by integration with GraphLab for graphing analytics as well as enhancements to HAWQ such as support for MADlib, R, Python, Java, and Parquet.
Organizations can process business data lakes that are manageable sets of data to quickly gain value in the Big Data world. When there is a need to quickly derive insights from real-time transactions, the most recent data, can be treated as business data lakes. These data can be maintained in-memory for quick response and querying analysis. Pivotal HD makes these data immediately available for SQL analysis in-memory or in HDFS completely eliminating the need for ETL. With business data lakes being the foundation for architecture of Pivotal HD 2.0, HAWQ, and GemFire XD, it is best suited for organizations that are looking to take advantage of real-time data analytics.
GemFire XD is an ANSI-compliant SQL database with high-availability features that can run over WANs. It can also coexist with the existing databases. In another major enhancement, HAWQ SQL-on-Hadoop query engine, that is based on the Greenplum database can now apply the more than 50 in-database algorithms in the MADlib Machine Learning Library. It also supports automatic translation of R, Python, and Java-based queries and applications. It also supports GraphLab, an open source framework that contains a set of tools and algorithms for analytics that allow data scientists and analysts to gain deeper insight into the data.
Josh Klahr, Vice President, Product Management, Pivotal says “When it comes to Hadoop, other approaches in the market have left customers with a mishmash of un-integrated products and processes. Pivotal HD 2.0 is the first platform to fully integrate proven enterprise in-memory technology, Pivotal GemFire XD, with advanced services on Hadoop 2.2 that provide native support for a comprehensive data science toolset. Data driven businesses now have the capabilities they need to gain a massive head start toward developing analytics and applications for more intelligent and innovative products and services,”