EMC Corporation today announced a new distribution of Apache Hadoop, Pivotal HD. Pivotal HD features native integration of EMC’s Greenplum massively parallel processing (MPP) database with Apache Hadoop, a cost-effective and flexible open source Big Data platform.
The new EMC Greenplum-developed HAWQ technology brings 10 years of large scale data management research and development to Hadoop and delivers more than 100X performance improvements when compared to existing SQL-like services on top of Hadoop. Leveraging the feature richness and maturity of the industry leading Greenplum MPP analytical database, this innovation has resulted in the first true SQL parallel database on top of the Hadoop Distributed File System (HDFS).
Hadoop has rapidly emerged as a preferred solution for Big Data analytics applications that grapple with vast repositories of unstructured data. It is flexible, scalable, inexpensive, fault-tolerant, and enjoys rapid adoption rates and a rich ecosystem surrounded by massive investment. However, customers face high hurdles to broadly adopting Hadoop as their singular data repository due to a lack of useful interfaces and high-level tooling for Business Intelligence and datamining, components that are critical to data analytics and building a data-driven enterprise.
EMC tackles addresses these challenges with Pivotal HD, a true SQL processing for Hadoop, offering the full spectrum of the SQL interface, and by extension, the entire ecosystem of products that support SQL. Customers no longer need an army of developers to build a dashboard or run a report. Unlike competitive Hadoop distributions, Pivotal HD does this without moving data between systems or using connectors that require users to store the data twice.
Pivotal HD cuts out the complexity of using Hadoop, thus expanding the platform’s potential and productivity, and allowing customers to enjoy the benefits of the most cost-effective and flexible data processing platform ever developed.
HAWQ is the key differentiating technology in making Pivotal HD the world’s most powerful Hadoop distribution. Capabilities of note include Dynamic Pipelining, query optimiser, horizontal scaling, SQL compliant, interactive query, deep analytics, and support for common Hadoop formats.
Pivotal HD is expected to be available at the end of the first quarter of this year as a software-only or appliance-based solution, backed by EMC’s global 24x7 support infrastructure.