
Now Certified: Deploying Cloudera on Containers with BlueData

Today we’re excited to announce that Hewlett Packard Enterprise, with solutions from BlueData, has partnered with Cloudera to deliver the Enterprise Data Cloud experience running on containers.

Building upon our existing partnership and joint customer engagements, along with a similar in-depth certification process with Hortonworks (certified since 2014), HPE’s container-based BlueData EPIC software platform is now Quality Assured Testing Suite (QATS) certified for both Cloudera CDH and HDP.

This deeper certification means our joint customers can supercharge their Enterprise Data Cloud and data lake deployments. Now they have greater agility and flexibility, with the ability to quickly deploy multiple versions of CDH and HDP in multi-tenant containerized environments, backed up by in-depth performance testing and validation from Cloudera. Going forward, we will extend this certification to the new unified Cloudera Data Platform (CDP) once it is released.

Our joint customers benefit from self-service elastic clusters for large-scale data analytics and machine learning, with enterprise data governance. They can leverage their existing data lake investments while exploiting the benefits of containers and the separation of compute and storage. They have complete flexibility to deploy Cloudera running on containers on-premises, in the public cloud, across multiple clouds, or in a hybrid cloud model. Together we can make large-scale enterprise deployments easier, faster, and more cost-effective.

HPE, BlueData, and Cloudera

In the past few years, enterprise priorities have matured from a focus on bringing various data sources together in a data lake to initiatives focused on extracting value from that data with advanced data analytics like machine learning using the latest data science toolkits. At the same time, hybrid and multi-cloud strategies have become the norm—and containers have become the standard as technologies like Docker and Kubernetes have become more widely adopted.

HPE is delivering on our vision to enable customers to capture, analyze, and act upon data seamlessly from edge to core to cloud. The BlueData acquisition is one of the ways HPE is delivering on that vision, using container technology to deploy large-scale distributed analytics and machine learning in hybrid deployments.

Cloudera has always embraced the power of the cloud and has long been dedicated to enterprise-ready advanced data analytics, so they are a natural partner for HPE and BlueData in this endeavor. Furthermore, the release of the “container ready” CDH version 6.2 solidifies this relationship, enabling the delivery of the Enterprise Data Cloud consistently and seamlessly across all public cloud services, private cloud infrastructure, and hybrid cloud deployments.

When put together, our joint solution provides a secure and powerful containerization platform that enables agile application development for data-intensive workloads, allowing our customers to deploy the right workload, with access to the right data, on the right infrastructure, at the right time. The result is dramatically faster deployments, down from months to minutes, with the agility and flexibility that analysts and data science teams need to innovate faster and get more value from their data.

What the QATS certification means

The Quality Assured Testing Suite (QATS) program is Cloudera’s highest certification level, with validation and thorough testing for a comprehensive suite of use cases to ensure high performance under rigorous loads.

For this testing, CDH version 6.2 was deployed unmodified on the BlueData EPIC software platform, using hardened Docker containers with persistent storage. Then the comprehensive set of QATS certification tests was executed to validate all features and functions for the full breadth of CDH services, including HDFS, YARN, Hive, Spark, and more, with Kerberos security, Sentry, Erasure Coding, and High Availability enabled.

BlueData EPIC was validated against nearly 5300 tests across the full range of CDH services, covering an all-encompassing set of use cases at high performance levels under rigorous loads. BlueData successfully passed all the core service tests along with all DTAP test cases.

As a result, BlueData has earned the Cloudera QATS certification badge for CDH running on containers. This means:

  • Cloudera now advocates and fully endorses BlueData’s use of containers to simplify and accelerate the deployment of CDH in production.
  • The certification ensures CDH 6.2 is fully supported when running in hardened Docker containers provisioned by the BlueData EPIC platform today and into the future.
  • Joint customers immediately get more value from their data lakes with a self-service analytics experience for their Enterprise Data Cloud, with compute/storage separation and deployment flexibility for on-premises, hybrid, and multi-cloud environments.

Here is a summary of the versions of HDFS (CDH and HDP) that have been through the QATS certification with BlueData:
