Operation: Stateful. Introducing BlueK8s and Kubernetes Director

The juggernaut that is Kubernetes has been underway and gaining momentum for some time now. It provides an extensible container orchestration framework for automating the deployment, scaling, and management of any containerized application. It has a rich ecosystem of plugins for handling everything from storage to security. And while it was originally designed for running […]

Containerization for Big Data and Machine Learning

Today we announced the latest release of the BlueData EPIC software platform, with several exciting new innovations for the containerization of Big Data and machine learning workloads. This new ‘summer release’ for BlueData EPIC represents dozens of new features developed by our software engineering team over the past few months. In large part, this new functionality […]

Hortonworks Certification: HDP on Docker Containers with BlueData

Running unmodified open source distributed computing frameworks on Docker containers has long been one of BlueData’s core value propositions. With that in mind, Hortonworks was one of BlueData’s first partners in the Apache Hadoop and Big Data ecosystem; the BlueData EPIC software platform was first certified for the Hortonworks Data Platform (HDP) back in 2014. […]

Deploying Machine Learning Pipelines for AI Use Cases

We all know that Artificial Intelligence (AI) is here to stay. We experience AI everywhere and enjoy its benefits without even realizing it. From streaming video services like Netflix, which learn our viewing behaviors and patterns so we spend our valuable time watching the shows we like best; to digital assistants like Amazon’s Alexa, which […]

Analytics and Machine Learning with SAS Viya on Docker Containers

As the industry leader in business analytics software, SAS brings a formidable toolset to address a wide range of use cases (including churn prediction, customer segmentation, market basket analysis, and more) – enabling enterprises to extract business value from large volumes of data. IDC research shows SAS with more than 30 percent of the market […]

Hadoop 3.0 and the Decoupling of Hadoop Compute from Storage

The traditional Hadoop architecture was founded upon the belief that the only way to get good performance with large-scale distributed data processing was to bring the compute to the data. And in the early part of this century, that was true. The network infrastructure in the typical enterprise data center of that time was not […]

Big Data and Container Orchestration with Kubernetes (K8s)

Here at BlueData, we’ve been leading the charge on deploying and running Big Data applications like Hadoop and Spark on containers: we first announced our support for Docker more than two years ago. At that time, there was no clear choice for container orchestration. More importantly, the existing tools for container orchestration simply didn’t meet […]

Deep Learning with TensorFlow, GPUs, and Docker Containers

I work with a lot of data science teams at our enterprise customers, and in the past several months I’ve seen an increased adoption of machine learning and deep learning frameworks for a wide range of applications. As with other use cases in Big Data analytics and data science, these data science teams want to […]

Deep Learning with BigDL and Apache Spark on Docker

The field of machine learning – and deep learning in particular – has made significant progress recently, and use cases for deep learning are becoming more common in the enterprise. We’ve seen more of our customers adopt machine learning and deep learning frameworks for use cases like natural language processing with free-text data analysis, image […]