Hadoop 3.0 and the Decoupling of Hadoop Compute from Storage

The traditional Hadoop architecture was founded upon the belief that the only way to get good performance with large-scale distributed data processing was to bring the compute to the data. And in the early part of this century, that was true. The network infrastructure in the typical enterprise data center of that time was not […]

HDFS Upgrades Are Painful. But They Don’t Have to Be.

It’s hard enough to gather all the data that an enterprise needs for a Hadoop deployment; it shouldn’t be hard to manage it as well. But if you follow the traditional Hadoop “best practices”, it is. In particular, upgrades to the Hadoop Distributed File System (HDFS) are excruciatingly painful. By way of background, each version […]