Soon, we'll see 'prepacked' applications that incorporate the distributed processing, machine learning, and analytics of today's overhyped, custom-made solutions.
Spark is the hottest project in big data -- but Databricks, the company behind it, needs to ensure its implementation has a plausible path to maturity.
Andrew updates his cheat sheet for developers navigating the ever-expanding Hadoop ecosystem. Storm and Spark still top the list, but don't miss new additions like Phoenix, Kafka, and Falcon.
Hadoop is a wonderful creation, but it's evolving quickly and it can exhibit flaws. Andrew Oliver poses a dozen downers to consider before adopting Hadoop.
The pressure is on to harness machine learning for more responsive enterprise apps. So find the open source tool that works best for your programming style and architecture.
Competitors band together to contribute to Apache Ambari, a project whose popularity is on the rise as more organizations look for ways to extend the Hadoop management framework.
The idea of data lakes has been fermenting, and now real companies are using them for real analysis. Here's why you might want one -- and how to create it.
Leading column-family NoSQL database gets an enterprise-grade makeover, with blazing in-memory speed, Hadoop integration, and granular security controls.