Hadoop

Hadoop | News, how-tos, features, reviews, and videos

A better mousetrap: A JSON data warehouse takes on Hadoop

Sure, a NoSQL or JSON data warehouse sounds faddish, but SonarW is a better solution for many.

LinkedIn fills another SQL-on-Hadoop niche

LinkedIn's open source, home-brew OLAP project is a new way for Hadoop users (and others) to query both real-time and historical data.

Spark and Storm face new competition for real-time Hadoop processing

DataTorrent is releasing its real-time data processing engine for Hadoop and beyond as the open source Project Apex.

big data is dead

Big data is dead -- long live big data

Soon, we'll see 'prepacked' applications that incorporate the distributed processing, machine learning, and analytics of today's overhyped, custom-made solutions.

spark

Spark, big data's brightest star, needs to grow up

Spark is the hottest project in big data -- but Databricks, the company behind it, needs to ensure its implementation has a plausible path to maturity.

What you need to know about Hadoop right now

Andrew updates his cheat sheet for developers navigating the ever-expanding Hadoop ecosystem. Storm and Spark still top the list, but don't miss new additions like Phoenix, Kafka, and Falcon.

Google hitches cloud data analysis to Java SDK

Google Cloud Dataflow is based on FlumeJava but can be extended to other languages and environments.

Hands on: Build a Storm analytics solution

Storm lets you create real-time analytics for every conceivable need. Here's a tasty example using Twitter data and source code hosted on GitHub.

MongoDB gets its first native analytics tool

A new open source analytics tool, SlamData adds extensions to SQL that enable analysts to query MongoDB directly, without conversion to an RDBMS.

11 open source tools for making the most of machine learning

11 open source tools to make the most of machine learning

Tap the predictive power of machine learning with these diverse, easy-to-implement libraries and frameworks

Splice Machine 1.0 offers speedy, scalable SQL on Hadoop

Splice Machine update offers cross-integration with Hadoop apps and supports migration paths from other databases.

Hadoop's growth opens up demand for data migration tools

As more companies adopt Hadoop, they need help getting their data onto the platform -- and a new field is born.

Big data analytics hand touchscreen user man

8 big trends in big data analytics

Big data technologies and practices are moving quickly. Here's what you need to know to stay ahead of the game.

An elephant holds an umbrella for a dog as they sit on a bench in the rain.

12 things I hate about Hadoop

Hadoop is a wonderful creation, but it's evolving quickly and it can exhibit flaws. Andrew Oliver poses a dozen downers to consider before adopting Hadoop.

Twitter's Hadoop project gets Apache's blessing

Storm, an open source, real-time computation framework adopted by Twitter, picks up the Apache Foundation's full backing and support.

Hadoop meets Google Docs: Analytics made easy

Adatao's analytics suite puts a Web-friendly front end on Hadoop data and eases access to self-serve reporting.

Tip

5 ways to add machine learning to Java, JavaScript, and more

The pressure is on to harness machine learning for more responsive enterprise apps. So find the open source tool that works best for your programming style and architecture.

Team of rivals: Hortonworks, Pivotal join up for Hadoop project

Competitors band together to contribute to Apache Ambari, a project whose popularity is on the rise as more organizations look for ways to extend the Hadoop management framework.

big data numbers

How to create a data lake for fun and profit

The idea of data lakes has been fermenting, and now real companies are using them for real analysis. Here's why you might want one -- and how to create it.

DataStax Enterprise 4.5 turbocharges speed and security

Leading column-family NoSQL database gets an enterprise-grade makeover, with blazing in-memory speed, Hadoop integration, and granular security controls.

Load More