Data Storage & Management

Data Storage & Management news, information, and how-to advice

business storm 157689723
spark

hadoop

What you need to know about Hadoop right now

Andrew updates his cheat sheet for developers navigating the ever-expanding Hadoop ecosystem. Storm and Spark still top the list, but don't miss new additions like Phoenix, Kafka, and Falcon.

First look: MongoDB 3.0, for mature audiences

The new MongoDB features document-level locking, better write performance, big memory support, and more. At last, MongoDB is all grown up.

google cloud

Google hitches cloud data analysis to Java SDK

Google Cloud Dataflow is based on FlumeJava but can be extended to other languages and environments.

Hands on: Build a Storm analytics solution

Storm lets you create real-time analytics for every conceivable need. Here's a tasty example using Twitter data and source code hosted on GitHub.

Comparing JVM libraries for MongoDB

Get a quick look at how four leading Java-based libraries for MongoDB handle a common REST-services use case.

big data

MongoDB gets its first native analytics tool

A new open source analytics tool, SlamData adds extensions to SQL that enable analysts to query MongoDB directly, without conversion to an RDBMS.

11 open source tools for making the most of machine learning

11 open source tools to make the most of machine learning

Tap the predictive power of machine learning with these diverse, easy-to-implement libraries and frameworks

storm or spark

Storm or Spark: Choose your real-time weapon

Real-time business intelligence is going mainstream, thanks in part to the Storm and Spark open source projects. Here's how to choose between them.

new tech forum

Beyond Storm for streaming data applications

Why a distributed, in-memory, SQL-relational database beats Apache Storm for apps that make real-time decisions on streaming data.

Splice Machine 1.0 offers speedy, scalable SQL on Hadoop

Splice Machine update offers cross-integration with Hadoop apps and supports migration paths from other databases.

In-memory computing brings real-time intelligence to operational systems

In-memory data grids enable instant responses to financial transactions, shopping cart contents, monitoring streams, and other operational data.

Big data sparks interest in statistical programming languages

Statistical languages offer a great fit for big data, but due to their specific use cases, they're unlikely to catch to up to generalized stalwarts like Java and C.

hadoop thinkstock

Hadoop's growth opens up demand for data migration tools

As more companies adopt Hadoop, they need help getting their data onto the platform -- and a new field is born.

elephant thinkstock

12 things I hate about Hadoop

Hadoop is a wonderful creation, but it's evolving quickly and it can exhibit flaws. Andrew Oliver poses a dozen downers to consider before adopting Hadoop.

Why Redis beats Memcached for caching

Redis is the newer and more flexible datastore overall, but Memcached wins for efficiency in specific infrastructure and programming scenarios.

Bossie Awards 2014: The best open source application development tools

InfoWorld's editors and contributors pick the essential tools for software developers in 2014. Find out which languages, frameworks, and libraries deserve top place in your open source toolbox.

Couchbase readies next-generation NoSQL storage engine

Couchbase 3.0 promises faster performance, plus sneak peeks at the new ForestDB database engine and N1QL query language.

hadoop thinkstock

Twitter's Hadoop project gets Apache's blessing

Storm, an open source, real-time computation framework adopted by Twitter, picks up the Apache Foundation's full backing and support.

Load More