Data Analytics

Analytics | News, how-tos, features, reviews, and videos

jw calculate
big data elephant analytics risk predictions vulnerable

2 hadoop and spark

What’s new in Apache Spark? Low-latency streaming and Kubernetes

Continuous processing and native Kubernetes support in Apache Spark 2.3 spell the end for micro-batching and Hadoop

timeline database

How to choose the right NoSQL database

NoSQL databases vary in architecture and function, so you need to pick the type that is best for the desired task

toy rocket ship

Cython tutorial: How to speed up Python

How to use Cython and its Python-to-C compiler to give your Python applications a rocket boost

blockchain network machine learning neural network

TensorFlow review: The best deep learning library gets better

At version r1.5, Google's open source machine learning and neural network library is more capable, more mature, and easier to learn and use

pixelated clouds reflecting on building windows

AWS cloud services guide: The right tools for the job

Moving to the cloud makes more sense than ever, if you know why you're doing it and how to make the most of your platform of choice. Find out the most common reasons for cloud migration, and which AWS components you'll need to succeed...

overflowing trash can with balled up paper

No, you shouldn’t keep all that data forever

Most of your old data is useless trash. So throw it away, rather than spend all the time and money hoping AI will figure something out about it

confetti 136304738

What’s new in TensorFlow machine learning

Google's TensorFlow 1.4 machine learning library adds the contributed Dataset API for working with data sources, but watch out for breakage caused by the update

holiday lights neurons network stream
external url

What is Apache Spark? The big data analytics platform explained

Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning

Big data analytics hand touchscreen user man
external url

What is big data? Everything you need to know

Analyzing lots of data is only part pf what makes big data different from previous data analytics. Learn what the other three aspects are

machine learning

What is machine learning? Software derived from data

Building systems that learn from data is a better way to solve complex problems, given enough meaningful data to learn from

Real-world devops failures -- and how to avoid them

How to avoid big data analytics failures

Follow these six best practices to blow past the competition, generate new revenue sources, and better serve customers

artificial intelligence / machine learning

Machine learning comes to your browser via JavaScript

A new JavaScript library runs Google's TensorFlow right in the browser with GPU acceleration—a novel way to bring machine learning to the masses

storm clouds dark

Data is eating the software that is eating the world

The data-driven machine learning algorithms that power AI will not only upend programming, but lower the barriers to AI itself

Sparks

Apache Spark 2.2 gets streaming, R language boosts

The latest additions to Apache's all-in-one in-memory processing framework simplify stream processing and flesh out support for the R language

What is deep learning really?

Easier, faster: The next steps for deep learning

Rapidly advancing software frameworks, dedicated silicon, Spark integrations, and higher level APIs aim to put deep learning within reach

Pipeline

Data in, intelligence out: Machine learning pipelines demystified

Data plus algorithms equals machine learning, but how does that all unfold? Let’s lift the lid on the way those pieces fit together, beginning to end

roses flowers bouquets market

Aggregating with Apache Spark

Get an overview of threadless, multithreaded, and distributed aggregation using the Streams API, Java threads, and MapReduce, then see for yourself what Spark's cluster computing engine brings to the equation

bangkok traffic

MIT-Stanford project uses LLVM to break big data bottlenecks

Written in Rust, Weld can provide orders-of-magnitude speedups to Spark and TensorFlow

Load More