Data Analytics

Analytics | News, how-tos, features, reviews, and videos

sea lion

cloud saas

AWS piles on the machine learning services

Hosted TensorFlow, pay-as-you-go inference serving, reinforcement learning, and automatic data labeling come to the Amazon cloud

agile man floating slip mishap mistake by gladson xavier

Machine learning lessons: 5 companies share their mistakes

Machine learning can also go terribly wrong, making you regret that enthusiastic rush to adopt

sparkler / firework / sparks / celebration / hands

Tutorial: Spark application architecture and clusters

Learn how Spark components work together and how Spark applications run on standalone and YARN clusters

abstract data analytics

How to use Knime for data science

Free, open-source Knime allows you to visually assemble data processing “nodes” into machine learning, deep learning, and other analytics workflows

brain-shaped thought bubble showing flow of alphabetic characters

7 cloud services to ease machine learning

Build, train, tune, and deploy machine learning and deep learning models in these end-to-end machine learning clouds

big data messaging system / information architecture / mosaic infrastructure

Built for realtime: Big data messaging with Apache Kafka, Part 1

Apache Kafka scales horizontally and offers much higher throughput than some traditional messaging systems. Get started with installation, then build your first Kafka messaging system

one yellow arrow moving opposite a stream of white arrows

Real-time data processing with data streaming: new tools for a new era

Real-time data streaming is still early in its adoption, but over the next few years organizations with successful rollouts will gain a competitive advantage

business intelligence data visualization tools analytics

Machine learning: How to create a recommendation engine

In this excerpt from the book “Pragmatic AI,” learn how to code recommendation engines based on machine learning in AWS, Azure, and Google Cloud

shortcut through a maze

Why there are no shortcuts to machine learning

As long as companies understand that good data science takes time in an enterprise, and give these people room to learn and grow, they won’t need shortcuts

jw calculate

Julia tutorial: Get started with the Julia language

Want the convenience of a dynamic language and the performance of a compiled statically typed language? Try Julia

big data elephant analytics risk predictions vulnerable

3 big data platforms look beyond Hadoop

Learn how the Cloudera, Hortonworks, and MapR data platforms are evolving to meet the demands for real-time analytics and machine learning

2 hadoop and spark

What’s new in Apache Spark? Low-latency streaming and Kubernetes

Continuous processing and native Kubernetes support in Apache Spark 2.3 spell the end for micro-batching and Hadoop

timeline database

How to choose the right NoSQL database

NoSQL databases vary in architecture and function, so you need to pick the type that is best for the desired task

toy rocket ship

Cython tutorial: How to speed up Python

How to use Cython and its Python-to-C compiler to give your Python applications a rocket boost

blockchain network machine learning neural network

TensorFlow review: The best deep learning library gets better

At version r1.5, Google's open source machine learning and neural network library is more capable, more mature, and easier to learn and use

pixelated clouds reflecting on building windows

AWS cloud services guide: The right tools for the job

Moving to the cloud makes more sense than ever, if you know why you're doing it and how to make the most of your platform of choice. Find out the most common reasons for cloud migration, and which AWS components you'll need to succeed...

overflowing trash can with balled up paper

No, you shouldn’t keep all that data forever

Most of your old data is useless trash. So throw it away, rather than spend all the time and money hoping AI will figure something out about it

confetti 136304738

What’s new in TensorFlow machine learning

Google's TensorFlow 1.4 machine learning library adds the contributed Dataset API for working with data sources, but watch out for breakage caused by the update

Load More