Big Data

Cricket, Wickets and Big Data

Alibaba Cloud Academy has just launched a contest to analyze results from cricket games with our big data tools to increase awareness around cloud intelligence for sports.

How Does the Recommendation System Work on Tmall?

An Alibaba algorithm engineer explains how the Tmall homepage recommendation system works, discussing everything from knowledge graphs to all kinds of neural networks.

Diving into Big Data with DataWorks

In this article, you will learn how to use DataWorks together with MaxCompute for a data processing and analytics workflow.

10 Years of Cloud Intelligence: Ground-Breaking Big Data Performance to Feed Advances in AI

Alibaba Cloud's world-renowned MaxCompute big data platform has supported countless industries, processing more than 600PB of data every day.

Diving into Big Data with DataWorks (Continued)

In this article, we will take a deeper dive into the many features that Alibaba Cloud's DataWorks has to offer.

Diving into Big Data: Hadoop User Experience (Continued)

In this article, we continue with HUE, or Hadoop User Experience, an open-source web interface that makes many operations simpler and easier to complete.

Diving into Big Data: Hadoop User Experience

In this article, we explore HUE, or Hadoop User Experience, an open-source web interface that makes many operations simpler and easier to complete.

Diving into Big Data: EMR Cluster Management

This article explores the various ways that you can manage EMR clusters in Alibaba E-MapReduce.

Diving into Big Data: Visual Stories through Zeppelin

This article explores different ways in which you can present and visualize data through the Zeppelin interface.

Use Apache Arrow to Assist PySpark in Data Processing

This article looks at Apache Arrow, its usage in Spark, and how it can assist PySpark in data processing operations.

Use Relational Cache to Accelerate EMR Spark in Data Analysis

This article looks into what a cache and a relational cache are and how you can use them to accelerate EMR Spark in data analysis operations.

Use EMR Spark Relational Cache to Synchronize Data Across Clusters

This article looks at EMR Spark Relational Cache, how it can be useful in a number of scenarios, and how to use it to synchronize data across two clusters.

Install a Multi-Node Hadoop Cluster on Alibaba Cloud

This tutorial shows how you can set up a multi-node Hadoop cluster on Alibaba Cloud ECS instances with Ubuntu 18.04 installed.

Setup a Single-Node Hadoop Cluster Using Docker

This article shows you how to set up Docker and use it to launch a single-node Hadoop cluster inside a container on Alibaba Cloud.

Beating Lung, Liver, and Cardiovascular Diseases with AI

This article discusses how artificial intelligence can be applied to the field of medical imaging to improve the efficacy of detection and treatment.

MaxCompute One of World's Leading Cloud-Based Data Warehouse

Forrester names Alibaba Cloud MaxCompute as one of the world's leading cloud-based data warehouses in the "Cloud Data Warehouse, Q1 2018" report.

Finding Public Data for Your Machine Learning Pipelines

This article discusses how and where you can find public data for machine learning pipelines that serve a variety of applications.

Fault Tolerance with Application High Availability or Batch Compute

We will talk about how two seemingly opposing ideas, high availability and batch computing, can be integrated into a single solution using Alibaba Cloud's services.

Creating Custom Environments for Batch Services

In this article, we will not explore how to create jobs; rather, we will look at how to customize the underlying infrastructure as required by our software packages.

MaxCompute Wins Science and Technology Award of Zhejiang Province

Alibaba Cloud MaxCompute has recently been awarded the first prize of the Science and Technology Progress Award of Zhejiang Province for its contributions to the big data field.