E-MapReduce

A Comprehensive List of Big Data Processing Tools

This blog discusses the popular tools used in a big data system and shares some basic tips on building a distributed product roadmap.

Data Lake Acceleration in Data Lake Architecture

This article introduces the reasons for choosing data lake acceleration, and shares Alibaba Cloud's practical experience and technical solutions.

Build a Cloud Data Lake Using E-MapReduce

This article is based on the enterprise data lake construction solution using E-MapReduce and customer best practices shared by Ziguan.

So How Did Flink Double Its GitHub Stars in Just One Year?

Read on to see exactly what happened to Flink in 2019, in particular how Alibaba has contributed to Flink.

Architecture Evolution and Application Scenarios of Real-time Warehouses in the Cainiao Supply Chain

In this blog, we'll discuss the evolution of Cainiao's Flink implementation solution and supply chain data in terms of real-time data technology architecture.

OPPO's Use of Flink-based Real-time Data Warehouses

This article covers the evolution of the OPPO real-time data warehouse and development of Flink SQL.

Netflix: Evolving Keystone to an Open Collaborative Real-time ETL Platform

This article briefly introduces Netflix's data platform team and its key product, Keystone.

Architecture Evolution and Practices of the Xiaomi Streaming Platform

This article discusses how Xiaomi leverages Apache Flink to build its streaming platform.

Meituan-Dianping's Use of Flink-based Real-time Data Warehouse Platforms

In this article, Lu Hao of Meituan-Dianping shares the company's practices using the Flink-based real-time data warehouse platform.

Architecture and Practices of Bilibili's Real-time Platform

This article introduces the architecture and practices of Bilibili's Saber real-time computing platform by considering the pain points of real-time computing.

Trillions of Bytes of Data Per Day! Application and Evolution of Apache Flink in Kuaishou

This article introduces the technical evolution of Apache Flink during its application in Kuaishou and Kuaishou's future plans regarding Apache Flink.

Lyft's Large-scale Flink-based Near Real-time Data Analytics Platform

This blog shares how Lyft built a large-scale near real-time data analytics platform based on Apache Flink.

Introduction to EMR DataScience

In this article, AI expert Aohai provides an overview of the DataScience node of E-MapReduce and its components.

How to Connect Tableau to Alibaba Cloud Max Compute

This short post by Eggy discusses how to connect Tableau to Alibaba Cloud Max Compute.

The Secrets Behind the Optimized SQL Performance of EMR Spark

Lin Xuewei, a technical expert, gives an overview of the latest performance and efficiency optimizations that were made to TPC-DS Perf after its third submission.

Spark-TFRecord: Toward Full Support of TFRecord in Spark

In this post, we will introduce Spark-TFRecord, a new solution to enable support for native TensorFlow data format in Spark.

Alibaba Cloud E-MapReduce Sets World Record Again on TPC-DS Benchmark

This year, EMR increased its computing speed to 2.2 times that of last year, breaking the world record again in the big data sector.

Ideas and Methods for System Refactoring

This article looks at some of the misunderstandings and frequently overlooked aspects of refactoring and proposes some best practices.

Reshaping the Java Language on the Cloud

Learn how Alibaba is transforming the Java language.

50 Efficient Code Samples for Java Programming

This article is a list of 50 efficient Java code samples.