Shuffle

Learning about Distributed Systems - Part 17: Shuffle

Part 17 of this series introduces several possible Shuffle methods and their adoption in MapReduce and Spark.

Alibaba Cloud_Academy July 24, 2023 4,240

Sort-Based Blocking Shuffle Implementation in Flink – Part 2

Part 2 of this 2-part series will give you insight into some core design considerations and implementation details of the sort-based blocking shuffle in Flink.

Apache Flink Community December 20, 2021 4,077

Sort-Based Blocking Shuffle Implementation in Flink – Part 1

Part 1 of this 2-part series will introduce the sort-based blocking shuffle, present benchmark results, and provide guidelines on how to use this new feature.

Apache Flink Community December 20, 2021 5,020

Revealing DAG – MaxCompute Execution Engine Core Technology

This article explains the core ideas and design of DAG.

Alibaba Cloud MaxCompute November 19, 2021 2,897

Jingdong: Flink SQL Optimization Practice

This article covers Jingdong's optimization measures for Flink SQL tasks, focusing on shuffle, join mode selection, object reuse, and UDF reuse.

Apache Flink Community November 12, 2021 4,078
