×
flink

How We Improved Scheduler Performance for Large-Scale Jobs

This article discusses scheduler performance improvements for large-scale jobs in Flink 1.13 and 1.14.

Flink Practices in iQiyi's Advertising Business

This article explains thoroughly how iQiyi (a Chinese online video platform) utilizes Apache Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 2

Part 2 of this 2-part series will give you insight into some core design considerations and implementation details of the sort-based blocking shuffle in Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 1

Part 1 of this 2-part series will introduce the sort-based blocking shuffle, present benchmark results, and provide guidelines on how to use this new feature.

A Few Tips on Large-Scale Real-Time Data Warehouse Construction

This article offers helpful tips for large-scale real-time data warehouse construction.

Crowd Selection and Data Service Practices Based on MaxCompute & Hologres

This article describes how to use MaxCompute to add tags to a large number of people and carry out analysis and modeling through Hologres.

The Practice of Real-Time Data Processing Based on MaxCompute

This article explains how to write real-time streaming data based on BinLog, Flink, and Spark Streaming into MaxCompute.

Kwai Builds Real-Time Data Warehouse Scenario-Based Practice on Flink

This article introduces the real-time data warehouse architecture built by Kwai based on Flink and offers solutions to some difficult problems.

Jingdong: Flink SQL Optimization Practice

This article focuses on the optimization measures of Jingdong in Flink SQL tasks, focusing on the aspects of shuffle, join mode selection, object reuse, and UDF reuse.

Best Practices for Flink on Zeppelin Stream Computing Processing

This article is an overview of the best practices for Flink on Zeppelin stream computing processing taken from a recent lecture.

Zeppelin Notebook: An Important Tool for PyFlink Development Environment

This article introduces a PyFlink development environment tool that can help users solve various problems.

Alibaba Cloud BigData Pipeline 구축하기

이 블로그는 빅데이터 플랫폼 도입을 고려 중이고, 어떤 조합으로 시스템을 구축할지 고민이신 분들을 위해 알리바바가 제공하는 모든 서비스들을 나열해 놓고, 각 서비스들의 적용 가능한 시나리오와 서비스 도입 시 고려해야 할 점등을 설명합니다.

Flink Course Series (7): Flink Ecosystems

This article describes how Flink SQL connects to external systems and introduces commonly used Flink SQL Connectors.

Flink Course Series (5): Introduction and Practice of Flink SQL Table

This article mainly introduces the background, concepts, and features of the Flink SQL and Table API.

Flink Course Series (2): Stream Processing with Apache Flink

This article describes stream processing with Apache Flink from three different aspects.

Flink Course Series (1): A General Introduction to Apache Flink

This article describes the basic concepts, importance, development, and current applications of Apache Flink.

How to Sustain a Growing Platform and Gain Online Users

Learn how Xianyu has been building an e-commerce platform made for sustained growth.

Unveiling Hologres: A Cloud-Native Storage Engine

Abstract: This article introduces the storage engine of Hologres and deeply analyzes its implementation principles and core technical advantages.

Flink: How to Optimize SQL Performance Using Multiple-input Operators

In this article, the author explains how to optimize SQL performance in Apache Flink using multiple-input operators.

How to Analyze CDC Data in Iceberg Data Lake Using Flink

This article discusses the challenges and limitations of various solutions in CDC data analysis and describes how to use Flink and Iceberg to overcome them.