×
Apache flink

Flink CDC Series – Part 2: Flink MongoDB CDC Production Practices in XTransfer

Part 2 of this 5-part series explains how to realize Flink MongoDB CDC Connector through MongoDB Change Streams features based on Flink CDC.

Flink CDC Series – Part 1: How Flink CDC Simplifies Real-Time Data Ingestion

Part 1 of this 5-part series explains how to use Flink CDC to simplify the entry of real-time data into the database.

Packaging Issues in Datastream Development

This article mainly explains which dependencies need to be introduced and which need to be packaged into the job JAR during the job development.

How to Build a Cloud-Native Open-Source Big Data Platform | The Application Practice of Weimiao

This article shares the application practice of Weimiao based on the big data ecosystem of Alibaba Cloud.

Streaming ETL for MySQL and Postgres with Flink CDC

This tutorial explains how to quickly build streaming ETL for MySQL and Postgres with Flink CDC.

How We Improved Scheduler Performance for Large-Scale Jobs

This article discusses scheduler performance improvements for large-scale jobs in Flink 1.13 and 1.14.

A Demo of the Scenario Solution Based on Realtime Compute for Apache Flink

The article mainly introduces two applications of real-time big data based on Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 2

Part 2 of this 2-part series will give you insight into some core design considerations and implementation details of the sort-based blocking shuffle in Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 1

Part 1 of this 2-part series will introduce the sort-based blocking shuffle, present benchmark results, and provide guidelines on how to use this new feature.

Four Billion Records per Second! What is Behind Alibaba Double 11 - Flink Stream-Batch Unification Practice during Double 11 for the Very First Time

This article analyzes the practice of stream and batch unification for big data processing within Alibaba's core business scenarios.

Evolution of the Real-time Data Warehouses of the Alibaba Search and Recommendation Data Platform

This article shares the results of explorations into real-time data warehouses focusing on the evolution and best practices for data warehouses based on Apache Flink and Hologres.

Flink Course Series (8): Detailed Interpretation of Flink Connector

This article gives a detailed interpretation of Flink Connector from the four aspects: connectors, Source API, Sink API, and the future development of collectors.

Flink Course Series (7): Flink Ecosystems

This article describes how Flink SQL connects to external systems and introduces commonly used Flink SQL Connectors.

Flink Course Series (6): A Quick Start for Using PyFlink

This article introduces the objectives and the development of the PyFlink project as well as its current core features.

Flink Course Series (5): Introduction and Practice of Flink SQL Table

This article mainly introduces the background, concepts, and features of the Flink SQL and Table API.

Flink Course Series (4): Fault Tolerance in Flink

This article mainly introduces Flink fault tolerance mechanism principles along with stateful stream computing, global consistency snapshots, and Flink state management.

Flink Course Series (3): Flink Runtime Architecture

This article focuses on the underlying Flink Runtime Architecture with four parts, including runtime overview, Jobmaster, TaskExecutor, and ResourceManager.

Flink Course Series (2): Stream Processing with Apache Flink

This article describes stream processing with Apache Flink from three different aspects.

Flink Course Series (1): A General Introduction to Apache Flink

This article describes the basic concepts, importance, development, and current applications of Apache Flink.

Application of Real-Time Compute for Apache Flink in Weibo

This article introduces the application of Realtime Compute for Apache Flink with Weibo.