×

Apache Flink Community China

5043 Reputation

Apache Flink Community China

Follow
Activities(50) Posts(98) Series(4) Areas of Expertise Following Followers
Areas of Expertise

Following (0)

See All

Followers (19)

See All

Flink CDC Series – Part 2: Flink MongoDB CDC Production Practices in XTransfer

Part 2 of this 5-part series explains how to realize Flink MongoDB CDC Connector through MongoDB Change Streams features based on Flink CDC.

Flink CDC Series – Part 1: How Flink CDC Simplifies Real-Time Data Ingestion

Part 1 of this 5-part series explains how to use Flink CDC to simplify the entry of real-time data into the database.

Exploration of Advanced Functions in Pravega Flink Connector Table API

This article is compiled from the topic "Exploration of Advanced Functions in Pravega Flink Connector Table API," shared by Zhou Yumin in Flink Forward Asia 2021.

Packaging Issues in Datastream Development

This article mainly explains which dependencies need to be introduced and which need to be packaged into the job JAR during the job development.

More Than Computing: A New Era Led by the Warehouse Architecture of Apache Flink

Mowen discusses the future of Apache Flink regarding its core capabilities of stream computing and improving the processing standards of the entire industry.

Application of Alink and Tensorflow on Flink in JD

This article is compiled from the presentation of JD search and recommendation algorithm engineers Zhang Ying and Liu Lu at Flink Forward Asia 2021.

Flink Remote Shuffle Open-Source: Shuffle Service for Cloud-Native and Unified Batch and Stream Processing

This article introduces the research and development background and the design and use of Flink Remote Shuffle.

Streaming ETL for MySQL and Postgres with Flink CDC

This tutorial explains how to quickly build streaming ETL for MySQL and Postgres with Flink CDC.

How We Improved Scheduler Performance for Large-Scale Jobs

This article discusses scheduler performance improvements for large-scale jobs in Flink 1.13 and 1.14.

Flink Practices in iQiyi's Advertising Business

This article explains thoroughly how iQiyi (a Chinese online video platform) utilizes Apache Flink.

A Demo of the Scenario Solution Based on Realtime Compute for Apache Flink

The article mainly introduces two applications of real-time big data based on Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 2

Part 2 of this 2-part series will give you insight into some core design considerations and implementation details of the sort-based blocking shuffle in Flink.

Sort-Based Blocking Shuffle Implementation in Flink – Part 1

Part 1 of this 2-part series will introduce the sort-based blocking shuffle, present benchmark results, and provide guidelines on how to use this new feature.

Kwai Builds Real-Time Data Warehouse Scenario-Based Practice on Flink

This article introduces the real-time data warehouse architecture built by Kwai based on Flink and offers solutions to some difficult problems.

Jingdong: Flink SQL Optimization Practice

This article focuses on the optimization measures of Jingdong in Flink SQL tasks, focusing on the aspects of shuffle, join mode selection, object reuse, and UDF reuse.

Zeppelin Notebook: An Important Tool for PyFlink Development Environment

This article introduces a PyFlink development environment tool that can help users solve various problems.

Use Flink Hudi to Build a Streaming Data Lake

This article introduces the optimization and evolution of Flink Hudi's original mini-batch-based incremental computing model through stream computing.

Flink Course Series (8): Detailed Interpretation of Flink Connector

This article gives a detailed interpretation of Flink Connector from the four aspects: connectors, Source API, Sink API, and the future development of collectors.

Flink Course Series (7): Flink Ecosystems

This article describes how Flink SQL connects to external systems and introduces commonly used Flink SQL Connectors.

Flink Course Series (6): A Quick Start for Using PyFlink

This article introduces the objectives and the development of the PyFlink project as well as its current core features.

Latest Comments

5444248861672821 Commented on Streaming ETL for MySQL and Postgres with Flink CDC

Really Nice post! I have a question regarding the elasticsearch sql connector part, it seems like they don't have SSL options (like ca.crt file path...) in current connector, does anybody have any idea how to connect to ES as a sink with ssl?

Arman Ali Commented on Flink: How to Optimize SQL Performance Using Multiple-input Operators

Great!