×

Apache Flink Community

6880 Reputation

Apache Flink Community

Follow
Activities(50) Posts(131) Series(5) Areas of Expertise Following Followers
Areas of Expertise

Following (0)

See All

Followers (40)

See All

Understanding Stream Processing: Real-Time Data Analysis and Use Cases

Learn about stream processing, its applications, challenges, and Alibaba Cloud's Realtime Compute for Apache Flink solution for real-time data analysis.

What is Change Data Capture (CDC)?

Change Data Capture (CDC) detects and captures data changes as they occur in source systems, such as databases or applications.

Complex Event Processing (CEP): A Comprehensive Guide

Discover the power of Complex Event Processing (CEP) in deciphering real-time cause-and-effect relationships from diverse data streams.

What is Apache Flink ?

Learn about Apache Flink, a distributed data processing engine for real-time analytics. Explore its features, use cases, and comparisons with other frameworks like Kafka and Spark.

What is Batch Processing ?

Batch processing is a method of handling data where transactions are collected over a period and processed together as a group, or batch.

Interpretation of Gemini: an Enterprise-level State Storage Engine of Alibaba Cloud Realtime Compute for Apache Flink

This article gives a deep interpretation on Gemini, an enterprise-level state storage engine of Alibaba Cloud Realtime Compute for Apache Flink.

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

The Next Generation of Apache Flink

This article discusses the main technical directions and plans of the Apache Flink community for the coming year, and the preparations for the Flink 2.

One-Click Database Synchronization from MongoDB to Paimon Using Flink CDC

This article explores the process of achieving one-click database synchronization from MongoDB to Paimon using Flink CDC.

Announcement of the Release of Apache Flink 1.18

The Apache Flink PMC is pleased to announce the release of Apache Flink 1.18.0. As usual, we are looking at a packed release with a wide variety of improvements and new features.

All You Need to Know About PyFlink

This article discusses the structure of a PyFlink job, operational mechanisms, performance optimization strategies, and future projections for PyFlink.

Performance Analysis and Tuning Guides for Hybrid Shuffle Mode

This article introduces how to optimize the performance of Hybrid Shuffle Mode with performance analysis and tuning guides.

How to Write Simple and Efficient Flink SQL

This article is compiled from Xiaolin He’s presentation at the 2022 Flink Forward Asia (FFA) Conference, discussing Flink SQL insight, best practices, and future works.

The Thinking and Design of a Quasi-Real-Time Data Warehouse with Stream and Batch Integration

This article explores the practice of stream-batch integrated Flink SQL based on data lakes and explores the expression consistency, result consistenc...

Generic Log-based Incremental Checkpoint - Performance Evaluation & Analytics

This blog post aims to provide a comprehensive analysis of GIC's advantages and disadvantages by conducting thorough experiments and analysis.

Apache Flink Table Store 0.3.0 Release Announcement

The Apache Flink community has released version 0.3.0 of the Flink Table Store, which includes many new features and improvements.

Announcement of the Release of Apache Flink 1.17

Apache Flink, a leading stream processing standard, has released version 1.17.0, which includes new features and improvements.

Flink Shuffle 3.0: Vision, Roadmap, and Progress

This article was compiled from a speech by Song Xintong (Wuzang) (an Alibaba Cloud Senior Technical Expert) during FFA 2022, discussing Flink Shuffle 3.

Flink 1.16: How Does Hive SQL Migrate to Flink SQL?

This article was compiled from a speech from the Apache Flink Meetup, discussing the release of Flink 1.16.

Streaming Data Warehouse Storage: Requirements and Architecture

This article discusses the requirements and architecture of streaming data warehouse storage.

Latest Comments

Dikky Ryan Pratama Commented on The Thinking and Design of a Quasi-Real-Time Data Warehouse with Stream and Batch Integration

Awesome!

5444248861672821 Commented on Streaming ETL for MySQL and Postgres with Flink CDC

Really Nice post! I have a question regarding the elasticsearch sql connector part, it seems like they don't have SSL options (like ca.crt file path...) in current connector, does anybody have any idea how to connect to ES as a sink with ssl?

Arman Ali Commented on Flink: How to Optimize SQL Performance Using Multiple-input Operators

Great!