×
Real-Time Analytics

Flink 2.1 SQL: Unlocking Real-time Data & AI Integration for Scalable Stream Processing

This article is adapted from Lincoln Lee’s presentation in the Real-Time AI track at Flink Forward Asia Singapore 2025.

Apache Flink FLIP-12: Asynchronous I/O

Follow the Apache Flink® Community for making Flink's External System Data Processing More Efficient.

Understanding Fluss Partial Update

Traditional streaming data pipelines often need to join many tables or streams on a primary key to create a wide view.

vivo's Lakehouse Integration Practice Based on Paimon

This article is compiled from the presentation by Xu Yu, an internet big data expert at vivo and Apache Paimon Committer, during the Flink Forward Asia 2024 Streaming Lakehouse session (Part One).

Apache Flink FLIP-11: Simplified Stream Aggregation

Follow the Apache Flink® Community for making Stream Aggregation Simpler in Table API.

FLIP-10: Unified Checkpoints and Savepoints

Follow the Apache Flink® Community for making Data Backup Simpler in Flink.

Apache Flink FLIP-8: Scalable Non-Partitioned State

Follow the Apache Flink® Community for making Non-Partitioned State Scalable in Flink.

Streaming Data Integration from MySQL to Kafka using Flink CDC YAML

This tutorial demonstrates how to build a real-time MySQL-to-Kafka data pipeline using Flink CDC YAML without coding.

Flink SQL 101: Embrace Unified Stream and Batch Processing

This article introduces Flink SQL, a unified stream-batch processing engine, focusing on key concepts like Stream-Table Duality, event time/watermarks.

Apache Flink FLIP-7: Visualizing Monitoring Metrics in Web UI

Follow the Apache Flink® Community for making Flink Metrics More Accessible Through Web UI Visualization.

Apache Flink FLIP-4: Enhanced Window Evictor for Flexible Data Eviction Before/After Processing

This is Technical Insights Series by Perry Ma | Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.

Compare Flink SQL and DataStream API: Comprehensive Guide for New Developers

Apache Flink is a stream processing framework with two main interfaces: Flink SQL, which uses SQL for batch and streaming data, and the DataStream API...

Best Practices for Flink CDC YAML in Realtime Compute for Apache Flink

This article is authored by the data pipeline team of Alibaba Cloud's open-source big data platforms.

Flash: A Next-gen Vectorized Stream Processing Engine Compatible with Apache Flink

This article is based on a presentation by Mr. Wang Feng (nickname: Mowen), senior director at Alibaba Cloud and head of the open source big data de.

Apache Flink: Powering Real-Time Personalization in Retail and E-Commerce

Article emphasizes real-time data processing's importance in retail/e-commerce for tailored recommendations, highlighting Apache Flink's pivotal role.

Real-Time Analytics with Alibaba Cloud: High-Performance Data Warehousing

In this demo, we'll guide you through creating a high-performance, cost-effective data warehouse solution for real-time analytics.

Introducing Fluss: Streaming Storage for Real-Time Analytics

Today, we are excited to introduce Fluss, a cutting-edge streaming storage system designed to power real-time analytics.

Data Lake for Stream Computing: The Evolution of Apache Paimon

Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.

What is Change Data Capture (CDC)?

Change Data Capture (CDC) detects and captures data changes as they occur in source systems, such as databases or applications.

The Next Step of Flink CDC

This article is based on a keynote speech given by Jark Wu, head of Flink SQL and Flink CDC at Alibaba Cloud, during Flink Forward Asia 2023.