×
Real-time Data Warehouse

The Delta Join in Apache Flink: Architectural Decoupling for Hyper-Scale Stream Processing

Discover how Delta Join in Apache Flink revolutionizes stream processing, reducing state and costs while boosting performance and stability.

Apache Flink FLIP-15: Smart Stream Iterations & Optimization

Learn Apache Flink FLIP-15 smart iterations with StreamScope and intelligent termination. Master backpressure optimization, deadlock prevention, and advanced loop processing for real-time analytics.

Real-Time Lakehouse Solutions: Apache Flink & Apache Paimon Integration

Alibaba Cloud presents key optimizations in Flink-Paimon real-time lakehouse architecture, including the Variant data type for efficient semi-structur...

Apache Flink FLIP-13: Side Outputs for Multi-Stream Processing

Master Apache Flink FLIP-13 side outputs for flexible data processing. Handle corrupted data, late arrivals, and multi-stream routing with OutputTag and CollectorWrapper patterns.

Building a Real-Time Advertising Lakehouse: Alibaba Mama's Practice with Flink & Paimon

We will first introduce the business background of Alibaba Mama's advertising platform, then explore the design and evolution of its real-time advertising system and data lake architecture.

Flink SQL 101: Embrace Unified Stream and Batch Processing

Master Flink SQL fundamentals with Stream-Table Duality, event time, and watermarks. Build unified stream-batch processing pipelines for modern data engineering.

vivo's Lakehouse Integration Practice Based on Paimon

Discover vivo's real-world Lakehouse integration using Apache Paimon. Learn architecture design, performance optimization, and unified stream-batch processing.

Flink 2.1 SQL: Unlocking Real-time Data & AI Integration for Scalable Stream Processing

Master Flink 2.1 SQL's AI functions with ML_PREDICT, Delta Join optimizations, and real-time AI integration for scalable stream processing applications.

Fluss: Redefining Streaming Storage for Real-time Data Analytics and AI

Explore Apache Fluss, the revolutionary streaming storage solution bridging traditional systems and lakehouse architectures for real-time data analytics and AI.

FLIP-9: Trigger Language - Apache Flink Rule Definition Guide

description" content="Learn about FLIP-9 proposal for Apache Flink trigger language. Discover why this rule language for Flink triggers was shelved an.

Apache Flink FLIP-12: Complete Guide to Asynchronous I/O for High-Performance Stream Processing

Master Apache Flink FLIP-12 asynchronous I/O for high-performance stream processing. Learn implementation patterns, external system integration, and p...

Apache Flink FLIP-10: Complete Guide to Unified Checkpoints and Savepoints

Learn about Apache Flink FLIP-10 proposal for unified checkpoints and savepoints management. Improve fault tolerance and operational simplicity with configuration examples and best practices.

Apache Flink Broadcast Variable Optimization: FLIP-5's Approach to Reducing Network Overhead

Learn Apache Flink FLIP-5 broadcast variable optimization strategies for reducing network overhead. Discover performance lessons and modern scaling approaches for production stream processing.

Apache Flink Table API Aggregation Made Easy: FLIP-11 Windowing Guide for Developers

Master Apache Flink FLIP-11 stream aggregation with Table API. Learn tumbling, sliding, and session windows for efficient real-time data processing.

Mastering Flink State Scaling: FLIP-8 Non-Partitioned State Management for Distributed Systems

Master Apache Flink FLIP-8 scalable non-partitioned state management. Learn dynamic scaling solutions, OperatorStateStore implementation, and state re...

Understanding Fluss Partial Update

Traditional streaming data pipelines often need to join many tables or streams on a primary key to create a wide view.

Best Practices for Flink CDC YAML in Realtime Compute for Apache Flink

This article is authored by the data pipeline team of Alibaba Cloud's open-source big data platforms.

Build an All-in-one Real-time Data Warehouse (Code-level) Based on AnalyticDB for PostgreSQL

This article introduces the process of building an all-in-one real-time data warehouse using AnalyticDB for PostgreSQL at the code level.

Technical Principle of Hologres Binlog

This article describes an overview of the implementation principles and best practices of Hologres Binlog.

Implementation of Real-Time Data Warehouse Storage and Analysis of Various Technical Architectures

This article discusses several facets of real-time data warehouses, including characteristics, benefits, and building them.