×
Real-time Data Warehouse

Apache Flink FLIP-12: Asynchronous I/O

Follow the Apache Flink® Community for making Flink's External System Data Processing More Efficient.

Understanding Fluss Partial Update

Traditional streaming data pipelines often need to join many tables or streams on a primary key to create a wide view.

vivo's Lakehouse Integration Practice Based on Paimon

This article is compiled from the presentation by Xu Yu, an internet big data expert at vivo and Apache Paimon Committer, during the Flink Forward Asia 2024 Streaming Lakehouse session (Part One).

Apache Flink FLIP-11: Simplified Stream Aggregation

Follow the Apache Flink® Community for making Stream Aggregation Simpler in Table API.

FLIP-10: Unified Checkpoints and Savepoints

Follow the Apache Flink® Community for making Data Backup Simpler in Flink.

FLIP-9: Trigger Language

An Attempt at Defining a Rule Language for Flink Triggers

Apache Flink FLIP-8: Scalable Non-Partitioned State

Follow the Apache Flink® Community for making Non-Partitioned State Scalable in Flink.

Flink SQL 101: Embrace Unified Stream and Batch Processing

This article introduces Flink SQL, a unified stream-batch processing engine, focusing on key concepts like Stream-Table Duality, event time/watermarks.

Apache Flink Broadcast Variable Optimization: FLIP-5's Approach to Reducing Network Overhead

This is Technical Insights Series by Perry Ma | Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.

Best Practices for Flink CDC YAML in Realtime Compute for Apache Flink

This article is authored by the data pipeline team of Alibaba Cloud's open-source big data platforms.

Build an All-in-one Real-time Data Warehouse (Code-level) Based on AnalyticDB for PostgreSQL

This article introduces the process of building an all-in-one real-time data warehouse using AnalyticDB for PostgreSQL at the code level.

Technical Principle of Hologres Binlog

This article describes an overview of the implementation principles and best practices of Hologres Binlog.

Implementation of Real-Time Data Warehouse Storage and Analysis of Various Technical Architectures

This article discusses several facets of real-time data warehouses, including characteristics, benefits, and building them.

StarRocks x Flink CDC for End-to-End Real-Time Links

This article discusses real-time data warehouse construction and offers examples of using Flink CDC and StarRocks for real-time links and data updates.

Introduction to Alibaba Cloud AnalyticDB: A Real-Time Data Warehouse Service for PB Data

In this video, we will introduce Alibaba Cloud AnalyticDB, a high-performance real-time cloud-native data warehouse by Alibaba Cloud.

Data Lake House: Technical Principle Analysis of Hologres, Accelerating Cloud DLF

This article analyzes the technical principles of the Hologres high-performance analytics engine to accelerate the query of Cloud Data Lake Formation (DLF).

Looking at the Development Trend of Real-Time Data Warehouses from the Core Scenarios of Alibaba

This article explores real-time data warehouses using core scenarios of Alibaba.

The Open-Source Real-Time Data Warehouse Solution Based on EMR OLAP - ClickHouse Transaction Implementation

This article describes the solution of an open-source real-time data warehouse based on EMR OLAP.

The Evolution of the Wanli Niu Real-Time Data Warehouse

This article explains how Hupan Network (Wanli Niu) uses Alibaba Cloud's big data components to build a data middle platform step by step.

Flink Practices in iQiyi's Advertising Business

This article explains thoroughly how iQiyi (a Chinese online video platform) utilizes Apache Flink.