×
Stream Processing

VLDB Paper Deep Dive: As AI Advances, Data Analysis Enters the Era of Incremental Computation

This article introduces StreamingView, AnalyticDB for PostgreSQL's native incremental engine that delivers efficient real-time materialized views for AI-scale data without redundant movement.

Building a Unified Lakehouse for Large-Scale Recommendation Systems with Apache Paimon at TikTok

TikTok transitioned to a unified Lakehouse architecture, powered by Apache Paimon, to optimize large-scale recommendation models (LRMs) that utilize user behavior sequences.

Flink State Management: A Journey from Core Primitives to Next-Generation Incremental Computation

Explore Flink 2.0's evolution in state management, from core primitives to cloud-native architecture and next-gen incremental computation.

From Data Streams to Actionable Insights: Grab's Journey with Apache Flink in Real-Time Analytics and Data Quality

Discover how Grab leverages Apache Flink for real-time analytics and data quality, transforming raw data into actionable insights.

Building an Efficient Product Selection Platform for Lazada: A Real-Time Analytics Journey with Apache Flink and Hologres

Discover how Lazada Group built a large-scale e-commerce product selection platform using Apache Flink and Hologres for real-time analytics and stream processing.

Flink SQL 101: Embrace Unified Stream and Batch Processing

Master Flink SQL fundamentals with Stream-Table Duality, event time, and watermarks. Build unified stream-batch processing pipelines for modern data engineering.

Apache Flink FLIP-4: Enhanced Window Evictor for Flexible Data Eviction Before/After Processing

Master Apache Flink FLIP-4 enhanced window evictor for flexible data eviction. Learn real-time quality control, anomaly detection, and production window processing strategies.

Mastering Flink Fault Tolerance: FLIP-1 Smart Recovery Strategies for Distributed Systems

Master Apache Flink FLIP-1 intelligent task failure recovery strategies. Learn partial recovery, smart intermediate data management, and fault toleran...

Compare Flink SQL and DataStream API: Comprehensive Guide for New Developers

Apache Flink is a stream processing framework with two main interfaces: Flink SQL, which uses SQL for batch and streaming data, and the DataStream API...

Apache Flink FLIP-2: Context-Aware Window Functions for Real-Time Analytics

This is Technical Insights Series by Perry Ma | Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.

Flash: A Next-gen Vectorized Stream Processing Engine Compatible with Apache Flink

This article is based on a presentation by Mr. Wang Feng (nickname: Mowen), senior director at Alibaba Cloud and head of the open source big data de.

Introduction to Unified Batch and Stream Processing of Apache Flink

Unified batch and stream processing of Flink is a well-established concept in the stream computing field.

Hands-on Labs | Get Started with Flink MySQL Connector in 5 Minutes

This step-by-step tutorial introduces how to get started with Flink MySQL Connector in 5 minutes.

Accelerated Integration: Unveiling Flink Connector's API Design and Latest Advances

This article is compiled from the presentation by Ren Qingsheng, committer and PMC member of Apache Flink, at the Flink Forward Asia 2023 core technology session (Part 2).

Understanding Batch Processing vs Stream Processing: Key Differences and Applications

Explore the differences between Batch Processing vs Stream Processing and their applications in data management for better decision-making.

Data Lake for Stream Computing: The Evolution of Apache Paimon

Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.

Understand Flink SQL: Real-Time SQL Query Execution for Stream and Batch Data

Discover Flink SQL, the high-level API for executing SQL queries across streaming and batch data sets in Apache Flink.

Complex Event Processing (CEP): A Comprehensive Guide

Discover the power of Complex Event Processing (CEP) in deciphering real-time cause-and-effect relationships from diverse data streams.

The Next Step of Flink CDC

This article is based on a keynote speech given by Jark Wu, head of Flink SQL and Flink CDC at Alibaba Cloud, during Flink Forward Asia 2023.

Apache Flink Has Become the De Facto Standard for Stream Computing

This article is based on a keynote speech given by WANG Feng, initiator of Apache Flink Community China and head of Open-Source Big Data Platform at Alibaba Cloud, at Flink Forward Asia 2023.