flink

Paimon 1.0: Unified Lake Format for Data + AI

Explore how Apache Paimon addresses big data system challenges by offering a unified data lake storage solution.

Harnessing Streaming Data for AI-Driven Applications with Apache Flink

Discover how to harness streaming data for AI-driven applications using Apache Flink, based on insights from the Flink Forward Asia 2024 keynote by Ashish Sharma and Ganireddy Jyothi Swaroop.

Flink Course Series (1): A General Introduction to Apache Flink

This article describes the basic concepts, importance, development, and current applications of Apache Flink.

Flink SQL Development Experience Sharing

This article shares the author's experience tackling issues encountered while developing real-time data processing tasks with Apache Flink.

Big Data Cloud Fighter Bootcamp

This article introduces the Big Data Cloud Fighters bootcamp, which provides an intensive, hands-on experience in mastering big data principles and technologies.

Hands-on Labs | Get Started with Flink MySQL Connector in 5 Minutes

This step-by-step tutorial shows how to get started with Flink MySQL Connector in 5 minutes.

Understand Flink SQL: Real-Time SQL Query Execution for Stream and Batch Data

Discover Flink SQL, the high-level API for executing SQL queries across streaming and batch data sets in Apache Flink.

Practice of Flink 2.0 State Storage-computing Separation

This article gives an overview of the research and practice behind separating state storage from compute in Flink 2.0.

Analysis and Application of New Features of Flink ML

This article covers an overview of Flink ML and discusses the design and application of online learning, online inference, and feature engineering algorithms.

The Next Generation of Apache Flink

This article is based on a keynote speech given by SONG Xintong during Flink Forward Asia 2023. SONG leads a team that mainly works on Apache Flink's ...

Alibaba Cloud Open Data Platform and Service | Realtime Compute for Apache Flink

In this episode, we will introduce Alibaba Cloud's Realtime Compute for Apache Flink.

Interpretation of Gemini: an Enterprise-level State Storage Engine of Alibaba Cloud Realtime Compute for Apache Flink

This article takes an in-depth look at Gemini, an enterprise-level state storage engine of Alibaba Cloud Realtime Compute for Apache Flink.

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

Writing Flink SQL for Weakly Structured Logs: Leveraging SLS SPL

This article describes how to use SLS SPL (Structured Programming Language) to configure the SLS Connector to structure data.

How to Use Confluent with FlinkSQL

This article gives step-by-step instructions for using Confluent with FlinkSQL.

Observability | Key Metrics to Focus On When Using Prometheus to Monitor E-MapReduce

This article explains how to monitor big data in EMR using Prometheus Service.

AnalyticDB for MySQL: Implementing High Throughput, Exactly-Once Data Ingestion with Flink

This article explains how data from the SLS data source is ingested through APS with high throughput and exactly-once consistency, and covers the related challenges and solutions.

Announcement of the Release of Apache Flink 1.18

The Apache Flink PMC is pleased to announce the release of Apache Flink 1.18.0. As usual, we are looking at a packed release with a wide variety of improvements and new features.

Learning about Distributed Systems - Part 27: From Batch Processing to Stream Computing

Part 27 of this series discusses distributed systems in terms of throughput and latency.

All You Need to Know About PyFlink

This article discusses the structure of a PyFlink job, operational mechanisms, performance optimization strategies, and future projections for PyFlink.