×
Batch computing

Cloud-native Offline Workflow Orchestration: Kubernetes Clusters for Distributed Argo Workflows

This article explores the distinctions between mainstream batch computing systems and Kubernetes clusters for distributed Argo Workflows.

How to Write Simple and Efficient Flink SQL

This article is compiled from Xiaolin He’s presentation at the 2022 Flink Forward Asia (FFA) Conference, discussing Flink SQL insight, best practices, and future works.

The Thinking and Design of a Quasi-Real-Time Data Warehouse with Stream and Batch Integration

This article explores the practice of stream-batch integrated Flink SQL based on data lakes and explores the expression consistency, result consistenc...

Flink Shuffle 3.0: Vision, Roadmap, and Progress

This article was compiled from a speech by Song Xintong (Wuzang) (an Alibaba Cloud Senior Technical Expert) during FFA 2022, discussing Flink Shuffle 3.

Flink ML API: The Algorithmic Interface and Iterative Engine Designed for Real-Time Machine Learning

This article explains Flink ML API, its construction, and its use cases.

Adaptive Batch Scheduler Automatically Decide Parallelism of Flink Batch Jobs

We introduce Apache Flink's adaptive batch scheduler and detail how it can automatically decide parallelism of Flink batch jobs.

Principle Analysis of Apache Flink CDC Batch and Stream Integration

This article focuses on the processing logic of Flink CDC.

Flink Practices in iQiyi's Advertising Business

This article explains thoroughly how iQiyi (a Chinese online video platform) utilizes Apache Flink.

Flink 1.11: An Engine with Unified SQL Support for Batch and Streaming Data

This article introduces the enhanced capabilities of Flink 1.11 to support SQL to process batch and streaming data

A Deep Dive into Apache Flink 1.11: Stream-Batch Integrated Hive Data Warehouse

Li Jinsong and Li Rui, Alibaba Technical Experts, talk about the features, revisions, and improvements of Apache Flink 1.11.

A Big Data-Based Public Opinion Analysis System: Architecture Anatomy

This article, as part one of a two part series, describes the architecture behind mainstream big data models and looks at the architecture behind the public opinion analysis system.

Fault Tolerance with Application High Availability or Batch Compute

We will talk about two seemingly opposing ideas – high availability and batch computing – can be integrated into a single solution using Alibaba Cloud's services.

Creating Custom Environments for Batch Services

In this article, we will not explore how to create jobs rather we will take a look at how we can customize the underlying infrastructure as needed or required by our software packages.