This article describes the basic concepts, importance, development, and current applications of Apache Flink.
This article introduces a method that leverages distributed computing resources to calculate the global dictionary index.
Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.
This article is based on a keynote speech given by WANG Feng, initiator of Apache Flink Community China and head of Open-Source Big Data Platform at Alibaba Cloud, at Flink Forward Asia 2023.
Ready to dive into real-time data processing? Learn Apache Flink basics & set up with Alibaba Cloud's Realtime Compute for Apache Flink.
Part 10 of this series introduces several implementations of distributed transactions as a second preventive solution to data inconsistency.
Part 9 of this series introduces the replica mechanism for high availability and discusses data consistency.
Part 8 of this series discusses one of the core problems of distributed systems: availability.
This article describes common problems and optimization methods of data read/write in computing-storage separation scenarios, and introduces data cache acceleration with JindoFS.
This article provides deep insights into the data lake concept and compares some common solutions available in the market.
This is an extra article from the 10-part series, discussing the engineering implementation of Paxos.
Part 6 of this 10-part series focuses on the source codes of the distributed deadlock detection function in PolarDB-X.
This short article explains the benefits of Alibaba Cloud Tablestore.
This article describes the resource definition, visualized control capability, and distributed batch processing capability of the task scheduling platform.
This article explains the zero-trust concept and how to use it to enhance application security in ASM.
This article reviews Alibaba Cloud's enterprise-level cloud-native data lake solution launched during the double 11 festival and discusses its key benefits.
This article introduces the EPaxos algorithm in a simple and easy-to-understand way, suitable even for those with basic knowledge of Paxos or Raft algorithms.
This article introduces the core protocol process of EPaxos from the perspective of the comparison between Paxos and EPaxos.
This article discusses the practices and challenges of EMR Spark on Alibaba Cloud Kubernetes.
This article mainly introduces Flink fault tolerance mechanism principles along with stateful stream computing, global consistency snapshots, and Flink state management.