Data Processing

How Does the Asynchronous Task Processing System Solve the Problems of Time Consuming and High Concurrency?

This article introduces the applicable scenarios and benefits of the asynchronous task processing system and discusses the architecture, functions, an...

Process and Analyze Massive Logs Centrally with Alibaba Cloud Log Service

This short article explains how to get the most out of Alibaba Cloud Log Service.

Avirtech: Bringing Fresh Ideas to Farming | Video Story

Indonesia-based Avirtech provides data processing and drone-based solutions to improve farmers’ crop yields and formulate deeper understandings of conditions in their fields.

Streamline Data Management, Storage, and Analysis with Alibaba Cloud Data Lake

This short article discusses the definition and benefits of data lakes and the Alibaba Cloud Data Lake solution.

Improving speed and stability of checkpointing with generic log-based incremental checkpoints

In this article, we discuss several ways to improve the speed and stability of checkpointing with generic log-based incremental checkpoints.

Looking at the Development Trend of Real-Time Data Warehouses from the Core Scenarios of Alibaba

This article explores real-time data warehouses using core scenarios of Alibaba.

Frontier Technology | AI on the Cloud Helps Scientific Research

This article discusses Alibaba DAMO Academy's latest release, AI Earth.

Principle Analysis of Apache Flink CDC Batch and Stream Integration

This article focuses on the processing logic of Flink CDC.

Flink CDC Series – Part 5: Implement Real-Time Writing of MySQL Data to Apache Doris

Part 5 of this 5-part series explains how to use Flink CDC and Doris Flink Connector to monitor data from MySQL databases and store the data in Doris tables in real time.

Flink CDC Series – Part 3: Synchronize MySQL Database and Table Shard to Build an Iceberg Real-Time Database

Part 3 of this 5-part series shows how to use Flink CDC to build a real-time database and handle database and table shard merge synchronization.

The Internet of Things: Accelerate Success with Alibaba Cloud IoT Platform

This short article briefly explains the Internet of Things (IoT) and Alibaba Cloud IoT Platform.

Storing Data Has Never Been Easier - Part 3 of About Distributed Systems

As data grows exponentially, cloud servers often run out of space to store it. Luckily, distributed file systems like HDFS are now solving this problem of limited storage capacity.

Why Should There Be Distributed Systems? - Part 1 of About Distributed Systems

This is the first part of a carefully conceived series of 20-30 articles on distributed systems. I hope to take this journey with you to understand the ins and outs of distributed systems.

Flink CDC Series – Part 1: How Flink CDC Simplifies Real-Time Data Ingestion

Part 1 of this 5-part series explains how to use Flink CDC to simplify the ingestion of real-time data into the database.

How to Build a Cloud-Native Open-Source Big Data Platform | Best Practices of InMobi

This article shares the best practices of InMobi based on the open-source big data service of Alibaba Cloud.

OpenYurt Teamed with eKuiper to Solve the Processing Problems of Edge Streaming Data in IoT Scenarios

This article discusses the new partnership between OpenYurt and eKuiper.

Compilation Optimization: LLVM Code Generation Technology Details and Its Application in Databases

This article introduces LLVM-based code generation (Codegen) technology.

The Practice of Semi-Structured Data Processing Based on MaxCompute SQL

This article mainly discusses the semi-structured processing capability of MaxCompute.

Friday Blog - Week 32 - No-code APIs with DataService Studio

Learn how to quickly and easily deploy data-driven HTTP APIs from within DataWorks, without writing any code!

The Practice of Real-Time Data Processing Based on MaxCompute

This article explains how to write real-time streaming data based on BinLog, Flink, and Spark Streaming into MaxCompute.