×
Data Processing

Why Should There Be Distributed Systems? - Part 1 of About Distributed Systems

This is the first part of a carefully conceived series of 20-30 articles on distributed systems, I hope to take the journey with you to understand the ins and outs of the distributed systems.

Flink CDC Series – Part 1: How Flink CDC Simplifies Real-Time Data Ingestion

Part 1 of this 5-part series explains how to use Flink CDC to simplify the entry of real-time data into the database.

How to Build a Cloud-Native Open-Source Big Data Platform | Best Practices of InMobi

This article shares the best practices of InMobi based on the open-source big data service of Alibaba Cloud.

OpenYurt Teamed with eKuiper to Solve the Processing Problems of Edge Streaming Data in IoT Scenarios

This article discusses the new partnership between OpenYurt and eKuiper.

Compilation Optimization: LLVM Code Generation Technology Details and Its Application in Databases

This article mainly introduces the code generation technology based on LLVM (Codegen).

The Practice of Semi-Structured Data Processing Based on MaxCompute SQL

This article mainly discusses the semi-structured processing capability of MaxCompute.

Friday Blog - Week 32 - No-code APIs with DataService Studio

Learn how to quickly and easily deploy data-driven HTTP APIs from within DataWorks, without writing any code!

The Practice of Real-Time Data Processing Based on MaxCompute

This article explains how to write real-time streaming data based on BinLog, Flink, and Spark Streaming into MaxCompute.

Jingdong: Flink SQL Optimization Practice

This article focuses on the optimization measures of Jingdong in Flink SQL tasks, focusing on the aspects of shuffle, join mode selection, object reuse, and UDF reuse.

Use Case | Precision Marketing with Low-Cost: Game Publisher Best Practices

This article explains how a gaming company used Alibaba Cloud products to expand its business.

Best Practices for Big Data Processing in Spark

This article is an overview of the best practices for big data processing in Spark taken from a lecture.

Using DataWorks and MaxCompute: First Steps

Learn how to import, analyze, and export data using Alibaba Cloud's DataWorks and MaxCompute.

Data Creation Methods for MaxCompute

This article discusses several methods for creating data on MaxCompute for functional testing and demos.

Progress in Dialog Management Model Research

This article provides a summary of the progress made in dialog management model research over the years.

Serverless Practices for Large-Scale Data Processing

This article elaborates on serverless practices for large-scale data processing and describes specific practical cases.

The Discovery of a Promising Technology

In this article, Zhang Jianfeng, a veteran in the open-source community, explains how to evaluate whether the technology is worth learning using three key dimensions.

Basic Machine Learning: How to Recognize a Cat

This article explains the main steps for training a machine and obtaining a model, provides some simple practices, and shares a basic principle of machine learning.

Setting up Spark on MaxCompute

This post provides a walkthrough on how to set up Spark on MaxCompute on Alibaba Cloud.

Analyzing Hot and Cold Tables with MaxCompute

This article shows how you can analyze hot and cold tables by using MaxCompute metadata.

An Enterprise-level Data Empowerment System That Features a Closed Loop, Accumulation, and Sustainability

Learn about the development of an enterprise-level data empowerment system as well as the Umeng Databank.