Big Data

Use SPL to Efficiently Implement Flink SLS Connector Pushdown

This article introduces SPL and its application in the Realtime Compute for Apache Flink SLS Connector.

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

The Next Generation of Apache Flink

This article is based on a keynote speech given by SONG Xintong during Flink Forward Asia 2023. SONG leads a team that mainly works on Apache Flink's ...

Alibaba Cloud Open Data Platform and Service | Realtime Compute for Apache Flink

In this episode, we will introduce Alibaba Cloud's Realtime Compute for Apache Flink.

Analysis of Alibaba Cloud Realtime Compute for Apache Flink: Deep Exploration into MongoDB Schema Inference

This article provides a deep exploration into MongoDB schema inference, focusing on the core features of MongoDB CDC Community Edition and its implementation in Realtime Compute for Apache Flink.

Blogs of the Week – Ep. 6, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the sixth episode of Blogs of the Week in 2024.

Mastering Interview Questions: A Comprehensive Guide to Elasticsearch

This article is part of a series on interview questions for technical professionals, offering an in-depth exploration of Elasticsearch.

Integration of Paimon and Spark - Part I

This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.

Interpretation of Gemini: an Enterprise-level State Storage Engine of Alibaba Cloud Realtime Compute for Apache Flink

This article provides an in-depth interpretation of Gemini, an enterprise-level state storage engine of Alibaba Cloud Realtime Compute for Apache Flink.

[Infographic] Tech for Innovation | Alibaba Cloud Spring Launch 2024

To foster long-term AI growth, Alibaba Cloud is reducing the cost of essential public cloud products for international customers.

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

Writing Flink SQL for Weakly Structured Logs: Leveraging SLS SPL

This article describes how to use SLS SPL (SLS Processing Language) to configure the SLS Connector to structure data.

AliORC: A Combination of MaxCompute and Apache ORC

In this blog, Senior Technical Expert Wu Gang discusses the differences between open-source storage formats ORC and Parquet, and the reason why MaxCompute chose ORC.

Combining Elasticsearch with DBs: Offline Data Synchronization

This article describes how to synchronize data from databases (DBs) to Elasticsearch in offline mode.

Blogs of the Week – Ep. 5, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the fifth episode of Blogs of the Week in 2024.

Practical Use of MaxCompute Metadata: Data Permission Statistics

This article explains how to compile permission statistics by using metadata-related permission views.

Introduction to Alibaba Cloud's Link Vision for Building AI Cameras in the Vietnamese Market

Link Vision, an Alibaba Cloud service, is an intelligent video surveillance solution that integrates Internet of Things (IoT) and artificial intelligence (AI) technologies.

Blogs of the Week – Ep. 4, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the fourth episode of Blogs of the Week in 2024.

Practical Use of MaxCompute Metadata: Job Accounting

This article introduces MaxCompute's tenant-level Information Schema and focuses on job accounting through the TASKS_HISTORY metadata view.

Practical Use of MaxCompute Metadata: Statistical Analysis of Project Information

This article introduces MaxCompute's tenant-level Information Schema and focuses on using the CATALOGS metadata view for project-related statistics.