Big Data

Mastering Interview Questions: A Comprehensive Guide to Elasticsearch

This article is part of a series on interview questions for technical professionals, offering an in-depth exploration of Elasticsearch.

Integration of Paimon and Spark - Part I

This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.

Interpretation of Gemini: an Enterprise-level State Storage Engine of Alibaba Cloud Realtime Compute for Apache Flink

This article takes an in-depth look at Gemini, an enterprise-level state storage engine of Alibaba Cloud Realtime Compute for Apache Flink.

[Infographic] Tech for Innovation | Alibaba Cloud Spring Launch 2024

To foster long-term AI growth, Alibaba Cloud is reducing the cost of essential public cloud products for international customers.

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

Writing Flink SQL for Weakly Structured Logs: Leveraging SLS SPL

This article describes how to use SLS SPL (Structured Programming Language) to configure the SLS Connector to structure data.

AliORC: A Combination of MaxCompute and Apache ORC

In this blog, Senior Technical Expert Wu Gang discusses the differences between open-source storage formats ORC and Parquet, and the reason why MaxCompute chose ORC.

Combining Elasticsearch with DBs: Offline Data Synchronization

This article describes how to synchronize data from databases (DBs) to Elasticsearch in offline mode.

Blogs of the Week – Ep. 5, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the fifth episode of Blogs of the Week in 2024.

Practical Use of MaxCompute Metadata: Data Permission Statistics

This article explains how to compile permission statistics by using metadata-related permission views.

Introduction to Alibaba Cloud Link Vision for Building AI Cameras in the Vietnamese Market

Link Vision, an Alibaba Cloud service, is an intelligent video surveillance solution that integrates Internet of Things (IoT) and artificial intelligence (AI) technologies.

Blogs of the Week – Ep. 4, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the fourth episode of Blogs of the Week in 2024.

Practical Use of MaxCompute Metadata: Job Accounting

This article mainly introduces MaxCompute's tenant-level Information Schema and focuses on job accounting through the TASKS_HISTORY view of metadata.

Practical Use of MaxCompute Metadata: Statistical Analysis of Project Information

This article mainly introduces MaxCompute's tenant-level Information Schema and focuses on using the CATALOGS view of metadata for project-related statistics.

Practical Use of MaxCompute Metadata: Data Download Audit

This article mainly introduces MaxCompute's tenant-level Information Schema and focuses on statistical analysis through the TUNNELS_HISTORY view of metadata.

Alibaba Cloud Technical Salon | Building Data Utilization Platform with Alibaba Cloud Quick BI

Learn what Quick BI is and how to build a data utilization platform with Alibaba Cloud's cloud-native BI.

miHoYo Big Data Cloud-Native Practices

This article describes how miHoYo upgraded its big data architecture to cloud-native and the benefits of using Spark on K8s.

Lakehouse: AnalyticDB for MySQL Ingests Data from Multiple Tables to Data Lakes with Flink CDC + Hudi

This article explores how AnalyticDB for MySQL uses Apache Hudi to ingest complete and incremental data from multiple CDC tables into data lakes.

Blogs of the Week – Ep. 3, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the third episode of Blogs of the Week in 2024.

Running ODPS PySpark using CLI

This article discusses Spark in general, its uses in the big data workflow, and how to configure and run Spark in CLI mode for CI/CD purposes.