Community

Blog Events Webinars Tutorials Forum

Create Account

×

Data Lake

Understanding Fluss Partial Update

Traditional streaming data pipelines often need to join many tables or streams on a primary key to create a wide view.

Apache Flink Community June 5, 2025 392

vivo's Lakehouse Integration Practice Based on Paimon

This article is compiled from the presentation by Xu Yu, an internet big data expert at vivo and Apache Paimon Committer, during the Flink Forward Asia 2024 Streaming Lakehouse session (Part One).

Apache Flink Community May 30, 2025 515

Alibaba Cloud Data Lake: The Smart Choice for Modern Data Management

Alibaba Cloud Data Lake provides a robust, secure, and cost-effective solution for modern data management, addressing the limitations of traditional on-premises data lake systems.

Rupal_Click2Cloud August 19, 2024 1,512

Data Lake for Stream Computing: The Evolution of Apache Paimon

Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.

Apache Flink Community June 11, 2024 3,912

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

Alibaba EMR April 25, 2024 3,055

Integration of Paimon and Spark - Part I

This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.

Alibaba EMR April 15, 2024 3,420

Lakehouse: AnalyticDB for MySQL Ingests Data from Multiple Tables to Data Lakes with Flink CDC + Hudi

This article explores how AnalyticDB for MySQL uses Apache Hudi to ingest complete and incremental data from multiple CDC tables into data lakes.

ApsaraDB February 29, 2024 2,078

Secure Marketing Data Management on Alibaba Cloud: Best Practices for Marketers

This post discusses secure marketing data management, emphasizing the importance of data security in marketing.

Nick Patrocky January 24, 2024 2,807

How Generative AI Can Revolutionize Data Engineering

This article describes how Generative AI can be utilized along with Common Data Engineering terms such as Data Lake, ETL Pipeline, Data Lineage, Data Warehouse and Data Visualization.

GAVASKAR S August 14, 2023 9,793

Alibaba Cloud Open Data Platform and Service | Lakehouse of MaxCompute

In this episode, we will introduce the idea of lakehouse and Alibaba Cloud Lakehouse of MaxCompute.

Alibaba Cloud Data Intelligence July 26, 2023 1,260

Data into the Lake Based on Flink High-Throughput Exactly-Once Consistency

This article describes the challenges and solutions of SLS using APS to quickly enter the lake with Exactly-Once consistency.

ApsaraDB July 25, 2023 2,486

The Intelligent Evolution of the Data Middle Platform – 12 Years of Development from Alibaba's Data Platform

This article explains the developmental stages of Alibaba’s data middle platform.

Alibaba Cloud Community September 17, 2021 12,482

Analysis on the Serverless Elasticity of Cloud-Native AnalyticDB for MySQL

This article discusses data lakehouse edition, AnalyticDB for MySQL, and cost reduction and efficiency enhancement.

ApsaraDB March 15, 2023 2,124

Data Lake Management and Optimization

This article was compiled from a speech from Qingwei Yang at the Alibaba Cloud Data Lake Technology Special Exchange Meeting on July 17, 2022.

Alibaba EMR February 20, 2023 2,588

Unified Metadata and Permissions for Data Lakes

This article was compiled from a speech from Xiong Jiashu at the Alibaba Cloud Data Lake Technology Special Exchange Meeting.

Alibaba EMR February 15, 2023 3,403

AnalyticDB for MySQL Data Lakehouse Edition: Build a Cloud-Native Comprehensive Data Analysis Platform from Lake to Warehouse

This article introduces AnalyticDB for MySQL Data Lakehouse Edition, its architecture, and its advantages.

ApsaraDB January 9, 2023 2,442

Data Lake: Concepts, Characteristics, Architecture, and Case Studies

This article provides deep insights into the data lake concept and compares some common solutions available in the market.

ApsaraDB November 17, 2020 41,726

Achieving Cost Reduction and Efficiency Enhancement with Alibaba Cloud Storage Data Lake 3.0

This article discusses how data lakes can offer cost savings and the future possibilities of data lake architecture.

Alibaba Cloud Community December 7, 2022 2,451

Data Lake House: Technical Principle Analysis of Hologres, Accelerating Cloud DLF

This article analyzes the technical principles of the Hologres high-performance analytics engine to accelerate the query of Cloud Data Lake Formation (DLF).

Hologres December 2, 2022 2,140

Alibaba Cloud Cloud-Native Integrated Data Warehouse: An Interpretation of Data Security Capabilities

This article discusses MaxCompute's architecture, ecosystem, subproducts, and security capabilities.

Alibaba Cloud MaxCompute October 31, 2022 8,002

Related Tags

artificial intelligence big data cloud computing