×
Data Lake

Alibaba Cloud Data Lake: The Smart Choice for Modern Data Management

Alibaba Cloud Data Lake provides a robust, secure, and cost-effective solution for modern data management, addressing the limitations of traditional on-premises data lake systems.

Data Lake for Stream Computing: The Evolution of Apache Paimon

Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

Integration of Paimon and Spark - Part I

This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.

Lakehouse: AnalyticDB for MySQL Ingests Data from Multiple Tables to Data Lakes with Flink CDC + Hudi

This article explores how AnalyticDB for MySQL uses Apache Hudi to ingest complete and incremental data from multiple CDC tables into data lakes.

Secure Marketing Data Management on Alibaba Cloud: Best Practices for Marketers

This post discusses secure marketing data management, emphasizing the importance of data security in marketing.

How Generative AI Can Revolutionize Data Engineering

This article describes how Generative AI can be utilized along with Common Data Engineering terms such as Data Lake, ETL Pipeline, Data Lineage, Data Warehouse and Data Visualization.

Alibaba Cloud Open Data Platform and Service | Lakehouse of MaxCompute

In this episode, we will introduce the idea of lakehouse and Alibaba Cloud Lakehouse of MaxCompute.

Data into the Lake Based on Flink High-Throughput Exactly-Once Consistency

This article describes the challenges and solutions of SLS using APS to quickly enter the lake with Exactly-Once consistency.

The Intelligent Evolution of the Data Middle Platform – 12 Years of Development from Alibaba's Data Platform

This article explains the developmental stages of Alibaba’s data middle platform.

Analysis on the Serverless Elasticity of Cloud-Native AnalyticDB for MySQL

This article discusses data lakehouse edition, AnalyticDB for MySQL, and cost reduction and efficiency enhancement.

Data Lake Management and Optimization

This article was compiled from a speech from Qingwei Yang at the Alibaba Cloud Data Lake Technology Special Exchange Meeting on July 17, 2022.

Unified Metadata and Permissions for Data Lakes

This article was compiled from a speech from Xiong Jiashu at the Alibaba Cloud Data Lake Technology Special Exchange Meeting.

AnalyticDB for MySQL Data Lakehouse Edition: Build a Cloud-Native Comprehensive Data Analysis Platform from Lake to Warehouse

This article introduces AnalyticDB for MySQL Data Lakehouse Edition, its architecture, and its advantages.

Data Lake: Concepts, Characteristics, Architecture, and Case Studies

This article provides deep insights into the data lake concept and compares some common solutions available in the market.

Achieving Cost Reduction and Efficiency Enhancement with Alibaba Cloud Storage Data Lake 3.0

This article discusses how data lakes can offer cost savings and the future possibilities of data lake architecture.

Data Lake House: Technical Principle Analysis of Hologres, Accelerating Cloud DLF

This article analyzes the technical principles of the Hologres high-performance analytics engine to accelerate the query of Cloud Data Lake Formation (DLF).

Alibaba Cloud Cloud-Native Integrated Data Warehouse: An Interpretation of Data Security Capabilities

This article discusses MaxCompute's architecture, ecosystem, subproducts, and security capabilities.

Alibaba Cloud Cloud-Native Integrated Data Warehouse – An Interpretation of the New Capabilities of Lakehouse

This article discusses the overall updates to Lakehouse architecture.

Databricks Data Insight Open Course - An Introduction to Delta Lake (Open-Source Edition)

This part of the Databricks Data Insight Open Course article series introduces Delta Lake Basics (Open-Source Edition).