×
Data Lake

Deploying A Data Warehouse on Alibaba Cloud

This article explains how to deploy a data warehouse on Alibaba Cloud using AnalyticDB for MySQL service.

The Practice of Lake House in the FinTech Industry

This article explains the four stages of lake house evolution within the Shanghai Shuhe Group.

Introducing Data Lake Analytics for Fintechs

This blog shows you how you can analyze your data stored in the cloud quickly, securely, and at low costs with Data Lake Analytics.

Building Your Data Lake on Alibaba Cloud

This article explains data lakes and how to build data lakes on Alibaba Cloud.

Alibaba Cloud Launches Enterprise-Level Cloud-Native Data Lake during 2020 Double 11

This article reviews Alibaba Cloud's enterprise-level cloud-native data lake solution launched during the double 11 festival and discusses its key benefits.

DLF + DDI Best Practices for One-Stop Data Lake Formation and Analysis

This article aims to give readers a deeper understanding of Alibaba Cloud Data Lake Formation (DLF) and Databricks DataInsight (DDI).

Use Flink Hudi to Build a Streaming Data Lake

This article introduces the optimization and evolution of Flink Hudi's original mini-batch-based incremental computing model through stream computing.

The Intelligent Evolution of the Data Middle Platform – 12 Years of Development from Alibaba's Data Platform

This article explains the developmental stages of Alibaba’s data middle platform.

How ADB for PG Provide Data Analysis for Lake-Warehouse Integration

This article describes how to use ApsaraDB for PostgreSQL to achieve data lake analysis based on the foreign table object type of PostgreSQL.

Construction, Analysis, and Development Governance of a Cloud-Native Data Lake

This article introduces the best practices and cases for building, analyzing, developing, and governing cloud-native data lakes.

Quick Implementation of Data Lake House Based on MaxCompute

This article is a translation of the speech on how to quickly implement data warehouse and lake house based on MaxCompute.

Continuous Evolution and Development of Data Warehouse Architecture

Continuous Evolution and Development of Data Warehouse Architecture – Cloud Native, Lake house, Offline-Realtime Unification and SaaS Mode

Fluid with JindoFS: An Acceleration Tool for Alibaba Cloud OSS

This article introduces Fluid, an open source Kubernetes-native distributed dataset orchestrator and accelerator for data-intensive applications, and talks about the advantages of JindoRuntime.

StarLake: Exploration and Practice of Mobvista in Cloud-Native Data Lake

This article introduces the exploration and practice of Mobvista in the field of cloud-native data lakes, as well as the architecture of StarLake.

Application of Apache Flink in Real-time Financial Data Lake

This article provides an in depth introduction to the architecture, application, and best practices of real-time financial data lakes by Zhongyuan Bank.

How to Analyze CDC Data in Iceberg Data Lake Using Flink

This article discusses the challenges and limitations of various solutions in CDC data analysis and describes how to use Flink and Iceberg to overcome them.

An Overview of Alibaba Cloud's Comprehensive Cloud-Native Data Lake System

This article introduces the establishment of a cloud-native data lake system based on Alibaba Cloud OSS, Data Lake Formation (DLF), and various computing engines present in Alibaba Cloud.

How Delta Lake and DLF Service Facilitate Real-time CDC Synchronization in a Data Lake

This article explains how to perform real-time CDC synchronization in a data lake using Alibaba Cloud's Data Lake Formation (DLF) service.

How to Use JindoDistCp for Offline Data Migration to a Data Lake

This article discusses the data lake offline data migration process using JindoDistCp and explains how it improves the migration performance in different scenarios.

Integrating Apache Hudi and Apache Flink for New Data Lake Solutions

This article explains Apache Hudi and Apache Flink and the benefits of implementation.