Data Lake

Application of Apache Flink in Real-time Financial Data Lake

This article provides an in-depth introduction to the architecture, application, and best practices of real-time financial data lakes by Zhongyuan Bank.

How to Analyze CDC Data in Iceberg Data Lake Using Flink

This article discusses the challenges and limitations of various solutions in CDC data analysis and describes how to use Flink and Iceberg to overcome them.

An Overview of Alibaba Cloud's Comprehensive Cloud-Native Data Lake System

This article introduces the establishment of a cloud-native data lake system based on Alibaba Cloud OSS, Data Lake Formation (DLF), and various computing engines available on Alibaba Cloud.

How Delta Lake and DLF Service Facilitate Real-time CDC Synchronization in a Data Lake

This article explains how to perform real-time CDC synchronization in a data lake using Alibaba Cloud's Data Lake Formation (DLF) service.

How to Use JindoDistCp for Offline Data Migration to a Data Lake

This article discusses the offline data migration process for data lakes using JindoDistCp and explains how it improves migration performance in different scenarios.

Integrating Apache Hudi and Apache Flink for New Data Lake Solutions

This article explains Apache Hudi and Apache Flink and the benefits of integrating them.

Building an Enterprise-Level Real-Time Data Lake Based on Flink and Iceberg

This article explains real-time data lakes based on Apache Flink and Apache Iceberg.

JindoTable for Data Optimization and Query Acceleration in a Data Lake

This article briefly discusses Alibaba Cloud's JindoTable and explains how it solves data management problems in a data lake.

Cloud-Native Compute Engine: Challenges and Solutions

This article explains some of the challenges in cloud-native compute engines, and discusses some solutions and future directions.

Implementation and Challenges of Data Lake Metadata Services

This article explains the benefits, architecture, and implementation challenges of data lake metadata services.

EB-level Data Lake Based on OSS

This article briefly discusses data lake systems and their features, and describes the process of building data lake storage based on Alibaba Cloud OSS.

Efficient Data Lake Formation Based on JindoFS and OSS

This article explains the process of data lake formation based on Alibaba Cloud OSS and the JindoFS big data cache acceleration service.

JindoFS: Computing and Storage Separation for Cloud-native Big Data

In this blog, we'll introduce the origins of JindoFS and discuss the problems its

Build a Real-time Cloud Data Lake Based on Alibaba Cloud DLA and Apache Hudi

This article describes how to build a real-time data lake in the cloud based on Alibaba Cloud's Data Lake Analytics (DLA) and Apache Hudi.

Use Data Lake Analytics (DLA) to Analyze Data in MaxCompute External Tables

This article describes how to use Alibaba Cloud DLA to analyze data in MaxCompute external tables.

Performing Daily Incremental Upload from OSS to MaxCompute Using Data Integration

This tutorial describes how we can easily import data from OSS into MaxCompute on a daily basis with Data Integration.

Data Lake Acceleration in Data Lake Architecture

This article introduces the reasons for choosing data lake acceleration, and shares Alibaba Cloud's practical experience and technical solutions.

Data Lake: Concepts, Characteristics, Architecture, and Case Studies

This article provides deep insights into the data lake concept and compares some common solutions available in the market.

Alibaba Cloud Launches Enterprise-Level Cloud-Native Data Lake during 2020 Double 11

This article reviews Alibaba Cloud's enterprise-level cloud-native data lake solution launched during the Double 11 festival and discusses its key benefits.

Build a Cloud Data Lake Using E-MapReduce

This article is based on the enterprise data lake construction solution using E-MapReduce and customer best practices shared by Ziguan.