×
Hive

Data Lake: How to Explore the Value of Data Using Multi-engine Integration

This article briefly discusses the metadata service and multi-engine support capabilities of the Alibaba Cloud Data Lake Formation (DLF) service.

Implementation and Challenges of Data Lake Metadata Services

This article explains the benefits, architecture, and implementation challenges of data lake metadata services.

Introduction to SQL in Flink 1.11

This article introduces the major changes and new features of Flink 1.11

Flink 1.11: An Engine with Unified SQL Support for Batch and Streaming Data

This article introduces the enhanced capabilities of Flink 1.11 to support SQL to process batch and streaming data

The New Major Features of Flink 1.11.0

One of the release managers of Flink 1.11.0 shares his deep insights into the long-awaited features and explains them from different perspectives.

So How Did Flink Double Its GitHub Stars in Just One Year?

Read on to see exactly what happened to Flink in 2019, in particular how Alibaba has contributed to Flink.

Architecture Evolution and Application Scenarios of Real-time Warehouses in the Cainiao Supply Chain

In this blog, we'll discuss the evolution of Cainiao's Flink implementation solution and supply chain data in terms of real-time data technology architecture.

OPPO's Use of Flink-based Real-time Data Warehouses

This article covers the evolution of the OPPO real-time data warehouse and development of Flink SQL.

Netflix: Evolving Keystone to an Open Collaborative Real-time ETL Platform

This article briefly introduces Netflix's data platform team and its key product, Keystone.

Meituan-Dianping's Use of Flink-based Real-time Data Warehouse Platforms

In this article, Lu Hao of Meituan-Dianping shares the company's practices using the Flink-based real-time data warehouse platform.

Architecture and Practices of Bilibili's Real-time Platform

This article introduces the architecture and practices of the Bilibili's Saber real-time computing platform by considering the pain points of real-time computing.

Trillions of Bytes of Data Per Day! Application and Evolution of Apache Flink in Kuaishou

This article introduces the technical evolution of Apache Flink during its application in Kuaishou and Kuaishou's future plans regarding Apache Flink.

Lyft's Large-scale Flink-based Near Real-time Data Analytics Platform

This blog shares how Lyft built a large-scale near real-time data analytics platform based on Apache Flink.

Sneak Peek: Apache Flink 1.11 Is Coming Soon!

This article describes the new features, improvements, and important changes of Flink 1.11 and Flink's future development plans.

The Run-In Period for Flink and Hive

Jason addresses the bugs and compatibility issues with Flink-Hive by operating on a Hive database using Flink SQL to demonstrate some of the features provided.

Hive Finally Has Flink!

Jason introduces the architecture of Hive integration in Flink, discusses problems, and how to solve them.

A Deep Dive into Apache Flink 1.11: Stream-Batch Integrated Hive Data Warehouse

Li Jinsong and Li Rui, Alibaba Technical Experts, talk about the features, revisions, and improvements of Apache Flink 1.11.

How to Synchronize Data from Hive to MaxCompute?

This article discusses how to migrate data from Hive to MaxCompute using MaxCompute Migration Assist (MMA) and its functions, technical architecture, and implementation principles.

Flink 1.10 vs. Hive 3.0 - A Performance Comparison

This blog compares the performance of Flink 1.10 against Hive 3.0 using the TPC-DS Benchmark 10-TB dataset and 20 hosts to test 3 engines.

Using Hive in Apache Flink 1.9

This article describes the integration of Hive with Apache Flink 1.9.0 and discusses this feature from the perspectives of design architecture, the latest progress, and usage instructions.