Community

Blog Events Webinars Tutorials Forum

Create Account

×

Hive

Enable Hive to Write and Read Data from Alibaba Cloud Elasticsearch using ES-Hadoop

In this guide, we'll dive deep into leveraging ES-Hadoop to enable Hive to write data to and read from Alibaba Cloud Elasticsearch, transforming your data analytics operations.

Data Geek May 13, 2024 1,001

Learning about Distributed Systems – Part 18: Run AND Write Fast

Part 18 of this series explains how to improve application development efficiency on distributed systems.

Alibaba Cloud_Academy July 24, 2023 3,218

A Method for MaxCompute-UNION Data Type Alignment

This short article discusses conversion issues in UNION and problem-solving tactics.

Alibaba Cloud MaxCompute August 15, 2022 2,639

Big Data Q&A - Friday Blog, Week 65

Friday Q&A is back! Let's take a look at some of the many very interesting questions I was asked during Alibaba Cloud training sessions this week!

JDP June 17, 2022 2,985

The Principles of EMR StarRocks' Blazing-Fast Data Lake Analytics

This article focuses on the technology, performance, and future planning of StarRocks' blazing-fast data lake analytics.

Alibaba EMR May 20, 2022 11,608

Zuoyebang's Best Practices for Building Data Lakes Based on Delta Lake

This article aims to solve the performance problems of offline data warehouses (daily and hourly) during production and usage.

Alibaba EMR May 13, 2022 4,238

Best Practices for Big Data Processing in Spark

This article is an overview of the best practices for big data processing in Spark taken from a lecture.

Alibaba EMR October 12, 2021 4,736

Use Flink Hudi to Build a Streaming Data Lake

This article introduces the optimization and evolution of Flink Hudi's original mini-batch-based incremental computing model through stream computing.

Apache Flink Community September 26, 2021 6,680

Application of Delta Lake in Soul

This article explains the background of Delta Lake along with practices, problems, and solutions.

Alibaba EMR July 19, 2021 3,485

Data Lake: How to Explore the Value of Data Using Multi-engine Integration

This article briefly discusses the metadata service and multi-engine support capabilities of the Alibaba Cloud Data Lake Formation (DLF) service.

Alibaba EMR May 10, 2021 5,070

Implementation and Challenges of Data Lake Metadata Services

This article explains the benefits, architecture, and implementation challenges of data lake metadata services.

Alibaba EMR May 6, 2021 13,439

Introduction to SQL in Flink 1.11

This article introduces the major changes and new features of Flink 1.11

Apache Flink Community February 19, 2021 8,606

Flink 1.11: An Engine with Unified SQL Support for Batch and Streaming Data

This article introduces the enhanced capabilities of Flink 1.11 to support SQL to process batch and streaming data

Apache Flink Community February 19, 2021 6,463

The New Major Features of Flink 1.11.0

One of the release managers of Flink 1.11.0 shares his deep insights into the long-awaited features and explains them from different perspectives.

Apache Flink Community November 6, 2020 6,159

So How Did Flink Double Its GitHub Stars in Just One Year?

Read on to see exactly what happened to Flink in 2019, in particular how Alibaba has contributed to Flink.

Apache Flink Community September 27, 2020 10,107

Architecture Evolution and Application Scenarios of Real-time Warehouses in the Cainiao Supply Chain

In this blog, we'll discuss the evolution of Cainiao's Flink implementation solution and supply chain data in terms of real-time data technology architecture.

Apache Flink Community September 27, 2020 5,703

OPPO's Use of Flink-based Real-time Data Warehouses

This article covers the evolution of the OPPO real-time data warehouse and development of Flink SQL.

Apache Flink Community September 27, 2020 5,473

Netflix: Evolving Keystone to an Open Collaborative Real-time ETL Platform

This article briefly introduces Netflix's data platform team and its key product, Keystone.

Apache Flink Community September 27, 2020 12,337

Meituan-Dianping's Use of Flink-based Real-time Data Warehouse Platforms

In this article, Lu Hao of Meituan-Dianping shares the company's practices using the Flink-based real-time data warehouse platform.

Apache Flink Community September 27, 2020 7,753

Architecture and Practices of Bilibili's Real-time Platform

This article introduces the architecture and practices of the Bilibili's Saber real-time computing platform by considering the pain points of real-time computing.

Apache Flink Community September 27, 2020 12,930

Related Tags

artificial intelligence big data cloud computing