Community

Blog Events Webinars Tutorials Forum

Create Account

×

EMR

Big Data & AI Platform Monthly Newsletter — April 2026

Brought to you by the Alibaba Cloud Big Data & AI Product Team.

Alibaba Cloud Big Data and AI May 15, 2026 938

Alibaba Cloud E-MapReduce: Serverless Open-Source Big Data Platform

Fully managed cloud-native big data platform for elastic Data+AI lakehouse solutions.

Alibaba Cloud Big Data and AI April 24, 2026 2,057

Use Cases for EMR Serverless Spark | Use the spark-submit CLI to Submit a Spark Job

This article describes how to use the spark-submit command line interface (CLI) to submit a Spark job after EMR Serverless Spark is connected to ECS.

Alibaba EMR November 22, 2024 4,395

EMR Serverless Spark: Using Realtime Compute for Apache Flink + Apache Paimon to Implement Batch and Streaming Integration

This article introduces a data processing workflow that integrates Realtime Compute for Apache Flink, EMR Serverless Spark, and Apache Paimon to enable real-time data ingestion.

Alibaba EMR November 14, 2024 3,621

Use Cases for EMR Serverless Spark | Use EMR Serverless Spark to Submit a PySpark Streaming Job

This artile introduces the usability and maintainability of EMR Serverless Spark in stream processing.

Alibaba EMR November 8, 2024 2,823

High-speed and Unified New Data Lakehouse Paradigm: Alibaba Cloud E-MapReduce Serverless StarRocks 3.x

This article is compiled from the first session of the EMR StarRocks online open class - EMR Serverless StarRocks3.

Alibaba EMR August 5, 2024 7,071

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

Alibaba EMR April 25, 2024 6,368

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

Apache Flink Community April 8, 2024 13,321

Observability | Key Metrics to Focus On When Using Prometheus to Monitor E-MapReduce

This article explains how to monitor big data in EMR using Prometheus Service.

Alibaba Cloud Native December 28, 2023 4,728

The Open-Source Folks Talk - Episode 4: Remain True to Original Aspirations in the Cloud-Native Age

The latest entry of the Open-Source Folks Talk discusses the history of the first Apache Incubation Project on Alibaba Cloud.

Alibaba Cloud Community March 9, 2023 4,731

Running Mapreduce Workload in Alibaba Cloud EMR Cluster

In this article, we’ll explain how to run map-reduce jobs in the Alibaba Cloud EMR Cluster.

GAVASKAR S June 21, 2023 3,287

Working with E-MapReduce in Alibaba Cloud

In this article, we'll introduce how to create an Alibaba Cloud EMR cluster step by step.

GAVASKAR S June 21, 2023 3,732

The Spark and Delta Lake Engine Enterprise Edition of Databricks Helps Efficiently Access Lake Houses

This article describes how to optimize the performance of the product features provided by the Enterprise Edition to help you efficiently access lake houses.

Alibaba EMR May 16, 2022 7,028

How to Build a Cloud-Native Open-Source Big Data Platform | Best Practices of InMobi

This article shares the best practices of InMobi based on the open-source big data service of Alibaba Cloud.

Alibaba EMR March 18, 2022 4,597

The Open-Source Real-Time Data Warehouse Solution Based on EMR OLAP - ClickHouse Transaction Implementation

This article describes the solution of an open-source real-time data warehouse based on EMR OLAP.

Alibaba EMR February 10, 2022 6,625

Setup EMR Yarn authentication using Active Directory with Apache Knox

A guide to configure integration between Alibaba Cloud EMR with Active Directory.

Alibaba Cloud Indonesia February 8, 2022 4,742

The Practice of Lake House in the FinTech Industry

This article explains the four stages of lake house evolution within the Shanghai Shuhe Group.

Alibaba Cloud MaxCompute December 22, 2021 5,192

Alibaba Cloud E-MapReduce vs AWS EMR vs. Azure HDInsight

Big Data is among the biggest IT trends of the last years. Maintaining a large infrastructure for analytics is a major challenge for Big Data.

Best Practices for Big Data Processing in Spark

This article is an overview of the best practices for big data processing in Spark taken from a lecture.

Alibaba EMR October 12, 2021 5,067

Alibaba Cloud BigData Pipeline 구축하기

이 블로그는 빅데이터 플랫폼 도입을 고려 중이고, 어떤 조합으로 시스템을 구축할지 고민이신 분들을 위해 알리바바가 제공하는 모든 서비스들을 나열해 놓고, 각 서비스들의 적용 가능한 시나리오와 서비스 도입 시 고려해야 할 점등을 설명합니다.

Haemi Kim September 15, 2021 4,410

Related Tags

artificial intelligence big data cloud computing