Community

Blog Events Webinars Tutorials Forum

Create Account

×

E-MapReduce

Alibaba Cloud E-MapReduce: Serverless Open-Source Big Data Platform

Fully managed cloud-native big data platform for elastic Data+AI lakehouse solutions.

Alibaba Cloud Big Data and AI April 24, 2026 1,906

Unlocking the Power of Big Data: The Strategic Benefits of Alibaba Cloud's Comprehensive Big Data Platform Integration

This article introduces how enterprises can integrate Alibaba Cloud's comprehensive big data platform into multi-cloud and hybrid environments to achi...

Kidd Ip July 31, 2025 4,177

Use Cases for EMR Serverless Spark | Use the spark-submit CLI to Submit a Spark Job

This article describes how to use the spark-submit command line interface (CLI) to submit a Spark job after EMR Serverless Spark is connected to ECS.

Alibaba EMR November 22, 2024 4,311

EMR Serverless Spark: Using Realtime Compute for Apache Flink + Apache Paimon to Implement Batch and Streaming Integration

This article introduces a data processing workflow that integrates Realtime Compute for Apache Flink, EMR Serverless Spark, and Apache Paimon to enable real-time data ingestion.

Alibaba EMR November 14, 2024 3,548

Use Cases for EMR Serverless Spark | Use EMR Serverless Spark to Submit a PySpark Streaming Job

This artile introduces the usability and maintainability of EMR Serverless Spark in stream processing.

Alibaba EMR November 8, 2024 2,773

High-speed and Unified New Data Lakehouse Paradigm: Alibaba Cloud E-MapReduce Serverless StarRocks 3.x

This article is compiled from the first session of the EMR StarRocks online open class - EMR Serverless StarRocks3.

Alibaba EMR August 5, 2024 6,824

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

Alibaba EMR April 25, 2024 6,259

Building a Streaming Lakehouse: Performance Comparison Between Paimon and Hudi

This article compares the performance of Paimon and Hudi on Alibaba Cloud EMR and explores their respective roles in building quasi-real-time data warehouses.

Apache Flink Community April 8, 2024 13,124

Observability | Key Metrics to Focus On When Using Prometheus to Monitor E-MapReduce

This article explains how to monitor big data in EMR using Prometheus Service.

Alibaba Cloud Native December 28, 2023 4,670

Alibaba Cloud Open Source Big Data Platform | E-MapReduce

In this episode, we will introduce Alibaba Cloud Open Source Big Data Platform, Elastic MapReduce.

Alibaba Cloud Data Intelligence July 31, 2023 3,630

Running Mapreduce Workload in Alibaba Cloud EMR Cluster

In this article, we’ll explain how to run map-reduce jobs in the Alibaba Cloud EMR Cluster.

GAVASKAR S June 21, 2023 3,247

Working with E-MapReduce in Alibaba Cloud

In this article, we'll introduce how to create an Alibaba Cloud EMR cluster step by step.

GAVASKAR S June 21, 2023 3,683

Storage Policies and Read/Write Optimization in JindoFS

This article describes common problems and optimization methods of data read/write in computing-storage separation scenarios, and introduces data cache acceleration with JindoFS.

Alibaba EMR March 1, 2021 3,786

Data Lake Management and Optimization

This article was compiled from a speech from Qingwei Yang at the Alibaba Cloud Data Lake Technology Special Exchange Meeting on July 17, 2022.

Alibaba EMR February 20, 2023 3,898

Unified Metadata and Permissions for Data Lakes

This article was compiled from a speech from Xiong Jiashu at the Alibaba Cloud Data Lake Technology Special Exchange Meeting.

Alibaba EMR February 15, 2023 5,871

StarRocks x Flink CDC for End-to-End Real-Time Links

This article discusses real-time data warehouse construction and offers examples of using Flink CDC and StarRocks for real-time links and data updates.

Alibaba EMR January 10, 2023 10,860

Databricks Data Insight Open Course - Use Databricks + MLFlow to Train and Deploy Machine Learning Models

This article describes how to use Databricks and MLflow to build a machine learning lifecycle management platform.

Alibaba EMR January 10, 2023 4,804

Databricks Data Insight Open Course - An Introduction to Delta Lake (Open-Source Edition)

This part of the Databricks Data Insight Open Course article series introduces Delta Lake Basics (Open-Source Edition).

Alibaba EMR September 23, 2022 11,310

Databricks Data Insight Open Course - An Introduction to Delta Lake (Commercial Edition)

This part of the Databricks Data Insight Open Course article series introduces Delta Lake Basics (Commercial Edition).

Alibaba EMR September 13, 2022 4,451

Databricks Data Insight Open Course - How to Use Delta Lake to Build a Batch-Stream Unified Data Warehouse

This article discusses using Delta Lake to build a batch-stream unified data warehouse and putting it into practice.

Alibaba EMR September 2, 2022 6,930

Related Tags

artificial intelligence big data cloud computing