Alibaba EMR

3387 Reputation

EMR is an all-in-one, enterprise-ready big data platform that provides cluster, job, and data management services based on open-source ecosystems such as Hadoop, Spark, Kafka, Flink, and Storm.

Following (0)

Followers (7)

Use Cases for EMR Serverless Spark | Use the spark-submit CLI to Submit a Spark Job

This article describes how to use the spark-submit command line interface (CLI) to submit a Spark job after EMR Serverless Spark is connected to ECS.

Alibaba EMR November 22, 2024 4,589

EMR Serverless Spark: Using Realtime Compute for Apache Flink + Apache Paimon to Implement Batch and Streaming Integration

This article introduces a data processing workflow that integrates Realtime Compute for Apache Flink, EMR Serverless Spark, and Apache Paimon to enable real-time data ingestion.

Alibaba EMR November 14, 2024 3,793

Use Cases for EMR Serverless Spark | Use EMR Serverless Spark to Submit a PySpark Streaming Job

This article introduces the usability and maintainability of EMR Serverless Spark in stream processing.

Alibaba EMR November 8, 2024 2,927

High-speed and Unified New Data Lakehouse Paradigm: Alibaba Cloud E-MapReduce Serverless StarRocks 3.x

This article is compiled from the first session of the EMR StarRocks online open class - EMR Serverless StarRocks3.

Alibaba EMR August 5, 2024 7,738

Integration of Paimon and Spark - Part 2: Query Optimization

This article introduces the integration of Paimon and Spark, specifically focusing on query optimization.

Alibaba EMR April 25, 2024 6,638

Integration of Paimon and Spark - Part I

This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.

Alibaba EMR April 15, 2024 7,238

Data Lake Management and Optimization

This article was compiled from a speech by Qingwei Yang at the Alibaba Cloud Data Lake Technology Special Exchange Meeting on July 17, 2022.

Alibaba EMR February 20, 2023 4,074

Unified Metadata and Permissions for Data Lakes

This article was compiled from a speech by Xiong Jiashu at the Alibaba Cloud Data Lake Technology Special Exchange Meeting.

Alibaba EMR February 15, 2023 6,261

StarRocks x Flink CDC for End-to-End Real-Time Links

This article discusses real-time data warehouse construction and offers examples of using Flink CDC and StarRocks for real-time links and data updates.

Alibaba EMR January 10, 2023 11,556

Databricks Data Insight Open Course - Use Databricks + MLFlow to Train and Deploy Machine Learning Models

This article describes how to use Databricks and MLflow to build a machine learning lifecycle management platform.

Alibaba EMR January 10, 2023 5,027

Databricks Data Insight Open Course - An Introduction to Delta Lake (Open-Source Edition)

This part of the Databricks Data Insight Open Course article series introduces Delta Lake Basics (Open-Source Edition).

Alibaba EMR September 23, 2022 11,759

Databricks Data Insight Open Course - An Introduction to Delta Lake (Commercial Edition)

This part of the Databricks Data Insight Open Course article series introduces Delta Lake Basics (Commercial Edition).

Alibaba EMR September 13, 2022 4,634

Databricks Data Insight Open Course - How to Use Delta Lake to Build a Batch-Stream Unified Data Warehouse

This article discusses using Delta Lake to build a batch-stream unified data warehouse and putting it into practice.

Alibaba EMR September 2, 2022 7,740

Databricks Data Insight Open Course - An Evolution History and Current Situation of Delta Lake

This part of the Databricks Data Insight Open Course article series discusses how Delta Lake has evolved and where it stands today.

Alibaba EMR August 22, 2022 8,666

Data Lake Exploration – Delta Lake

This article explores Delta Lake and discusses the implementation of two solutions related to traditional data warehouses based on Hive tables.

Alibaba EMR July 20, 2022 5,789

New Features of Alibaba Cloud Remote Shuffle Service: AQE and Throttling

This article introduces the two latest important features of RSS: support for Adaptive Query Execution (AQE) and throttling.

Alibaba EMR July 18, 2022 4,967

The Principles of EMR StarRocks' Blazing-Fast Data Lake Analytics

This article focuses on the technology, performance, and future planning of StarRocks' blazing-fast data lake analytics.

Alibaba EMR May 20, 2022 13,971

The Spark and Delta Lake Engine Enterprise Edition of Databricks Helps Efficiently Access Lake Houses

This article describes how to optimize the performance of the product features provided by the Enterprise Edition to help you efficiently access lake houses.

Alibaba EMR May 16, 2022 7,241

Zuoyebang's Best Practices for Building Data Lakes Based on Delta Lake

This article aims to solve the performance problems of offline data warehouses (daily and hourly) during production and usage.

Alibaba EMR May 13, 2022 4,861

How to Build a Blazing-Fast Data Lake Analytics Engine

This article reveals the key technologies of the data lake analytics engine in detail and uses StarRocks to help users understand the architecture of the system.

Alibaba EMR April 21, 2022 5,888

Latest Comments

Santhakumar Munuswamy commented on High-speed and Unified New Data Lakehouse Paradigm: Alibaba Cloud E-MapReduce Serverless StarRocks 3.x

August 5, 2024 at 12:00 am

Thanks for the sharing

5260485642767126 commented on Using Data Preorganization for Faster Queries in Spark on EMR

March 31, 2020 at 12:00 am

Hey, Great post! I support online learning hence sharing one online learning platform BlueMap. Visit: www.bluemap.co BlueMap specialises in providing training and services for the IT community. We provide trainings in the field of IT Infrastructure to professionals around the world. Our training methodology focuses on maintaining the right blend of theory and practical with course material and lab guides carefully designed by our highly experienced trainers preparing professionals for real-world challenges. All courses provided by BlueMap help candidates apply knowledge to practice. Apart from training, we also provide hardware setup and software implementations of technologies we have expertise in.
