Community

Blog Events Webinars Tutorials Forum

Create Account

×

Apache Spark

Introduction to Unified Batch and Stream Processing of Apache Flink

Unified batch and stream processing of Flink is a well-established concept in the stream computing field.

Apache Flink Community July 18, 2024 311

Apache Paimon: Streaming Lakehouse is Coming

This article is based on the keynote speeches given by LI Jinsong, WU Xiangping, DI Xingxing, and WANG Yunpeng during Flink Forward Asia 2023.

Apache Flink Community July 5, 2024 3,065

Understanding Stream Processing: Real-Time Data Analysis and Use Cases

Learn about stream processing, its applications, challenges, and Alibaba Cloud's Realtime Compute for Apache Flink solution for real-time data analysis.

Apache Flink Community April 18, 2024 1,321

Running ODPS PySpark using CLI

In this article we will discuss about Spark in general, its uses in the Big Data workflow and how to configure and run Spark in the CLI mode for CI/CD purposes.

Alibaba Cloud Indonesia February 19, 2024 1,263

MaxCompute2.0 Performance Metrics: Faster, Stronger Computing

MaxCompute (originally ODPS) is a Big Data processing platform used for batch structural data storage and processing, to provide massive data warehouse solutions and data modeling.

Alibaba Clouder March 1, 2018 26,906

In-depth Review of Apache Spark: Spark + AI Summit 2020

Matei Zaharia, founder of the Spark project, gave an in-depth review of Spark at the Spark + AI Summit 2020 in conjunction with its 10-year anniversary.

Alibaba EMR April 2, 2021 2,220

The Discovery of a Promising Technology

In this article, Zhang Jianfeng, a veteran in the open-source community, explains how to evaluate whether the technology is worth learning using three key dimensions.

Apache Flink Community November 6, 2020 2,604

Using Apache Spark for Data Processing and Analysis

In this article, you will learn to accelerate your data processing and analysis across Apache Spark Relational Cache, Mesos, Akka, Cassandra, and Kafka.

Alibaba Clouder July 21, 2020 4,657

Spark-TFRecord: Toward Full Support of TFRecord in Spark

In this post, we will introduce Spark-TFRecord, a new solution to enable support for native TensorFlow data format in Spark.

Alibaba EMR June 28, 2020 2,181

Eight Things You Should Know about Big Data

As a senior technical expert at Alibaba Group, I will share my thoughts on what there is to say about big data, past, present, future.

zjffdu October 24, 2019 20,632

Alibaba Cloud Security Team Discovers Apache Spark Rest API Remote Code Execution (RCE) Exploit

This article describes the discovery of the first "in-the-wild" Spark Rest API Remote Code Execution (RCE) vulnerability made by Fengwei Zhang and the team at Alibaba Cloud Security on July 7, 2018.

Alibaba Cloud Security July 31, 2018 25,436

A Quick Guide to Analyzing Apache Logs on Alibaba Cloud Log Service

Alibaba Cloud Apache Log Service, there are several methods available for you to collect upstream data.

Alibaba Clouder March 23, 2018 20,248

How to Create Virtual Cloud Desktop using Apache Guacamole

Apache Guacamole is a free and open source web application which lets you access your dashboard using a modern web browser.

Alibaba Clouder March 28, 2018 42,761

Related Tags

artificial intelligence big data cloud computing