Big Data

Blogs of the Week – Ep. 13, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the thirteenth episode of Blogs of the Week in 2024.

A Distributed Computing Idea of Global Dictionary Index Based on ODPS SQL

This article introduces a method that leverages distributed computing resources to calculate the global dictionary index.

Data Visualization with Alibaba Cloud ECS: From Installation to Data Analysis

This article guides you through setting up an Alibaba Cloud ECS Windows 2022 server, deploying MySQL and Grafana for data visualization using GUI-base...

Blogs of the Week – Ep. 12, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the twelfth episode of Blogs of the Week in 2024.

Alibaba Cloud Data Lake: The Smart Choice for Modern Data Management

Alibaba Cloud Data Lake provides a robust, secure, and cost-effective solution for modern data management, addressing the limitations of traditional on-premises data lake systems.

Blogs of the Week – Ep. 11, 2024

Each week, we compile the hottest and most impactful topics. Let’s look at the eleventh episode of Blogs of the Week in 2024.

Big Data Cloud Fighter Bootcamp

This article introduces the Big Data Cloud Fighters bootcamp, which provides an intensive, hands-on experience in mastering big data principles and technologies.

High-speed and Unified New Data Lakehouse Paradigm: Alibaba Cloud E-MapReduce Serverless StarRocks 3.x

This article is compiled from the first session of the EMR StarRocks online open class - EMR Serverless StarRocks3.

Distributed Pandas Processing with MaxCompute MaxFrame

This article introduces how to use common Pandas operators with MaxFrame.

Blogs of the Week – Ep. 10, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the tenth episode of Blogs of the Week in 2024.

Hands-on Labs | Get Started with Flink MySQL Connector in 5 Minutes

This step-by-step tutorial shows how to get started with the Flink MySQL Connector in 5 minutes.

Use Kibana for Querying and Visualizing SLS Data

This article introduces how to use Kibana to connect to the SLS ES-compatible API for query and analysis.

Technical Principle of Hologres Binlog

This article gives an overview of the implementation principles and best practices of Hologres Binlog.

Hologres Technology: Extreme Analysis Performance of JSON Semi-structured Data

This article describes the technical principles behind Hologres' JSONB support and highlights its analysis performance on semi-structured JSON data.

Introduction to MaxCompute's Unified Near Real-time Data Processing Architecture

This article introduces how the new MaxCompute-based architecture, which integrates offline and near real-time processing, supports comprehensive business scenarios.

In-depth Application of Flink in Ant Group Real-time Feature Store

This article is based on the keynote speech on AI feature engineering given by ZHAO Liangxingyun, a senior technical expert of Ant Group, during Flink Forward Asia 2023.

Life of an SQL Task

This article outlines the SQL statement execution process, offering insights and guidance for newcomers to big data development.

Data Lake for Stream Computing: The Evolution of Apache Paimon

Uncover the advancements from Apache Hive to Hudi and Iceberg in stream computing, as our expert navigates the transformative landscape of real-time data lakes.

Blogs of the Week – Ep. 9, 2024

Each week, we compile the hottest and most impactful topics. Let’s take a look at the ninth episode of Blogs of the Week in 2024.

Understand Flink SQL: Real-Time SQL Query Execution for Stream and Batch Data

Discover Flink SQL, the high-level API for executing SQL queries across streaming and batch data sets in Apache Flink.