×
Data Analytics

Apache Flink FLIP-12: Asynchronous I/O

Follow the Apache Flink® Community for making Flink's External System Data Processing More Efficient.

Modern Web Scraping: Evolving with AI and Cloud Integration

This blog explores how modern web scraping, when combined with AI tools and cloud platforms, can power smarter automation, like building chatbots that...

vivo's Lakehouse Integration Practice Based on Paimon

This article is compiled from the presentation by Xu Yu, an internet big data expert at vivo and Apache Paimon Committer, during the Flink Forward Asia 2024 Streaming Lakehouse session (Part One).

FLIP-10: Unified Checkpoints and Savepoints

Follow the Apache Flink® Community for making Data Backup Simpler in Flink.

FLIP-9: Trigger Language

An Attempt at Defining a Rule Language for Flink Triggers

Apache Flink FLIP-8: Scalable Non-Partitioned State

Follow the Apache Flink® Community for making Non-Partitioned State Scalable in Flink.

Flink SQL 101: Embrace Unified Stream and Batch Processing

This article introduces Flink SQL, a unified stream-batch processing engine, focusing on key concepts like Stream-Table Duality, event time/watermarks.

Apache Flink FLIP-7: Visualizing Monitoring Metrics in Web UI

Follow the Apache Flink® Community for making Flink Metrics More Accessible Through Web UI Visualization.

Apache Flink Broadcast Variable Optimization: FLIP-5's Approach to Reducing Network Overhead

This is Technical Insights Series by Perry Ma | Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.

Future-Proofing Business with Cloud AI Tools and Employee Analytics

AI is becoming the foundation of future success. Companies that integrate AI-driven tools today aren’t just optimizing efficiency; they’re ensuring long-term resilience.

Streaming processing vs. Batch processing: A Comprehensive Guide to Choosing the Right Approach

This blog is written by Wencong Liu, a senior engineer of Alibaba Cloud's Realtime Compute for Apache Flink team.

Core Design and Scenario Applications of ApsaraDB for SelectDB Multi-Compute Cluster

This article discusses the origin and introduction of the multi-compute cluster architecture of ApsaraDB for SelectDB, a fully managed real-time data warehouse service.

Alibaba Cloud QuickBI Demo: Analyzing US Census Bureau Data Set

In this guide, we will demonstrate the features of dashboards and OLAP modelling of the US Census Bureau data set using Alibaba Cloud's QuickBI.

The Right Platform for Big Data Analytics

This post is going to share how important choosing the right platform for Big Data Analytics, Data governance

Data Visualization & Analytics with Alibaba Cloud QuickBI: From Setup to Optimization

This article guides you through the implementation of big data visualization solutions using QuickBI in conjunction with other Alibaba Cloud services.

ODPS SQL - Transpose Column to Row or Row to Column

This article describes how to use TRANS_ARRAY and LATERAL VIEW EXPLODE functions to transpose columns to rows in MaxCompute.

Understand Flink SQL: Real-Time SQL Query Execution for Stream and Batch Data

Discover Flink SQL, the high-level API for executing SQL queries across streaming and batch data sets in Apache Flink.

Change Data Capture (CDC) Made Easy- A Step-by-Step Guide with Debezium and Kafka

This article provides a detailed guide on implementing Change Data Capture (CDC) using Debezium and ApsaraMQ for Apache Kafka

Apache Flink Has Become the De Facto Standard for Stream Computing

This article is based on a keynote speech given by WANG Feng, initiator of Apache Flink Community China and head of Open-Source Big Data Platform at Alibaba Cloud, at Flink Forward Asia 2023.

DataV: Your Gateway to Powerful and Accessible Data Visualization

This article provides an overview of DataV, highlighting its core features, key benefits, standout functions, and its role as a powerful tool.