Big Data

Seeding Digital Transformation in Southeast Asia – Part 2: Challenges and Focus Areas

Part 2 of this 8-part series discusses the first set of challenges business owners face while strategizing digital transformation for their enterprises and the focus points for a strong strategy.

Seeding Digital Transformation in Southeast Asia – Part 8: Introducing AI for Better Workflow

Part 8 of this 8-part series discusses AI and how it adds to the overall value of your organization’s steps towards digital transformation.

Efficient Data Lake Formation Based on JindoFS and OSS

This article explains how to form a data lake using Alibaba Cloud OSS and the JindoFS big data cache acceleration service.

All-in-one Lake Migration of Multiple Data Sources

This article briefly discusses the Alibaba Cloud Data Lake Formation (DLF) service and explains how it solves the challenges of migrating data from heterogeneous data sources into a data lake.

JindoFS Cache-based Acceleration for Machine Learning Training in a Data Lake

This article explains how the JindoFS cache-based acceleration service improves machine learning training speed in a data lake.

DataWorks: A Platform for Developing and Governing a Data Lake

This article briefly discusses Alibaba Cloud's big data platform, DataWorks, and explains how it solves the common challenges of a data lake.

Big Data Made Simpler with E-MapReduce – Part 1

Part 1 of this 2-part series discusses how E-MapReduce provides a simple and highly effective big data practice.

"50-Year-Old" Doraemon "Meets" "63-Year-Old" Casio: Alibaba Cloud Data Mid-End Platform Shows What Happens Next

This article discusses the Alibaba Cloud Data Mid-End Platform and how collaborators like Casio utilize it.

The Flink Ecosystem: A Quick Start to PyFlink

This article will introduce PyFlink's architecture and provide a quick demo in which PyFlink is used to analyze CDN logs.

Fluid: An Important Piece for Big Data and AI to Embrace Cloud-Native

This blog introduces Fluid and discusses how it can efficiently drive the operations of big data and AI applications in cloud-native scenarios.

How Big Data is Helping Logistics Firms Fly High

Find out how big data is driving the logistics and transport industry forward in our Global Industry Best Practices for Logistics and Transportation whitepaper.

5 Steps to Successfully Manage Peaks in Online Demand

Read this blog to learn how Alibaba Cloud tackled the challenges of managing massive peaks in online demand for the Double 11 Shopping Festival.

JindoFS: Computing and Storage Separation for Cloud-native Big Data

In this blog, we'll introduce the origins of JindoFS and discuss the problems its

In-depth Review of Apache Spark: Spark + AI Summit 2020

Matei Zaharia, founder of the Spark project, gave an in-depth review of Spark at the Spark + AI Summit 2020 in conjunction with its 10-year anniversary.

Introducing Intelligent Retail: The Next Revolution for E-Commerce

In this blog, we'll examine the basics of intelligent retail and introduce Alibaba Cloud's range of intelligent solutions for retailers.

Alibaba Cloud: The Only Cloud Provider in China Named as Forrester's "Global Cloud Data Warehouse Strong Performer"

This article reviews Alibaba Cloud's recent accolades from Forrester through its flagship cloud data warehouse service, MaxCompute.

Big Data Made Simpler with E-MapReduce – Part 2

Part 2 of this 2-part series discusses E-MapReduce cluster management and how it works in various real-world usage scenarios.

How Does Flink Maintain Rapid Development Even as One of the Most Active Apache Projects?

This article mainly introduces the current development and future plans of Flink as a unified stream-batch processing engine.

Application of Apache Flink in Real-time Financial Data Lake

This article provides an in-depth introduction to the architecture, application, and best practices of real-time financial data lakes at Zhongyuan Bank.

Integrating Serverless with SaaS-based Cloud Data Warehouses

This article describes business scenarios and resource usage requirements of modern cloud data warehouses and analyzes different resource delivery modes.