Elastic Training

Koordinator Column 1: Viewing AI Computing Power's "Rigidity" and "Elasticity" through Gang Scheduling

This article traces the evolution of Gang Scheduling to analyze the balance between rigidity and elasticity in AI resource orchestration, the Kubernetes implementation of Gang Scheduling, and future trends.

Practices for Distributed Elasticity Training in the ACK Cloud-native AI Suite

This article introduces the practices and architectures for distributed elastic training in the Alibaba Cloud ACK cloud-native AI suite to enhance the eff...