Demystify the Practice of Large Language Models: Exploring Distributed Inference

This article uses the Bloom7B1 model as an example to demonstrate the distributed inference method for large language models in ACK.

How Does DeepSpeed + Kubernetes Easily Implement Large-Scale Distributed Training?

This article describes how to build and run DeepSpeed distributed training tasks based on the cloud-native AI suite of ACK.