ACK

Best Practices for Large Model Inference in ACK: TensorRT-LLM

This article uses the Llama-2-7b-hf model as an example to demonstrate how to deploy the Triton framework using KServe in Alibaba Cloud ACK.

Development Practice of ACK Serverless: On-demand Use of Heterogeneous Resources

This article introduces the development practices of using Kubernetes to deliver Serverless capabilities and on-demand utilization of heterogeneous resources such as GPUs.

ACK Cloud Native AI Suite | Training and Inference of Open-Source Large Models on Kubernetes

The sixth episode of the ACK Cloud Native AI Suite series introduces how to train open-source foundation models and run inference on them with the ACK Cloud-Native AI suite.

ACK Cloud Native AI Suite | Scaling Distributed Elastic Training for Large Models

The fifth episode of the ACK Cloud Native AI Suite series introduces how to perform large-scale distributed elastic training with the ACK Cloud-Native AI suite.

ACK Cloud Native AI Suite | Elastic Acceleration of Generative AI Model Inference with Fluid

The fourth episode of the ACK Cloud Native AI Suite series introduces Fluid, the data orchestration and acceleration engine in the ACK Cloud-Native AI suite.

Cloud Native Game Solution | Deploying PvE Zone server Games with OpenKruiseGame

The fourth episode of the cloud native game solution series introduces how to implement PvE zone-server games in a cloud-native way with OpenKruiseGame (OKG).

ACK Cloud Native AI Suite | Efficiently Scheduling Large Scale AI Big Data Tasks on Kubernetes

The third episode of the ACK Cloud Native AI Suite series introduces how the ACK Cloud-Native AI suite efficiently schedules AI and big data tasks.

ACK Cloud Native AI Suite | Simplifying GPU Cluster Operations and Improving GPU Utilization

The second episode of the ACK Cloud Native AI Suite series introduces how to simplify GPU cluster operations and improve GPU resource utilization with the ACK Cloud-Native AI suite.

ACK Cloud Native AI Suite | Implementing Cloud Native AI based on Kubernetes

The first episode of the ACK Cloud Native AI Suite series introduces Alibaba Cloud's Cloud-Native AI Suite.

Cloud Native Game Solution | Global Delivery and O&M Management Via ACK One + OpenKruiseGame

The sixth episode of the cloud native game solution series introduces global delivery and O&M management of game servers based on ACK One + OKG.

Cloud Native Game Solution | Migrating H5 and Comprehensive Games to the Cloud with OpenKruiseGame

The fifth episode of the cloud native game solution series introduces how to implement H5 games and comprehensive games in a cloud-native way with OKG.

Cloud Native Game Solution | Implementing PvP Session Based Games with OpenKruiseGame

The third episode of the cloud native game solution series explores the cloud-native transformation of PvP session-based games with OKG.

Cloud Native Game Solution | OpenKruiseGame for Game Workloads

The second episode of the cloud native game solution series introduces OpenKruiseGame, a cloud-native workload for games, and how it meets the needs of complex game server architectures.

Cloud Native Game Solution | Cost Efficiency and DevOps Boost Optimization in the Gaming Industry

The first episode of the cloud native game solution series introduces how cloud-native technology helps the gaming industry reduce costs and improve efficiency.

Utilize Terraform to Install Alibaba Cloud Container for Kubernetes (ACK)

This step-by-step tutorial shows how to use Terraform to install Alibaba Cloud Container Service for Kubernetes (ACK).

Alibaba Cloud ACK One: Quickly Build A Zone-disaster Recovery System with Multi-cluster Gateways

This article introduces the ACK One multi-cluster gateways and their benefits in implementing zone-disaster recovery for multi-cluster applications.

Serverless Cost Optimization: Knative Supports Preemptible Instances

This article describes how to use preemptible instances in Knative.

Interview Questions We've Learned Over the Years: The Distributed System

This article is part of a series on interview questions for engineers, with a specific emphasis on distributed systems.

Deploy Magento on Alibaba Cloud Container Service for Kubernetes (ACK)

This article focuses on deploying Magento on Alibaba Cloud Container Service for Kubernetes (ACK).

ACK One Argo Workflows: Implementing Dynamic Fan-out/Fan-in Task Orchestration

This article explains how to use Argo Workflows to orchestrate dynamic DAG fan-out/fan-in tasks.