×

Alibaba Container Service

6456 Reputation

Container Service simplifies establishment of container management clusters and integrates Alibaba Cloud virtualization, storage, network, and security capabilities to create the optimal container running environment on the cloud.

Follow
Activities(50) Posts(210) Series(3) Areas of Expertise Following Followers
Areas of Expertise

Following (0)

See All

Followers (33)

See All

ACK Gateway with AI Extension: Intelligent Routing Practice for Kubernetes Large Model Inference

This article describes how to use the ACK Gateway with AI Extension plug-in to provide production-level load balancing and intelligent routing capabilities for QwQ-32B models deployed in ACK clusters.

ACK Gateway with AI Extension: Model Canary Release Practice for Large Model Inference

This article focuses on the canary release of models after the large model inference service is deployed in the cloud and the practices of model canary release based on ACK Gateway with AI Extension.

From Minutes to Seconds: Yahaha's Cloud-Native UE5 Game Practice Powered by OpenKruiseGame

The article introduces how Yahaha migrated its UE5 game STRIDEN to a cloud-native architecture powered by OpenKruiseGame.

OpenYurt Becomes a CNCF Incubating Project

The Cloud Native Computing Foundation (CNCF) Technical Oversight Committee (TOC) has voted to accept OpenYurt as an incubating project.

ACK One Multi-cluster Spark and AI Job Scheduling

This article presents ACK One's multi-cluster AI job scheduling solution that optimizes resource utilization by distributing Spark jobs across multipl...

Ray on Alibaba Cloud: Building an ML Platform

This blog briefly introduces Ray and KubeRay, along with the related efforts to support Ray on ACK.

Observability Principles and Best Practices of GPU-accelerated Edge Nodes

This article describes the observability principles and best practices of GPU-accelerated edge nodes connected to ACK Edge.

Solving GPU Shortages in IDC with Alibaba Cloud ACK Edge and Virtual Nodes for DeepSeek Deployment

This article describes how to use ACK Edge and virtual nodes to meet the elasticity requirements of DeepSeek deployment.

Quick Deployment of DeepSeek-R1 in Knative

This article describes how to deploy a DeepSeek-R1 inference service in Knative.

Service Mesh Disaster Recovery Scenarios (3): Use Service Mesh to Deal with Service-level Disaster Recovery

This article introduces the practice of using service mesh to deal with service-level disaster recovery.

Service Mesh Disaster Recovery Scenarios (2): Use Service Mesh to Deal with Zone-level Disaster Recovery

This article describes how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) to address zone-level disaster recovery.

Service Mesh Disaster Recovery Scenarios (1): Use Service Mesh to Deal with Region-level Disaster Recovery

This article introduces how to achieve automatic detection of region-level faults and traffic redirection based on ASM.

How ACK Edge Solves Challenges in Elasticity for LLM Inference Services

The hybrid cloud LLM elastic inference solution based on ACK Edge dynamically adjusts GPU resource use between the on-premises data center and the cloud to meet tidal inference traffic demands.

ACK One Registered Clusters Help Solve GPU Resource Shortage in Data Centers

With the help of ACK One registered clusters, we can make full use of ACS GPU computing power of Alibaba Cloud to efficiently deploy the DeepSeek inference model.

Use Alibaba Cloud ASM to Efficiently Manage LLM Traffic Part 2: Traffic Observability

This article describes how to use the enhanced capabilities provided by Alibaba Cloud Service Mesh (ASM) to flexibly and comprehensively observe LLM traffic in a cluster.

An Automatic Scaling Solution for LLM Inference Services Based on Knative

This article describes an automatic scaling solution for LLM inference services based on Knative.

Overview of ASM Data Plane Proxy Extension Capabilities

This article describes the various extension capabilities provided by the ASM data plane proxy, making it easier for you to choose the most suitable extension method to meet your business needs.

ACK Container Storage Monitoring: Making Your Applications Run More Stably and Transparently

This article describes the upgraded container monitoring system of ACK, including the display and overview of major dashboard interfaces.

Disaster Recovery Solutions Based on Multi-Cluster Gateways of ACK One

This article describes how to use the multi-cluster gateway ACK One to implement zone-disaster recovery of public cloud applications.

OpenYurt v1.6: Introduce Node-level Traffic Multiplexing Capability

The key features of OpenYurt v1.6 include node-level traffic multiplexing and enhanced edge autonomy.

Latest Comments

zonghe Commented on Ray on Alibaba Cloud: Building an ML Platform

good!

Santhakumar Munuswamy Commented on Fluid 1.0: Bridging the Last Mile for Efficient Cloud-Native Data Usage

Thank for sharing

Kidd Ip Commented on Connect Cluster Cost Analysis to an ACK Registered Cluster

Thank you for sharing, it is good to have cost analysis in K8S!

aNDREUET Commented on The Burgeoning Kubernetes Scheduling System – Part 1: Scheduling Framework

interesting article

5015614084248347 Commented on Knative on Alibaba Cloud: The Ultimate Serverless Experience

You might find many businesses deploying serverless computing out there. However, if you are looking for all-inclusiveness, DataVizz is your one-stop-solution. From Data Management