×

Alibaba Container Service

6478 Reputation

Container Service simplifies establishment of container management clusters and integrates Alibaba Cloud virtualization, storage, network, and security capabilities to create the optimal container running environment on the cloud.

Follow
Activities(50) Posts(228) Series(3) Areas of Expertise Following Followers
Areas of Expertise

Following (0)

See All

Followers (33)

See All

Caching is Efficiency: Achieving Precise LLM Cache Hits with Alibaba Cloud ACK GIE

This article introduces ACK GIE's precision-mode prefix cache-aware routing that maximizes KV-Cache hit rates for distributed LLM inference.

ACK One Fleet Multi-Cluster Canary Release: A "Safety Valve" for AI Inference Services

This article introduces ACK One Fleet's multi-cluster canary release solution, integrated with Kruise Rollout, for safe AI inference deployments across hybrid and geo-distributed clouds.

Intelligent Scheduling for AI Inference: Cluster-Level Priority Elastic Scheduling

This article introduces ACK One Fleet's priority elastic scheduling for AI inference across hybrid and cross-region multi-cluster environments.

When Agents Meet Workflows—Can Intelligence Become More Controllable?

This article introduces how combining LLM Agents with deterministic Workflows like Argo enables controllable, production-ready AI systems.

Koordinator Column 1: Viewing AI Computing Power's "Rigidity" and "Elasticity" through Gang Scheduling

This article traces Gang Scheduling's evolution to analyze the rigidity-elasticity balance in AI resource orchestration, its Kubernetes implementation, and future trends.

Container Technology Evolution for LLMs and AI Agents

The article outlines how container technology is advancing to support LLMs and AI agents across data processing, training, inference, and deployment.

How ASM Ambient Mode Innovates Kubernetes Egress Traffic Management

ASM Ambient mode simplifies Kubernetes egress traffic management through Waypoint proxies, significantly reducing configuration complexity.

Best Practices for Ray on ACK: Secure Deployment of AI Data Processing/Training/Inference Environments

This article provides best practices for securely deploying and operating Ray on Alibaba Cloud ACK for AI data processing, training, and inference environments.

Alibaba Cloud Service Mesh Supports Ambient Mode

This article introduces how Alibaba Cloud Service Mesh (ASM) now supports Ambient Mode.

How to Use Istio Service Mesh on ACK Clusters Through Compute Nest

The article provides a guide on using Istio Service Mesh within Alibaba Cloud Container Service for Kubernetes (ACK) clusters through Compute Nest.

Compute Nest Uses Helm Hooks to Deploy Helm Charts in ACK Clusters

This article describes how to deploy a Helm chart to an ACK cluster in Compute Nest.

Use Flux CD to Deploy a Helm Chart in an ACK Cluster Through Compute Nest

The article explains how to use Flux CD to deploy a Helm chart in an Alibaba Cloud Container Service for Kubernetes (ACK) cluster within Compute Nest.

How to Deploy FlowiseAI with One Click on Alibaba Cloud?

On Alibaba Cloud, you can quickly deploy FlowiseAI by using a one-click deployment link and start using it with a few simple configuration steps.

Simplified Deployment Tutorial of the Qwen3 LLM on Alibaba Cloud Container Service for Kubernetes

The article explains how to deploy the Qwen3 large language model on Alibaba Cloud ACK and ACS serverless GPU resources.

Transformation from a Single Cluster to Multiple Clusters: Multi-cluster Application Distribution of ACK One

The article shows how Alibaba Cloud ACK One transforms a single-cluster Kubernetes deployment into a multi-cluster continuous-delivery system through its application-distribution features.

StrmVol Volumes: Boosting Kubernetes Object Storage Performance for Small Files

Alibaba Cloud ACK supports StrmVol volumes. Based on underlying virtual block devices and the file system in kernel mode, this significantly reduces the access latency of massive small files.

ACK AI Profiling: An Analysis of the Problem from Black Box to Transparency

This article describes how to troubleshoot a problem from pre-troubleshooting to in-depth analysis using AI Profiling, culminating in problem resolution and business execution analysis.

ACK Gateway with Inference Extension: A Practice for Optimizing Large Model Inference Service Deployed across Multiple Nodes

This article introduces how to use ACK Gateway with Inference Extension to optimize multi-node large-model inference performance.

ACK Gateway with AI Extension: Intelligent Routing Practice for Kubernetes Large Model Inference

This article describes how to use the ACK Gateway with AI Extension plug-in to provide production-level load balancing and intelligent routing capabilities for QwQ-32B models deployed in ACK clusters.

ACK Gateway with AI Extension: Model Canary Release Practice for Large Model Inference

This article focuses on the canary release of models after the large model inference service is deployed in the cloud and the practices of model canary release based on ACK Gateway with AI Extension.

Latest Comments

zonghe Commented on Ray on Alibaba Cloud: Building an ML Platform

good!

Santhakumar Munuswamy Commented on Fluid 1.0: Bridging the Last Mile for Efficient Cloud-Native Data Usage

Thank for sharing

Kidd Ip Commented on Connect Cluster Cost Analysis to an ACK Registered Cluster

Thank you for sharing, it is good to have cost analysis in K8S!

aNDREUET Commented on The Burgeoning Kubernetes Scheduling System – Part 1: Scheduling Framework

interesting article

5015614084248347 Commented on Knative on Alibaba Cloud: The Ultimate Serverless Experience

You might find many businesses deploying serverless computing out there. However, if you are looking for all-inclusiveness, DataVizz is your one-stop-solution. From Data Management