×
cloud native

Caching is Efficiency: Achieving Precise LLM Cache Hits with Alibaba Cloud ACK GIE

This article introduces ACK GIE's precision-mode prefix cache-aware routing that maximizes KV-Cache hit rates for distributed LLM inference.

ACK One Fleet Multi-Cluster Canary Release: A "Safety Valve" for AI Inference Services

This article introduces ACK One Fleet's multi-cluster canary release solution, integrated with Kruise Rollout, for safe AI inference deployments across hybrid and geo-distributed clouds.

Intelligent Scheduling for AI Inference: Cluster-Level Priority Elastic Scheduling

This article introduces ACK One Fleet's priority elastic scheduling for AI inference across hybrid and cross-region multi-cluster environments.

When Agents Meet Workflows—Can Intelligence Become More Controllable?

This article introduces how combining LLM Agents with deterministic Workflows like Argo enables controllable, production-ready AI systems.

How to Claim PolarDB Resources for Free

This article discusses the benefits of PolarDB and explains how to claim resources for free.

Higress Has Supported the New Gateway API and Its AI Inference Extension

This article introduces the feature upgrade of Higress and provide detailed operation guidance.

Koordinator Column 1: Viewing AI Computing Power's "Rigidity" and "Elasticity" through Gang Scheduling

This article traces Gang Scheduling's evolution to analyze the rigidity-elasticity balance in AI resource orchestration, its Kubernetes implementation, and future trends.

From Symptoms to Root Causes: How MetricSet Explorer Reinvents the Metric Analysis Experience

This article introduces MetricSet Explorer, a metric analysis platform that shifts from passive display to proactive root cause discovery via intellig.

Nginx Ingress Replacement Option

This article introduces Alibaba Cloud's three Nginx Ingress replacement options—ALB, MSE, and APIG Ingress.

UModel Explorer: Redefining Observability Data Modeling with a Graphical Approach

This article introduces UModel Explorer, a visual modeling interface that enables intuitive drag-and-drop construction of observability data models.

Breaking Through the Key Bottlenecks in Observability: Ultimate Integration of Entities and Relationships

This article introduces EntityStore's graph querying in UModel, shifting observability from isolated entities to relationship-aware topology analysis.

MSE Nacos Prompt Management: Making the Core Configuration of AI Agent Truly Governable

This article introduces MSE Nacos Prompt Management, which governs AI Agent prompts as dynamic configuration assets with centralized storage, versioning, and hot updates.

Intelligently Detect Exceptions with One Line of Code: UModel PaaS API Architecture Design and Best Practices

This article introduces a unified UModel PaaS API that abstracts complex observability data access into simple one-line queries for intelligent exception detection.

The More the Agent Is Used, the Smarter It Becomes? The AgentScope Java Online Training Plugin is Here!

This article introduces AgentScope Java's online training plugin for self-evolving AI agents using real production data and Trinity-RFT reinforcement learning.

Migrated 60+ Ingress Resources in 30 Minutes Using AI - My Ingress Nginx to Higress Journey

This article introduces an AI-assisted migration workflow using OpenClaw to safely transition 60+ Ingress NGINX resources to Higress in 30 minutes.

Alibaba Cloud Tair KVCache Manager: Architecture Design and Implementation of Enterprise-Level Global KVCache Management Service

This article introduces the architecture and implementation of Tair KVCache Manager, an open-source enterprise-grade global KVCache management service for scalable Agentic AI inference.

Kubernetes Has Officially Announced Again, Emphasizing the Immediate Migration of Ingress NGINX

The Kubernetes Steering Committee and Security Response Committee have once again emphasized the immediate migration of Ingress NGINX.

Alibaba Cloud Upgrades Flagship Database PolarDB with AI-Ready Capabilities

Alibaba Cloud has unveiled AI Lakebase architecture alongside a suite of upgrades for its flagship database, PolarDB at PolarDB Developer Conference in China.

From System Monitoring to Business Insights: A Comprehensive Analysis of the Custom Metric Collection Feature of ARMS

This article introduces Alibaba Cloud ARMS' custom metric collection capability.

The Game-changing O&M Console for Anolis OS Is Live

The article introduces Alibaba Cloud’s OS Console, a game-changing O&M platform for Anolis OS that enables one-click diagnosis of hidden memory issues.