The article unveils Higress’s new Wasm Plugin Server, an HTTP-based tool that streamlines private deployment and management of Higress Wasm plugins.
The Cloud Native Computing Foundation (CNCF) Technical Oversight Committee (TOC) has voted to accept OpenYurt as an incubating project.
This article introduces Alibaba Cloud ACS, a fully managed serverless Kubernetes service with instant elasticity and pay-as-you-go pricing.
This article discusses Sealos' optimization journey for gateway performance, detailing their transition from Nginx Ingress to Higress and their in-depth performance improvements using Istio and Envoy.
This article presents ACK One's multi-cluster AI job scheduling solution that optimizes resource utilization by distributing Spark jobs across multipl...
This article introduces how Moonshot AI uses Alibaba Cloud's solutions to enhance data preprocessing for its large model, Kimi, focusing on stability, resource elasticity, and efficient management.
This article describes the observability principles and best practices of GPU-accelerated edge nodes connected to ACK Edge.
This article describes how to use ACK Edge and virtual nodes to meet the elasticity requirements of DeepSeek deployment.
This article introduces how OpenKruise v1.8 enhances cloud-native application management with new features for resource management and workload efficiency.
This article introduces the practice of using service mesh to deal with service-level disaster recovery.
This article describes how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) to address zone-level disaster recovery.
This article introduces how to achieve automatic detection of region-level faults and traffic redirection based on ASM.
The hybrid cloud LLM elastic inference solution based on ACK Edge dynamically adjusts GPU resource use between the on-premises data center and the cloud to meet tidal inference traffic demands.
With the help of ACK One registered clusters, we can make full use of ACS GPU computing power of Alibaba Cloud to efficiently deploy the DeepSeek inference model.
This article describes an automatic scaling solution for LLM inference services based on Knative.
This is Technical Insights Series by Perry Ma | Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.
This article describes the upgraded container monitoring system of ACK, including the display and overview of major dashboard interfaces.
The key features of OpenYurt v1.6 include node-level traffic multiplexing and enhanced edge autonomy.
This article discusses discovered security vulnerabilities in Ingress Nginx that can lead to unauthorized control over Kubernetes clusters.
This tutorial demonstrates how to use the vLLM framework to quickly deploy an inference service from the DeepSeek R1 model in ACK.