Kubernetes

Solving GPU Shortages in IDC with Alibaba Cloud ACK Edge and Virtual Nodes for DeepSeek Deployment

This article describes how to use ACK Edge and virtual nodes to meet the elasticity requirements of DeepSeek deployment.

OpenKruise v1.8 Unlocking Infinite Possibilities in Cloud-Native Application Management

This article introduces how OpenKruise v1.8 enhances cloud-native application management with new features for resource management and workload efficiency.

Service Mesh Disaster Recovery Scenarios (3): Use Service Mesh to Deal with Service-level Disaster Recovery

This article introduces the practice of using service mesh to deal with service-level disaster recovery.

Service Mesh Disaster Recovery Scenarios (2): Use Service Mesh to Deal with Zone-level Disaster Recovery

This article describes how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) to address zone-level disaster recovery.

Service Mesh Disaster Recovery Scenarios (1): Use Service Mesh to Deal with Region-level Disaster Recovery

This article introduces how to achieve automatic detection of region-level faults and traffic redirection based on ASM.

How ACK Edge Solves Challenges in Elasticity for LLM Inference Services

The hybrid cloud LLM elastic inference solution based on ACK Edge dynamically adjusts GPU resource use between the on-premises data center and the cloud to meet tidal inference traffic demands.

ACK One Registered Clusters Help Solve GPU Resource Shortage in Data Centers

With the help of ACK One registered clusters, we can make full use of the ACS GPU computing power of Alibaba Cloud to efficiently deploy the DeepSeek inference model.

An Automatic Scaling Solution for LLM Inference Services Based on Knative

This article describes an automatic scaling solution for LLM inference services based on Knative.

Apache Flink FLIP-6: Dynamic Resource Management for Optimized Cluster Deployment

This article is part of the Technical Insights Series by Perry Ma, Product Lead for Real-time Compute for Apache Flink at Alibaba Cloud.

ACK Container Storage Monitoring: Making Your Applications Run More Stably and Transparently

This article describes the upgraded container monitoring system of ACK, including the display and overview of major dashboard interfaces.

OpenYurt v1.6: Introduce Node-level Traffic Multiplexing Capability

The key features of OpenYurt v1.6 include node-level traffic multiplexing and enhanced edge autonomy.

IngressNightmare: Ingress Nginx Exposed Five More Security Vulnerabilities That Can Take Over Your K8s Cluster

This article discusses security vulnerabilities discovered in Ingress Nginx that can lead to unauthorized control over Kubernetes clusters.

A Guide to Deploy a Production Environment from a DeepSeek Distilled Model in ACK

This tutorial demonstrates how to use the vLLM framework to quickly deploy an inference service from the DeepSeek R1 model in ACK.

Alibaba Cloud ACK One: Registered Clusters Support ACS Computing Power

ACK One registered clusters support the computing power of ACS, which provides more choices and more powerful computing capabilities for enterprises' containerized workloads.

Nacos-Controller 2.0: Efficiently Manage Your K8s Configuration with Nacos

This article introduces Nacos-Controller 2.0, an advanced tool for managing Kubernetes configurations through Nacos.

Deploying a Scalable Laravel Application on Kubernetes

This article provides a guide to deploying a scalable Laravel application on Kubernetes.

Unifying Kubernetes Management with Alibaba Cloud ACK One

Alibaba Cloud ACK One is a distributed cloud container management platform that enables unified lifecycle management, traffic governance, application .

Kimi Large Model-based Massive Data Preprocessing Practice of Moonshot AI

This article introduces how Moonshot AI uses Alibaba Cloud's solutions to enhance data preprocessing for its large model, Kimi, focusing on stability, resource elasticity, and efficient management.

Koordinator v1.6: Supports Heterogeneous Resource Scheduling in AI/ML Scenarios

The article introduces Koordinator v1.6 as a tool that enhances heterogeneous resource scheduling capabilities in AI and machine learning scenarios.

Decipher the Open-source Serverless Container Framework: Event-driven

This article introduces the open-source serverless container framework Knative, emphasizing its event-driven capabilities for cloud-native applications.