Kubernetes

How ACK Edge Solves Challenges in Elasticity for LLM Inference Services

The hybrid cloud LLM elastic inference solution based on ACK Edge dynamically adjusts GPU resource use between the on-premises data center and the cloud to meet tidal inference traffic demands.

ACK One Registered Clusters Help Solve GPU Resource Shortage in Data Centers

With the help of ACK One registered clusters, we can make full use of Alibaba Cloud ACS GPU computing power to efficiently deploy the DeepSeek inference model.

An Automatic Scaling Solution for LLM Inference Services Based on Knative

This article describes an automatic scaling solution for LLM inference services based on Knative.

Apache Flink FLIP-6: Dynamic Resource Management for Optimized Cluster Deployment

This is part of the Technical Insights Series by Perry Ma, Product Lead, Real-time Compute for Apache Flink at Alibaba Cloud.

ACK Container Storage Monitoring: Making Your Applications Run More Stably and Transparently

This article describes the upgraded container monitoring system of ACK, including the display and overview of major dashboard interfaces.

OpenYurt v1.6: Introduce Node-level Traffic Multiplexing Capability

The key features of OpenYurt v1.6 include node-level traffic multiplexing and enhanced edge autonomy.

IngressNightmare: Ingress Nginx Exposed Five More Security Vulnerabilities That Can Take Over Your K8s Cluster

This article discusses security vulnerabilities discovered in Ingress Nginx that can lead to unauthorized control over Kubernetes clusters.

A Guide to Deploy a Production Environment from a DeepSeek Distilled Model in ACK

This tutorial demonstrates how to use the vLLM framework to quickly deploy an inference service from the DeepSeek R1 model in ACK.

Alibaba Cloud ACK One: Registered Clusters Support ACS Computing Power

ACK One registered clusters support the computing power of ACS, which provides more choices and more powerful computing capabilities for enterprises' containerized workloads.

Nacos-Controller 2.0: Efficiently Manage Your K8s Configuration with Nacos

This article introduces Nacos-Controller 2.0, an advanced tool for managing Kubernetes configurations through Nacos.

Deploying a Scalable Laravel Application on Kubernetes

The article provides a guide on deploying a scalable Laravel application on Kubernetes.

Unifying Kubernetes Management with Alibaba Cloud ACK One

Alibaba Cloud ACK One is a distributed cloud container management platform that enables unified lifecycle management, traffic governance, application...

Kimi Large Model-based Massive Data Preprocessing Practice of Moonshot AI

This article introduces how Moonshot AI uses Alibaba Cloud's solutions to enhance data preprocessing for its large model, Kimi, focusing on stability, resource elasticity, and efficient management.

Koordinator v1.6: Supports Heterogeneous Resource Scheduling in AI/ML Scenarios

The article introduces Koordinator v1.6 as a tool that enhances heterogeneous resource scheduling capabilities in AI and machine learning scenarios.

Decipher the Open-source Serverless Container Framework: Event-driven

This article introduces the open-source serverless container framework Knative, emphasizing its event-driven capabilities for cloud-native applications.

Running a Docker in Docker (DinD) on Alibaba Cloud

This article provides a comprehensive tutorial on running Docker in Docker (DinD) environments on Alibaba Cloud's ECS and Alibaba Cloud ACK.

JoinQuant Insights: Why Do Quantitative Researchers Tend to Use Fluid to Simplify Data Management on Kubernetes?

This article explains why quantitative researchers prefer using Fluid to simplify data management and enhance efficiency on Kubernetes.

Best Practices for Kubernetes Migration: Flexible Management of Resource Backup for Application Recovery

This article introduces ACK Backup Center, a Kubernetes disaster recovery and migration solution that simplifies cross-cluster application restoration...

Backup Center Helps Enterprises Migrate Kubernetes Container Service Platforms Across Clouds

This article introduces Alibaba Cloud ACK backup center and uses a technology company's migration challenges to demonstrate how it effectively assists...

Alibaba Cloud Unveils ACS for International Customers to Revolutionize Workload Deployment

Industry-leading cloud-native container service allows scaling on demand to optimize resources while reducing overall computing costs by up to 55%.