×
Kubernetes

Cloud-native AI Engineering Practice: Accelerating LLM Inference with FasterTransformer

This article demonstrates how to use FasterTransformer to accelerate inference on the ACK container service, using the Bloom7B1 model as an example.

Demystify the Practice of Large Language Models: Exploring Distributed Inference

This article uses the Bloom7B1 model as an example to demonstrate the distributed inference method for large language models in ACK.

KServe + Fluid Accelerates Big Model Inference

This article explores how to implement the KServe big model inference in Alibaba Cloud Container Service for Kubernetes (ACK).

Deploy Enterprise-level AI Applications Based on Alibaba Cloud Serverless Container Service

This article describes how to deploy enterprise-level AI applications based on Alibaba Cloud Serverless Container Service.

In-depth Analysis of Traffic Isolation Technology of Online Application Nodes

This article discusses the need for traffic isolation in scenarios where abnormal Pod behavior affects service quality.

Born for Data Elasticity, Alibaba Cloud Cloud-native Storage Speeds Up Again

This article explores challenges faced by enterprises running AI and big data applications on Kubernetes, focusing on the decoupling of computing and storage architecture.

Best Practices of Koordinator: CPU Orchestration

This article focuses on how Koordinator helps facilitate the sharing of CPU resources between different types of workloads.

How Does DeepSpeed + Kubernetes Easily Implement Large-Scale Distributed Training?

This article describes how to build and run DeepSpeed distributed training tasks based on the cloud-native AI suite of ACK.

Use OpenKruise to Implement End-to-end Canary Release Based on Higress

This article describes how to use OpenKruise to build automated O&M.

Resolving "Address not available" Issues in a Container Environment

This article describes how the author solve the "Address not available" issues in a container environment.

How to Quickly Deploy AI Inference Services Based on ACK Serverless

This article describes how to quickly deploy AI inference services based on ACK Serverless.

Deploy TiDB on Alibaba Cloud Serverless Kubernetes (ASK)

This article describes how to deploy a TiDB database on Alibaba Cloud Serverless Kubernetes (ASK).

New Milestone of KubeVela: A Cloud-Native Application Platform with United Delivery and Day-2 Management

This article discusses the announcement of the latest KubeVela upgrades during the 2022 Apsara Conference.

An Exploration and Improvement of Dubbo in Proxyless Mesh Mode

This article discusses the advantages, deficiencies, and broad market prospects of Dubbo and Proxyless Service Mesh.

Enable ARMS Prometheus for a Registered Kubernetes Cluster

This article describes how to enable ARMS Prometheus for a registered Kubernetes cluster by deploying the application in Alibaba Cloud ACK.

O&M and Component Installation for the Registered Cluster of ACK One

This article gives a rundown on O&M and component installation for the registered cluster of ACK One (with examples).

Use CLI to Install an ACK Registered Cluster

This article explains how to use CLI to install an ACK registered cluster (with examples and FAQs).

Analysis of Alibaba Cloud Container Network Data Link (6): ASM Istio

Part 6 of this series mainly introduces the forwarding links of data plane links in ASM Istio mode.

DevOps là gì ?

DevOps là sự kết hợp của từ Development (phát triển tính năng sản phẩm) + Operations (vận hành)

Design and Implementation of Seata-go TCC

This article introduces the design idea, exception handling, and practical use of TCC in Seata-go.