KServe

A Guide to Deploy a Production Environment from a DeepSeek Distilled Model in ACK

This tutorial demonstrates how to use the vLLM framework to quickly deploy an inference service for the DeepSeek R1 model in ACK.

Alibaba Container Service April 17, 2025 1,460

Building a Large Language Model Inference Service Optimized by TensorRT-LLM Based on KServe on ASM

This article introduces how to deploy optimized LLM inference services in a cloud-native environment, using the TensorRT-LLM-optimized Llama-2-hf model as an example.

Alibaba Container Service August 30, 2024 3,234

Best Practices for Large Model Inference in ACK: TensorRT-LLM

This article uses the Llama-2-7b-hf model as an example to demonstrate how to deploy the Triton framework using KServe in Alibaba Cloud ACK.

Alibaba Container Service July 24, 2024 4,059

Model Service Mesh: Model Service Management in Cloud-native Scenario

This article introduces Model Service Mesh, an architectural pattern for deploying and managing scalable machine learning model services in a distributed environment.

Alibaba Cloud Native February 20, 2024 4,061

KServe + Fluid Accelerates Big Model Inference

This article explores how to implement large model inference with KServe in Alibaba Cloud Container Service for Kubernetes (ACK).

Alibaba Cloud Native Community September 20, 2023 4,462

How to Quickly Deploy AI Inference Services Based on ACK Serverless

This article describes how to quickly deploy AI inference services based on ACK Serverless.

Alibaba Cloud Native September 11, 2023 2,853

Istio Ecosystem on ASM (3): Integrate KServe into Alibaba Cloud Service Mesh

Part 3 of this 3-part series discusses how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) for deployment.

Alibaba Cloud Native October 9, 2022 3,235

The Definition of the New Service Mesh-Driven Scenario: AI Model Services - Model Mesh

This article describes how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) for deployment.

Alibaba Container Service September 14, 2022 3,088
