Community

Blog Events Webinars Tutorials Forum

Create Account

×

KFServing

Building a Large Language Model Inference Service Optimized by TensorRT-LLM Based on KServe on ASM

This article introduces how to deploy optimized LLM model inference services in a cloud-native environment using the TensorRT-LLM-optimized Llama-2-hf model as an example.

Alibaba Container Service August 30, 2024 3,275

The Definition of the New Service Mesh-Driven Scenario: AI Model Services - Model Mesh

This article describes how to use Alibaba Cloud Service Mesh (ASM) and Alibaba Cloud Container Service for Kubernetes (ACK) for deployment.

Alibaba Container Service September 14, 2022 3,106

Related Tags

artificial intelligence big data cloud computing