Ray

Analyzing the Distributed Inference Process Using vLLM and Ray from the Perspective of Source Code

This article explores how to implement distributed inference with vLLM and Ray from a source code perspective.

Best Practices for Ray Clusters - Ray on ACK

This article describes how to set up a Ray cluster on Alibaba Cloud ACK and how the Ray autoscaler and the ACK autoscaler work together to provide elastic scaling.