This article explores how to implement distributed inference with vLLM and Ray from a source code perspective.