This article describes how to use ACK Gateway with Inference Extension to optimize the performance of multi-node large-model inference.