This article uses the Bloom7B1 model as an example to demonstrate the distributed inference method for large language models in ACK.