Stevessr posted in "GitHub - LMCache/LMCache: Supercharge Your LLM with the Fastest KV Cache..."

🔥 Integration with vLLM v1 with the following features:

High-performance CPU KVCache offloading
Disaggregated prefill
P2P KVCache sharing


LMCache is supported in the vLLM production stack, llm-d, and KServe
Stable support for non-prefix KV caches
Supported storage backends:

CPU
Disk
NIXL


Installation through pip, with support for the latest vLLM
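
For anyone who wants to try the pip install and the vLLM v1 integration mentioned above, here is a minimal sketch of the commands involved. It only builds and prints the command lines; it does not run them. It assumes the PyPI package is named `lmcache`, that vLLM accepts a `--kv-transfer-config` JSON argument, and that the connector is called `LMCacheConnectorV1` with role `kv_both`. The model name is a hypothetical placeholder, so check the project docs for your versions before relying on any of this.

```python
import json
import shlex

# Assumption: the package is published on PyPI as "lmcache".
PIP_INSTALL = ["pip", "install", "lmcache", "vllm"]


def build_vllm_serve_command(model: str) -> list[str]:
    """Build a `vllm serve` command line that enables an LMCache connector.

    Assumption: the connector name and role below match the installed
    vLLM/LMCache versions; they are not taken from the post itself.
    """
    kv_transfer_config = {
        "kv_connector": "LMCacheConnectorV1",
        "kv_role": "kv_both",
    }
    return [
        "vllm",
        "serve",
        model,
        "--kv-transfer-config",
        json.dumps(kv_transfer_config),
    ]


if __name__ == "__main__":
    # "your-org/your-model" is a hypothetical placeholder model name.
    print(shlex.join(PIP_INSTALL))
    print(shlex.join(build_vllm_serve_command("your-org/your-model")))
```

Building the argument list in Python and quoting it with `shlex.join` avoids hand-escaping the JSON string in a shell.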