Stevessr 在 GitHub - LMCache/LMCache: Supercharge Your LLM with the Fastest KV Cache... 中发帖
🔥 Integration with vLLM v1 with the following features:
High performance CPU KVCache offloading
Disaggregated prefill
P2P KVCache sharing
LMCache is supported in the vLLM production stack, llm-d, and KServe
Stable support for non-prefix KV caches
Storage support as follows:
CPU
Disk
NIXL
Installation support through pip and latest vLLM