Mooncake Performance Benchmarks#
Benchmarks evaluating Mooncake Store’s core storage, allocation, and cache hierarchy behavior.
Document |
Area |
Key Findings |
|---|---|---|
Mooncake Store storage |
Measures end-to-end storage performance across Mooncake Store operations and deployment configurations. |
|
Segment allocation |
The optimized OffsetAllocator significantly improves utilization for uniform-size LLM KV cache allocation patterns. |
|
Allocation routing |
Compares random and free-ratio-first allocation across segments, replicas, skewed capacity, and DSA-style KV+indexer workloads. |
|
Cache hierarchy |
SSD offload extends the KV cache hierarchy with NVMe, reducing the performance cliff after DRAM cache capacity is exhausted in long multi-turn conversations. |