SGLang Integration Performance Benchmarks

SGLang Integration Performance Benchmarks#

Benchmarks evaluating Mooncake’s integration with SGLang across PD disaggregation and HiCache hierarchical KV cache storage.

Document

Scenario

Key Findings

PD Disaggregation Performance

SGLang PD disaggregation with Mooncake Transfer Engine

1P1D PD disaggregation achieves approximately 30% lower ITL while maintaining comparable throughput against two regular instances.

HiCache with Mooncake Backend Benchmark

SGLang HiCache using Mooncake Store as L3 storage

Mooncake-backed HiCache improves prefill performance in multi-turn workloads by maintaining higher KV cache hit rates as conversation rounds grow.