LocalFTW
Why Local
All Posts
Guides
Contribute
Clinic
Topic Graph
Bookmarks
Tagged "kv-cache-quantization"
Community Converges on Optimal KV Cache Quantization Strategies for Qwen 3.5 Models
20 March 2026
Qwen3.5-35B RTX 5080 Experiments Confirm KV q8_0 as Free Lunch, Q4_K_M Remains Optimal
28 February 2026