LocalFTW
Why Local
All Posts
Guides
Contribute
Clinic
Topic Graph
Bookmarks
Tagged "algorithmic-optimization"
llama.cpp Merges Speculative Checkpointing for Major Inference Speed Boost
20 April 2026
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon
12 April 2026
Speculative Decoding Made My Local LLM Actually Usable
9 April 2026