LocalFTW
Why Local
All Posts
Guides
Contribute
Clinic
Topic Graph
Bookmarks
Tagged "reasoning-optimization"
Llama 8B Matches 70B Performance on Multi-Hop QA Using Structured Prompting
22 March 2026
NVIDIA's Dynamic Memory Sparsification Cuts LLM Inference Costs by 8x
14 February 2026