LocalFTW
Why Local
All Posts
Guides
Contribute
Clinic
Topic Graph
Bookmarks
Tagged "llm-benchmarking"
Comprehensive Benchmark: 37 LLMs Tested on MacBook Air M5 With Open-Source Tool
7 April 2026
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
5 April 2026
YC-Bench: GLM-5 Matches Claude Opus 4.6 at 11× Lower Cost
4 April 2026
Forensic Beats Mem0 with 90.1% on LOCOMO Benchmark
28 March 2026