LocalFTW
Why Local
All Posts
Guides
Contribute
About
Clinic
Trends
Bookmarks
Tagged "hybrid-inference"
Krasis Hybrid MoE Runtime Achieves 3,324 tok/s Prefill on Single RTX 5080
28 February 2026