LocalFTW
Why Local
All Posts
Guides
Contribute
About
Clinic
Bookmarks
Tagged "moe-model-deployment"
Krasis: Hybrid CPU/GPU MoE Runtime Achieves 3,324 Tokens/Second Prefill on RTX 5080
28 February 2026