Tagged "mlx-framework"
- I Replaced My Local LLM With a Model Half Its Size and Got Better Results
- Llama 4 Scout on MLX: The Complete Apple Silicon Guide (2026)
- DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon
- Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference
- Apple Silicon Macs Run Local AI Faster with Ollama's New MLX Support
- mlx-Code: Run Claude Code Locally with MLX-LM
- Google TurboQuant: Extreme Compression for Local LLM Deployment