Tagged "mlx"

Samsung's Exynos 2800 Brings HBM Memory to Mobile AI, Enabling Faster Local Model Inference 26 May 2026
Why AI Hardware Is a Chip Layer Problem 24 May 2026
M5 Max MacBook Runs Local Large Language Models Efficiently 23 May 2026
Chrome Is Quietly Downloading a 4GB AI Model Without Your Permission 19 May 2026
Samsung's Exynos 2800 Brings Significant On-Device AI Capabilities 18 May 2026
The Time Bomb Went Off: AI's All-You-Can-Eat Era Just Ended in Real Time 18 May 2026
Offline Voice-to-Text and AI Keyboard App for Local Processing 16 May 2026
Chrome Silently Downloads 4GB Gemini Nano Model Without User Consent 16 May 2026
Apple's M5 MacBook Air Advances On-Device AI with Redesigned Hardware 16 May 2026
Running AI Models Locally on M4 Processors with 24GB Memory 14 May 2026
Avocado Studio: Open-Source AI Content Editor for Next.js Sites 14 May 2026
Lython: Experimental Python Compiler Toolchain Based on LLVM 11 May 2026
Cotypist – AI Autocomplete for Mac 11 May 2026
Mlx-serve: Run LLMs Natively on Your Mac 10 May 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 5 May 2026
Google's Gemma 4 Brings Powerful AI Capabilities to Phones and Laptops 30 April 2026
NVIDIA Nemotron 3 Nano Omni Powers Multimodal Agent Reasoning in a Single Efficient Open Model 29 April 2026
I Replaced My Local LLM With a Model Half Its Size and Got Better Results 24 April 2026
Llama 4 Scout on MLX: The Complete Apple Silicon Guide (2026) 23 April 2026
DFlash Doubles Token Generation Speed of Qwen3.5 27B on Mac M5 Max 15 April 2026
Sovereign AI: Why the Next GPT Will Be Born in Our Living Rooms 14 April 2026
oMLX Framework Implements DFlash Attention for Optimized Inference 14 April 2026
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon 12 April 2026
Comprehensive Benchmark: 37 LLMs Tested on MacBook Air M5 With Open-Source Tool 7 April 2026
Qwen 3.6 Free Model Available via OpenRouter 5 April 2026
Ollama Gets Blazing Fast on Macs with Full MLX Support and 2× Speedups 5 April 2026
Mixed Precision Quantization on MLX with TurboQuant Implementation 4 April 2026
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference 4 April 2026
Apple Silicon Macs Run Local AI Faster with Ollama's New MLX Support 2 April 2026
Ollama Adopts Apple's MLX Framework for Faster Local AI on Mac 1 April 2026
Is Anyone Working on an AI Operating System? 1 April 2026
Select the Right Hardware for Your Local LLM Deployment with This Online Guide 30 March 2026
TurboQuant KV Cache Compression Achieves 22.8% Faster Decoding at 32K Context 28 March 2026
M5 Max Delivers 1.7x Faster Inference Than M3 Max on Qwen 3.5 Models 28 March 2026
mlx-Code: Run Claude Code Locally with MLX-LM 27 March 2026
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features 26 March 2026
Google TurboQuant: Extreme Compression for Local LLM Deployment 25 March 2026
Qualcomm and Samsung's 30-Year AI Alliance Enters a New Phase as On-Device AI Chip Race Heats Up 21 March 2026
Multi-Token Prediction support coming to MLX-LM for Qwen 3.5 21 March 2026
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options 20 March 2026
Snapdragon 8 Elite Gen 5 Hands the Galaxy S26 the AI Upgrade We've Been Waiting For 18 March 2026
Kimi Introduces Attention Residuals: 1.25x Compute Performance at <2% Overhead 17 March 2026
LoKI – Local AI Assistant for Linux and WSL 16 March 2026
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local) 16 March 2026
AMD Declares 'AI on the PC Has Crossed an Important Line' – Agent Computers as Next Breakthrough 16 March 2026
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms 15 March 2026
Startup Transforms Mac Mini Into Full-Powered AI Inference System With External GPU 15 March 2026
Local LLMs on Apple Silicon Mac 2026: M1 M2 M3 Guide 14 March 2026
SK Hynix Completes Qualification for LPDDR6 Memory Optimized for AI Inference 11 March 2026
Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference 8 March 2026
Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment 6 March 2026
Apple Unveils MacBook Pro with M5 Pro and M5 Max Featuring On-Device AI 5 March 2026
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI 4 March 2026
Apple M4 iPad Air Targets AI Users with Double M1 Speed Performance 3 March 2026
Running Local AI Models on Mac Studio 128GB: 4B, 20B & 120B Tested 2 March 2026
Qualcomm Launches Snapdragon Wear Elite for On-Device AI on Wearables 2 March 2026
Apple Neural Engine Reverse-Engineered for Local Model Training on Mac Mini M4 2 March 2026
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices 25 February 2026
How AI is Redefining Price and Performance in Modern Laptops 25 February 2026
Apple Accelerates U.S. Manufacturing with Mac Mini Production 24 February 2026
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio 23 February 2026
Future of Mobile AI: What On-Device Intelligence Means for App Developers 23 February 2026