Tagged "apple-silicon"
- Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives
- Multi-Token Prediction support coming to MLX-LM for Qwen 3.5
- Apple M5 Max 128GB real-world performance benchmarks for local inference
- DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
- NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU
- Dictare – Open-source Voice Layer for AI Coding Agents (100% Local)
- Startup Transforms Mac Mini Into Full-Powered AI Inference System With External GPU
- Local LLMs on Apple Silicon Mac 2026: M1 M2 M3 Guide
- Experiment: 0.8B Model Self-Improvement on MacBook Air Yields Surprising Results
- M5 Max and M5 Ultra Chipsets Demonstrate Significant Bandwidth Improvements for Local LLM Inference
- Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference
- Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment
- MediaTek Advances Omni Model for Efficient Smartphone Inference
- Apple Unveils MacBook Pro with M5 Pro and M5 Max Featuring On-Device AI
- Apple M5 Pro and M5 Max: 4× Faster LLM Processing
- AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market
- VibeWhisper – macOS Voice-to-Text with 100% Local Processing Option
- Apple M4 iPad Air Targets AI Users with Double M1 Speed Performance
- Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17
- Running Local AI Models on Mac Studio 128GB: 4B, 20B & 120B Tested
- Apple Neural Engine Reverse-Engineered for Local Model Training on Mac Mini M4
- Show HN: Caret – Tab to Complete at Any App on Your Mac
- Researchers Develop Persistent Memory System for Local LLMs—No RAG Required
- Apple: Python bindings for access to the on-device Apple Intelligence model
- Apple Accelerates U.S. Manufacturing with Mac Mini Production
- Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
- Nvidia Could Launch Its First Laptops With Its Own Processors
- AI-Powered Reverse-Engineering of Rosetta 2 for Linux
- Apple Researchers Develop On-Device AI Agent That Interacts With Apps for You
- PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR
- Complete Offline AI System: Voice Control and Smart Home via Local LLM and Radio Without Internet
- Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
- GPT4All Replaces Ollama On Mac After Quick Trial
- Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
- Sourdine: Open-Source macOS App for 100% Local AI Transcription
- MiniMax Releases M2.5 Model with SOTA Coding and Agent Capabilities
- MiniMax-M2.5 230B MoE Model Released with GGUF Support for Local Deployment