Tagged "apple-silicon"

What Apple Knows About AI That Silicon Valley Won't Admit 31 May 2026
Apple Doubles Down on On-Device AI at WWDC 2026, Setting Privacy-First Strategy 30 May 2026
Samsung's Exynos 2800 Brings HBM Memory to Mobile AI, Enabling Faster Local Model Inference 26 May 2026
Apple's 2026 AI Strategy Prioritizes On-Device Model Deployment 25 May 2026
Why AI Hardware Is a Chip Layer Problem 24 May 2026
M5 Max MacBook Runs Local Large Language Models Efficiently 23 May 2026
AMD Unveils Ryzen AI Halo Developer Platform for On-Device AI Workloads 23 May 2026
Auditing Apple's DifferentialPrivacy.framework: Bugs, Misconfig, Practical Risks 21 May 2026
Samsung's Exynos 2800 Brings Significant On-Device AI Capabilities 18 May 2026
AMD's Lemonade SDK Advances macOS Support for Local AI Inference with ROCm 7.13 18 May 2026
Apple's M5 MacBook Air Advances On-Device AI with Redesigned Hardware 16 May 2026
Running AI Models Locally on M4 Processors with 24GB Memory 14 May 2026
Lucebox Brings Faster Local AI Inference to AMD Strix Halo 13 May 2026
Cotypist – AI Autocomplete for Mac 11 May 2026
Mlx-serve: Run LLMs Natively on Your Mac 10 May 2026
Perplexity Brings On-Device AI Workflow to Macs with 'Personal Computer' Feature 8 May 2026
On-Device AI Market Poised for Explosive Growth as Major Tech Companies Invest Heavily 6 May 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 5 May 2026
Google's Gemma 4 Brings Powerful AI Capabilities to Phones and Laptops 30 April 2026
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad 26 April 2026
Llama 4 Scout on MLX: The Complete Apple Silicon Guide (2026) 23 April 2026
Running Gemma 4 on an iPhone 13 Pro 15 April 2026
DFlash Doubles Token Generation Speed of Qwen3.5 27B on Mac M5 Max 15 April 2026
oMLX Framework Implements DFlash Attention for Optimized Inference 14 April 2026
MiniMax M2.7 Achieves SOTA Performance Under 64GB on Mac with TQ Quantization 14 April 2026
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon 12 April 2026
Parakeet Streaming ASR on Apple Silicon via CoreML 11 April 2026
AIYO Wisper: Local Voice-to-Text for macOS Using WhisperKit 11 April 2026
On-Device Apple Intelligence Vulnerable to Prompt Injection Attacks 10 April 2026
Running a 1.7B Parameters LLM on an Apple Watch 9 April 2026
Comprehensive Benchmark: 37 LLMs Tested on MacBook Air M5 With Open-Source Tool 7 April 2026
Google Launches Offline AI Dictation App for iOS with Gemma 7 April 2026
Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment 6 April 2026
Apple Brings Enhanced On-Device AI Features to iPhone 6 April 2026
Ollama Gets Blazing Fast on Macs with Full MLX Support and 2× Speedups 5 April 2026
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware 5 April 2026
Samsung Launches Galaxy Book6 Series with NVIDIA RTX 5070 and On-Device AI 4 April 2026
Mixed Precision Quantization on MLX with TurboQuant Implementation 4 April 2026
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference 4 April 2026
Gemma 4 KV Cache Memory Issues Fixed in llama.cpp 4 April 2026
April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini 3 April 2026
Google Gemma 4 Released with GGUF Quantizations 3 April 2026
Gemma 4 26B A4B Outperforms Qwen 3.5 35B on Apple Silicon 3 April 2026
Gemma 4 Makes Local AI Agents Practical 3 April 2026
Apfel – The Free AI Already on Your Mac 3 April 2026
Apple Silicon Macs Run Local AI Faster with Ollama's New MLX Support 2 April 2026
TinyGPU Adds Mac Support for External Nvidia GPU Acceleration 2 April 2026
Ollama Adopts Apple's MLX Framework for Faster Local AI on Mac 1 April 2026
Is Anyone Working on an AI Operating System? 1 April 2026
Select the Right Hardware for Your Local LLM Deployment with This Online Guide 30 March 2026
TurboQuant KV Cache Compression Achieves 22.8% Faster Decoding at 32K Context 28 March 2026
Qwen3 512k Context via TurboQuant on Mac mini 28 March 2026
M5 Max Delivers 1.7x Faster Inference Than M3 Max on Qwen 3.5 Models 28 March 2026
RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra 27 March 2026
mlx-Code: Run Claude Code Locally with MLX-LM 27 March 2026
Apple Gets Full Gemini Access and Uses Distillation to Build Lightweight On-Device AI 27 March 2026
Liquid AI's LFM2-24B Achieves 50 Tokens/Second in Web Browser via WebGPU 26 March 2026
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features 26 March 2026
Running an Open-Weight LLM Locally on an Apple Watch 25 March 2026
Ultra-Large 400B-Class LLM Runs on iPhone in Test 25 March 2026
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives 22 March 2026
Multi-Token Prediction support coming to MLX-LM for Qwen 3.5 21 March 2026
Apple M5 Max 128GB real-world performance benchmarks for local inference 21 March 2026
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide 21 March 2026
NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU 20 March 2026
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local) 16 March 2026
Startup Transforms Mac Mini Into Full-Powered AI Inference System With External GPU 15 March 2026
Local LLMs on Apple Silicon Mac 2026: M1 M2 M3 Guide 14 March 2026
Apple M5 Max 128GB Benchmark Results for Local LLM Inference 12 March 2026
Experiment: 0.8B Model Self-Improvement on MacBook Air Yields Surprising Results 11 March 2026
M5 Max and M5 Ultra Chipsets Demonstrate Significant Bandwidth Improvements for Local LLM Inference 10 March 2026
Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference 8 March 2026
Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment 6 March 2026
MediaTek Advances Omni Model for Efficient Smartphone Inference 5 March 2026
Apple Unveils MacBook Pro with M5 Pro and M5 Max Featuring On-Device AI 5 March 2026
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI 4 March 2026
Apple M5 Pro and M5 Max: 4× Faster LLM Processing 4 March 2026
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market 4 March 2026
VibeWhisper – macOS Voice-to-Text with 100% Local Processing Option 3 March 2026
Apple M4 iPad Air Targets AI Users with Double M1 Speed Performance 3 March 2026
Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17 3 March 2026
Running Local AI Models on Mac Studio 128GB: 4B, 20B & 120B Tested 2 March 2026
Apple Neural Engine Reverse-Engineered for Local Model Training on Mac Mini M4 2 March 2026
Show HN: Caret – Tab to Complete at Any App on Your Mac 27 February 2026
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required 26 February 2026
Apple: Python bindings for access to the on-device Apple Intelligence model 26 February 2026
Apple Accelerates U.S. Manufacturing with Mac Mini Production 24 February 2026
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio 23 February 2026
Nvidia Could Launch Its First Laptops With Its Own Processors 23 February 2026
AI-Powered Reverse-Engineering of Rosetta 2 for Linux 23 February 2026
Apple Researchers Develop On-Device AI Agent That Interacts With Apps for You 21 February 2026
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR 20 February 2026
Complete Offline AI System: Voice Control and Smart Home via Local LLM and Radio Without Internet 19 February 2026
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB 19 February 2026
GPT4All Replaces Ollama On Mac After Quick Trial 19 February 2026
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet 17 February 2026
Sourdine: Open-Source macOS App for 100% Local AI Transcription 16 February 2026
MiniMax Releases M2.5 Model with SOTA Coding and Agent Capabilities 14 February 2026
MiniMax-M2.5 230B MoE Model Released with GGUF Support for Local Deployment 14 February 2026