Tagged "on-device-deployment"

NVIDIA and Microsoft Team Up to Bring Secure On-Device AI Agents to Windows PCs 2 June 2026
JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks 2 June 2026
Lenovo Bets on On-Device AI to Lift Business PC Upgrades 28 May 2026
I Quit ChatGPT for a Free, Private, and Local AI Called Ollama – Here's Why 27 May 2026
Why Your Docker Container Is 1.2GB When It Should Be 80MB 24 May 2026
llama.cpp MTP Leak Fix Stabilizes Local AI Agents 22 May 2026
llama.cpp Adds Multi-Token Prediction, Doubles Qwen 3.6B Throughput for Local Inference 19 May 2026
Chrome Silently Installs 4GB AI Model Without User Permission 12 May 2026
Small On-Device AI Model Beats Claude Sonnet 4.5 and GPT-5 10 May 2026
Mlx-serve: Run LLMs Natively on Your Mac 10 May 2026
Chrome's On-Device AI Features Consuming 4GB of Storage for Gemini Nano 9 May 2026
Lemonade Gives AMD Startups a Wider Path to Local Inference 9 May 2026
How to make SSE token streams resumable, cancellable, and multi-device 7 May 2026
Ask HN: Real life autonomous AI Agents 7 May 2026
Show HN: Desktop Agent Center – Local AI Automation via Hotkeys 7 May 2026
On-Device AI Market Poised for Explosive Growth as Major Tech Companies Invest Heavily 6 May 2026
llama.cpp Now Supports Multi-Token Prediction in Beta 5 May 2026
Chrome LLM Prompt API Raises Local Deployment Questions 30 April 2026
AI Quota Inflation Is No Token Effort. It's Baked In 20 April 2026
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful 19 April 2026
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw 18 April 2026
Self-Hosted LLM Took Personal Knowledge Management System to the Next Level 13 April 2026
On-Device AI: Achieving Powerful AI Capabilities Without Internet Connectivity 12 April 2026
CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI 10 April 2026
Running AI Natively on Windows 11 Using an eGPU 7 April 2026
DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026 30 March 2026
Local AI Ecosystem Extends Far Beyond Ollama 29 March 2026
Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware 27 March 2026
Pluggable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations 26 March 2026