Tagged "on-device-deployment"
- NVIDIA and Microsoft Team Up to Bring Secure On-Device AI Agents to Windows PCs
- JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks
- Lenovo Bets on On-Device AI to Lift Business PC Upgrades
- I Quit ChatGPT for a Free, Private, and Local AI Called Ollama – Here's Why
- Why Your Docker Container Is 1.2GB When It Should Be 80MB
- llama.cpp MTP Leak Fix Stabilizes Local AI Agents
- llama.cpp Adds Multi-Token Prediction, Doubles Qwen 3.6B Throughput for Local Inference
- Chrome Silently Installs 4GB AI Model Without User Permission
- Small On-Device AI Model Beats Claude Sonnet 4.5 and GPT-5
- Mlx-serve: Run LLMs Natively on Your Mac
- Chrome's On-Device AI Features Consuming 4GB of Storage for Gemini Nano
- Lemonade Gives AMD Startups a Wider Path to Local Inference
- How to make SSE token streams resumable, cancellable, and multi-device
- Ask HN: Real life autonomous AI Agents
- Show HN: Desktop Agent Center – Local AI Automation via Hotkeys
- On-Device AI Market Poised for Explosive Growth as Major Tech Companies Invest Heavily
- llama.cpp Now Supports Multi-Token Prediction in Beta
- Chrome LLM Prompt API Raises Local Deployment Questions
- AI Quota Inflation Is No Token Effort. It's Baked In
- Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful
- Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
- Self-Hosted LLM Took Personal Knowledge Management System to the Next Level
- On-Device AI: Achieving Powerful AI Capabilities Without Internet Connectivity
- CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI
- Running AI Natively on Windows 11 Using an eGPU
- DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026
- Local AI Ecosystem Extends Far Beyond Ollama
- Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
- Pluggable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations