Tagged "model-deployment"
- Tether AI Upgrades QVAC SDK With TurboQuant for Data Center-Sized Memory on Everyday Devices
- Nvidia Enters Windows Laptop Market, Taking on Intel and AMD
- Chrome Silently Downloads 4GB AI Model for Local Inference Without User Consent
- Show HN: An Open-Source Interactive AI Engineering Syllabus (1,100 Papers)
- M5 Max MacBook Runs Local Large Language Models Efficiently
- AMD Posts HDMI 2.1 FRL Patches for Amdgpu Linux Driver
- Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop
- Llama.cpp's Auto Fit Feature Quietly Reshapes Local AI Inference on Consumer Hardware
- go-AI: New Inference API Library for Go Released
- Laimark – 8B LLM That Self-Improves on Consumer GPUs
- Gemini-CLI, Llama.cpp, and Qwen3.5 Running on NVIDIA Jetson TK1
- NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model