Tagged "gpu-optimization"

AI Inference is Rewriting the GPU Buying Playbook 22 July 2026
Critical GPU Memory Leak Vulnerability Discovered in vLLM 7 July 2026
NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark 1 June 2026
Intel llm-scaler-vllm 1.4 Released With Updated Components and Arc Pro B70 Support 21 May 2026
Running a Serious AI Model on a Consumer GPU Just Got Easier and That Matters More Than the Benchmark 3 May 2026
GPU Passthrough to LXCs in Proxmox Simplifies Local Inference Infrastructure 15 April 2026
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications 12 April 2026
Qwen 3.5 122B Achieves 198 Tokens/sec on Dual RTX PRO 6000 Blackwell GPUs 10 April 2026
TurboQuant-Optimized llama.cpp Fork Delivers GFX906 GPU Acceleration 7 April 2026
NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs 3 April 2026
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs 3 April 2026
TurboQuant Enables Qwen 3.5-27B on 16GB Consumer GPUs 2 April 2026
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment 28 March 2026