Tagged "nvidia"
- NVIDIA Nemotron 3 Nano Omni Powers Multimodal Agent Reasoning in a Single Efficient Open Model
- Hipfire: A Rust-Native AMD Inference Engine That Outperforms llama.cpp
- Unsloth's Custom Kernels Make LLM Fine-Tuning Viable on Consumer GPUs
- NVIDIA Adds Day-0 DeepSeek V4 Blackwell Support
- Intel OpenVINO 2026.1 Integrates llama.cpp with Wildcat Lake and Arc Pro B70
- Intel LLM-Scaler vLLM 0.14.0 Released With Official Arc Pro B70 Support
- Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
- Intel's $949 GPU Has 32GB of VRAM for Local AI, but the Software Is Why Nvidia Keeps Winning
- Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference
- DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend
- MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications
- Intel Arc Pro B70 32GB Achieves 12 Tokens/Sec on Qwen 3.5-27B
- PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring
- AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs
- Ollama Gets Blazing Fast on Macs with Full MLX Support and 2× Speedups
- DGX Spark Hardware Limitations: Missing NVFP4 Support Undermines Local AI Value Proposition
- Samsung Launches Galaxy Book6 Series with NVIDIA RTX 5070 and On-Device AI
- NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment
- GPUs vs. TPUs: Decoding the Powerhouses of AI
- AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs
- NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs
- Google Launches Gemma 4 Open Models for Local On-Device AI
- AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs
- TinyGPU Adds Mac Support for External Nvidia GPU Acceleration
- Lotte Innovate and DeepX Collaborate on Mass Production of Domestic AI Semiconductors
- Intel's $949 GPU Has 32GB of VRAM for Local AI, but Software is Why Nvidia Keeps Winning
- ROCm Integration in Ubuntu 26.04 Advances Linux GPU Inference
- Intel's Arc GPU Offers 32GB VRAM for Local AI, But Software Ecosystem Lags Behind
- Is Anyone Working on an AI Operating System?
- Samsung launches Galaxy Book6 series in India with Nvidia RTX 5070 graphics and on-device AI
- Intel's $949 GPU has 32GB of VRAM for local AI, but the software is why Nvidia keeps winning
- Select the Right Hardware for Your Local LLM Deployment with This Online Guide
- Samsung Launches Galaxy Book6 Series in India with NVIDIA RTX 5070 Graphics and On-Device AI
- Samsung Galaxy Book6 Brings Consumer-Grade On-Device AI Hardware to Market
- RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra
- NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
- Intel Launches Arc Pro B70/B65 with 32GB VRAM for Local AI Inference
- Researcher Successfully Runs Local LLMs on Legacy "Dead" GPU With Surprising Results
- Nvidia Nemotron Cascade 2 30B Emerges as Powerful Alternative to Qwen Models
- DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
- Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
- Repurpose Old GPUs as Dedicated AI Inference Accelerators
- NVIDIA Nemotron Cascade 2 30B Delivers 120B-Class Performance in Compact Form Factor
- NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU
- Llamafile 0.10 Released with GPU Support and Rebuilt Core
- I Ran Local LLMs on a 'Dead' GPU, and the Results Surprised Me
- Qwen 3.5 4B Outperforms Nvidia Nemotron 3 4B in Local Benchmarks
- Mistral Small 4 119B Released with NVFP4 Quantisation Support
- NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions
- Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel
- Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment
- Running Qwen3.5-27B Across Multiple GPUs Over LAN Achieves Practical Speed for Local Inference
- Startup Transforms Mac Mini Into Full-Powered AI Inference System With External GPU
- Open-Source GreenBoost Driver Augments NVIDIA GPU VRAM With System RAM and NVMe Storage
- AMD Launches Agent System Optimized for Local AI Inference With Ryzen and Radeon
- Intel OpenVINO Backend Support Now Available in llama.cpp
- Linux 7.0 AMDGPU Fixing Idle Power Issue For RDNA4 GPUs After Compute Workloads
- How to Install OpenClaw with Ollama (Step-by-Step Tutorial)
- Nvidia Pushes Jetson as Edge Hub for Open AI Models
- Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment
- Comprehensive MoE Backend Benchmarks for Qwen3.5-397B: Real Numbers vs Hype
- Cutile.jl Brings Nvidia CUDA Tile-Based Programming to Julia
- NVIDIA Jetson Brings Open Models to Life at the Edge
- Intel Arc Pro B70 Workstation GPU Confirmed via vLLM AI Release Notes
- Qwen3.5-27B Identified as Sweet Spot for Mid-Range Local Deployment
- Nvidia Could Launch Its First Laptops With Its Own Processors
- Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia
- NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support
- LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM
- AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs
- NVIDIA's Dynamic Memory Sparsification Cuts LLM Inference Costs by 8x
- Mistral AI Debugs Critical Memory Leak in vLLM Inference Engine
- Community Member Builds 144GB VRAM Local LLM Powerhouse