Tagged "enterprise"
-
Building a Local AI Stack: Five Docker Containers to Replace ChatGPT Subscriptions
-
Economic Implications of AI Adoption: Why Local Deployment Matters for Cost Control
-
Unsloth's Custom Kernels Make LLM Fine-Tuning Viable on Consumer GPUs
-
Linux Crushes Windows on llama.cpp Inference by Double Digits
-
The New Linux Kernel AI Bot Uncovering Bugs Is A Local LLM On Framework Desktop + AMD Ryzen AI Max
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
-
Can IBM's RITS Platform and vLLM Reset the Bar for Enterprise AI Access?
-
75% of US Health Systems Are Using AI. Only 18% of That Deployment Is Governed
-
SiGit Code: Local-First Coding Agent
-
LLMs Consume 5.4x Less Mobile Energy Than Ad-Supported Web Search
-
Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions
-
Hackers Exploit Ollama Model Uploads to Leak Server Data
-
Netherlands Reaches Deal to Cut Reliance on U.S. Cloud Tech
-
Using a Local LLM as a Zero-Shot Classifier
-
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
How to Make Sense of AI
-
Local LLM for Private Companies
-
Intel OpenVINO 2026.1 Integrates llama.cpp with Wildcat Lake and Arc Pro B70
-
Intel LLM-Scaler vLLM 0.14.0 Released With Official Arc Pro B70 Support
-
Cortex Auth – Rust secrets vault for AI agents (exec-based injection)
-
Developer Replaced GPT-4 with a Local SLM and CI/CD Pipeline Stability Improved
-
Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers
-
Gemma 4 Just Replaced My Whole Local LLM Stack
-
Intel Extends AI PC Reach With New Core Ultra Series 3 Launch
-
Claude vs Local LLM: Real-World Prompt Comparison Reveals Trade-offs
-
The AI-Ready Product Data Framework for B2B Commerce
-
AI Quota Inflation Is No Token Effort. It's Baked In
-
Minisforum Launches N5 Max AI NAS with OpenClaw
-
I Connected My Local LLM to My Browser and It Changed How I Automated Tasks
-
Kilo is the VS Code Extension That Actually Works with Every Local LLM
-
I Built a Local AI Stack with 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
Exposed LLM Infrastructure: How Attackers Find and Exploit Misconfigured AI Deployments
-
After Two Months of Open WebUI Updates, I'd Pick It Over ChatGPT's Interface for Local LLMs
-
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful
-
Intel's $949 GPU Has 32GB of VRAM for Local AI, but the Software Is Why Nvidia Keeps Winning
-
N8n, Dify, and Ollama Emerge as Leading Self-Hosted AI Automation Stack
-
LLM Personalization Breaks Down in High-Stakes Finance
-
Slop-scan – Detect AI Code Slop Patterns in Your Repo
-
DotLLM – Building an LLM Inference Engine in C#
-
DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend
-
Ubiquiti UniFi G6 Turret 4K Camera Features On-Device AI Processing at $199 Price Point
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
Minisforum N5 MAX AI NAS Delivers 126 TOPS with 200TB Storage for Local LLM Workloads
-
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026
-
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills
-
On-Device AI Inference Emerges as New Security Blind Spot for CISOs
-
Defender – Local Prompt Injection Detection for AI Agents
-
Running Same Prompts Through Claude and Local LLM Revealed Unexpected Results
-
Gemma 4 31B vs Qwen 3.5 27B: Comprehensive Long Context Benchmark
-
ASUS ExpertBook P1 Integrates On-Device AI for Enterprise Collaboration
-
AI Workflow Evolution: From Prompts to Near-Autonomous Systems
-
AI PC Market Projected to Reach $235B by 2032, Driven by On-Device Computing Adoption
-
Ollama's Limitations for Production Local LLM Deployments
-
Local Small LLMs Match Enterprise Model Performance on Vulnerability Detection
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
5 Open-Source Projects Running Transformers on CPUs to GPUs in Pure Java
-
AI Scans 400k Reddit Posts to Flag Overlooked GLP-1 Side Effects
-
Energy Consumption: The Final Frontier for AI and Local Inference
-
Mano-P: Open-Source On-Device GUI Agent, #1 on OSWorld Benchmark
-
Ask HN: Local-First Meetings Recorder and Transcriber
-
Intel Releases OpenVINO 2026.1 With Backend For Llama.cpp, New Hardware Support
-
Privilege Escalation Attacks on GPUs Using Rowhammer
-
GitHub Copilot CLI Adds Support for BYOK and Local Model Deployment
-
Docsie Launches On-Premise AI Platform for Regulated Industries
-
Your Next Assistant is Your PC: How On-Device AI is Transforming Work, One Workflow at a Time
-
Google Launches Offline AI Dictation App for iOS with Gemma
-
Gemma 4 Achieves Top Multilingual Performance Across European Languages
-
METATRON: Open-Source AI Penetration Testing with Local LLMs
-
Lenovo Korea Launches AI-Powered Industrial Edge Solutions
-
Qwen 3.5 397B Reduced to 35% Parameters With Usable Quality on 96GB GPU
-
DGX Spark Hardware Limitations: Missing NVFP4 Support Undermines Local AI Value Proposition
-
YC-Bench: GLM-5 Matches Claude Opus 4.6 at 11× Lower Cost
-
NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment
-
Building Cross-Platform Ollama Dashboards with 95% Shared Code
-
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
-
How to Integrate VS Code with Ollama for Local AI Assistance
-
Lotte Innovate and DeepX Collaborate on Mass Production of Domestic AI Semiconductors
-
Chinese Chipmakers Claim Nearly Half of Local Market as Nvidia's Lead Shrinks
-
Satcove – Query 5 AI Models Simultaneously and Get Structured Verdicts
-
ROCm Integration in Ubuntu 26.04 Advances Linux GPU Inference
-
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3
-
Intel's Arc GPU Offers 32GB VRAM for Local AI, But Software Ecosystem Lags Behind
-
GPU Passthrough to LXCs in Proxmox Simplifies Local Inference Infrastructure
-
Closed Source AI = Neofeudalism
-
Dell Technologies Unveils 10 AI PC Models for Business, from Ultralight Laptops to Ultracompact Desktops
-
RAG Deployment Lessons from Regulated Industries
-
Miasma: A Tool to Protect Data from AI Web Scrapers
-
IBM Granite 4.0 3B Vision: Compact Enterprise-Grade Document AI
-
Prompt Security Challenges Emerge as Critical Concern for Local LLM Deployments
-
CERN Embeds Tiny AI Models in Silicon Chips for Real-Time LHC Data Filtering
-
Acer TravelMate AI Laptops Launch in UAE for Business On-Device Inference
-
This Self-Hosted Tool Makes My Local LLMs Feel Exactly Like ChatGPT, but Nothing Leaves My Network
-
Qwen 3.5 27B Achieves 1.1M Tokens/Second on B200 GPUs with Optimized vLLM Config
-
Hold on to Your Hardware: Implications for Local LLM Deployment
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost
-
Real-World Benchmark: DeepSeek-V3 Matches Claude Sonnet on Routine Coding Tasks
-
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features
-
HP Launches IQ On-Device AI Assistant, Advancing Enterprise AI Adoption on PCs
-
Council: A Structured Deliberation Protocol Across Diverse AI Models
-
Self-Hostable AI Agents and Internal Software Framework Released
-
Qwen 3.5 Models: Optimal Settings and Reduced Overthinking Configuration
-
LM Studio Releases Reworked Plugins with Fully Local Web Research
-
Llama.cpp ROCm 7 vs Vulkan Performance Benchmarks on AMD Mi50
-
Korea to Deploy Domestic AI Chips in Smart Cities as NPU Trials Scale Up
-
Powerful AI Search Engine Built on Single GeForce RTX 5090
-
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Brezn – Decentralized Local Communication
-
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide
-
Qualcomm and Samsung's 30-Year AI Alliance Enters a New Phase as On-Device AI Chip Race Heats Up
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
MacinAI Local brings functional LLM inference to classic Macintosh hardware
-
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
-
What AI Augmentation Means for Technical Leaders
-
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis
-
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
-
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation
-
ASUS ExpertCenter PN55 Mini PC Combines AMD AI CPU and 55 TOPS NPU
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
-
Dell Pro Max 16 Plus Launches With Enterprise-Grade Discrete NPU for On-Device AI
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
-
On-Device AI: Tether's QVAC Fabric Enables Local Training
-
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since
-
Hugging Face Releases One-Liner for Automatic Hardware Detection and Model Selection
-
Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware
-
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models
-
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions
-
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency
-
Apple's On-Device AI Raises Privacy Alarms Across British Parliament
-
Show HN: Buxo.ai – Calendly alternative where LLM decides which slots to show
-
AMD Launches Agent System Optimized for Local AI Inference With Ryzen and Radeon
-
Achieving 2000 Tokens Per Second with QWEN 3.5 27B on RTX-5090
-
Intel OpenVINO Backend Support Now Available in llama.cpp
-
Fine-Tuned 14B Model Outperforms Claude Opus 4.6 on Ada Code Generation
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Nvidia Pushes Jetson as Edge Hub for Open AI Models
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup
-
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
-
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026
-
Gyro-Claw – Secure Execution Runtime for AI Agents
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Building PyTorch-Native Support for IBM Spyre Accelerator
-
Windows 11 Notepad to Feature On-Device AI Text Generation Without Subscription
-
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Unity Showcases Manufacturing AI Workflow at Smart Factory Expo
-
RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale
-
On-Device AI Laptop Lineups Become Standard Across Major Manufacturers
-
Quantifying Cost Savings with Local LLMs for Development
-
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI
-
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
Open-Source Article 12 Logging Infrastructure for the EU AI Act
-
Claude Opus 4.6 Solves Problem Posed by Don Knuth
-
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals
-
AMD Expands Ryzen AI 400 Series Portfolio for Consumer and Enterprise AI PC Options
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
ParseHive – AI-Powered Invoice Data Extraction for Windows and Mac
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
Configure MCP Servers Once, Sync Them Everywhere
-
AI-Native Store Research
-
Seco Launches Edge AI System-on-Module at Embedded World 2026
-
On-Device AI in Mobile Apps: What Should Run on the Phone vs the Cloud (A 2026 Decision Guide)
-
Android Phones Are Getting Smarter Without Internet — Here's Why On-Device AI Is the Next Big Shift
-
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production
-
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting
-
Red Hat Launches AI Enterprise for Hybrid AI Deployments
-
Mirai Tech Raises $10 Million for On-Device AI Innovation
-
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
-
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Show HN: Tickr – AI Project Manager That Lives Inside Slack (Replaces Jira)
-
At India AI Impact Summit, Intel Showcases AI PCs and Cost-Efficient Frugal AI
-
Asus ExpertBook B3 G2 with 50 TOPS AI Sets New Enterprise Standard
-
AI PCs Explained: 7 Critical Truths About NPUs and Privacy
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
I Run Local LLMs in One of the World's Priciest Energy Markets, and I Can Barely Tell
-
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
24 Simultaneous Claude Code Agents on Local Hardware
-
TemplateFlow – Build AI Workflows, Not Prompts
-
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro
-
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR
-
Ollama Production Deployment: Docker-Compose Setup Guide
-
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI
-
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows
-
LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM
-
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released
-
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure
-
Same INT8 Model Shows 93% to 71% Accuracy Variance Across Snapdragon Chipsets
-
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs
-
Show HN: PgCortex – AI enrichment per Postgres row, zero transaction blocking
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection)
-
High Bandwidth Flash Memory Could Alleviate VRAM Constraints in Local LLM Inference
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Chinese AI Chipmaker Axera Semiconductor Plans $379 Million Hong Kong IPO for Edge Inference Hardware
-
Asus ExpertBook B3 G2 Laptop Features Ryzen AI 9 HX 470 CPU in 1.41kg Ultraportable Form Factor
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
Security Alert: Open Claw Designed for Self-Hosting, Stop Sharing Credentials
-
Critical vLLM RCE Vulnerability Allows Remote Code Execution via Video Links
-
175,000 Publicly Exposed Ollama AI Servers Discovered Across 130 Countries
-
WinClaw: Windows-Native AI Assistant with Office Automation
-
Simile AI Raises $100M Series A for Local AI Infrastructure
-
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
-
Microsoft MarkItDown: Document Preprocessing Tool for LLMs
-
175,000 Publicly Exposed Ollama Servers Create Major Security Risk
-
Building a RAG Pipeline on 2M+ Pages: EpsteinFiles-RAG Project