Tagged "enterprise"
-
Self-Hostable AI Agents and Internal Software Framework Released
-
Qwen 3.5 Models: Optimal Settings and Reduced Overthinking Configuration
-
LM Studio Releases Reworked Plugins with Fully Local Web Research
-
Llama.cpp ROCm 7 vs Vulkan Performance Benchmarks on AMD Mi50
-
Korea to Deploy Domestic AI Chips in Smart Cities as NPU Trials Scale Up
-
Powerful AI Search Engine Built on Single GeForce RTX 5090
-
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Brezn – Decentralized Local Communication
-
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide
-
Qualcomm and Samsung's 30-Year AI Alliance Enters a New Phase as On-Device AI Chip Race Heats Up
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
MacinAI Local brings functional LLM inference to classic Macintosh hardware
-
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
-
What AI Augmentation Means for Technical Leaders
-
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis
-
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
-
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation
-
ASUS ExpertCenter PN55 Mini PC Combines AMD AI CPU and 55 TOPS NPU
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
-
Dell Pro Max 16 Plus Launches With Enterprise-Grade Discrete NPU for On-Device AI
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
-
On-Device AI: Tether's QVAC Fabric Enables Local Training
-
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since
-
Hugging Face Releases One-Liner for Automatic Hardware Detection and Model Selection
-
Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware
-
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models
-
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions
-
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency
-
Apple's On-Device AI Raises Privacy Alarms Across British Parliament
-
Show HN: Buxo.ai – Calendly alternative where LLM decides which slots to show
-
AMD Launches Agent System Optimized for Local AI Inference With Ryzen and Radeon
-
Achieving 2000 Tokens Per Second with QWEN 3.5 27B on RTX-5090
-
Intel OpenVINO Backend Support Now Available in llama.cpp
-
Fine-Tuned 14B Model Outperforms Claude Opus 4.6 on Ada Code Generation
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Nvidia Pushes Jetson as Edge Hub for Open AI Models
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup
-
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
-
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026
-
Gyro-Claw – Secure Execution Runtime for AI Agents
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Building PyTorch-Native Support for IBM Spyre Accelerator
-
Windows 11 Notepad to Feature On-Device AI Text Generation Without Subscription
-
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Unity Showcases Manufacturing AI Workflow at Smart Factory Expo
-
RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale
-
On-Device AI Laptop Lineups Become Standard Across Major Manufacturers
-
Quantifying Cost Savings with Local LLMs for Development
-
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI
-
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
Open-Source Article 12 Logging Infrastructure for the EU AI Act
-
Claude Opus 4.6 Solves Problem Posed by Don Knuth
-
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals
-
AMD Expands Ryzen AI 400 Series Portfolio for Consumer and Enterprise AI PC Options
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
ParseHive – AI-Powered Invoice Data Extraction for Windows and Mac
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
Configure MCP Servers Once, Sync Them Everywhere
-
AI-Native Store Research
-
Seco Launches Edge AI System-on-Module at Embedded World 2026
-
On-Device AI in Mobile Apps: What Should Run on the Phone vs the Cloud (A 2026 Decision Guide)
-
Android Phones Are Getting Smarter Without Internet — Here's Why On-Device AI Is the Next Big Shift
-
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production
-
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting
-
Red Hat Launches AI Enterprise for Hybrid AI Deployments
-
Mirai Tech Raises $10 Million for On-Device AI Innovation
-
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
-
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Show HN: Tickr – AI Project Manager That Lives Inside Slack (Replaces Jira)
-
At India AI Impact Summit, Intel Showcases AI PCs and Cost-Efficient Frugal AI
-
Asus ExpertBook B3 G2 with 50 TOPS AI Sets New Enterprise Standard
-
AI PCs Explained: 7 Critical Truths About NPUs and Privacy
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
I Run Local LLMs in One of the World's Priciest Energy Markets, and I Can Barely Tell
-
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
24 Simultaneous Claude Code Agents on Local Hardware
-
TemplateFlow – Build AI Workflows, Not Prompts
-
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro
-
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR
-
Ollama Production Deployment: Docker-Compose Setup Guide
-
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI
-
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows
-
LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM
-
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released
-
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure
-
Same INT8 Model Shows 93% to 71% Accuracy Variance Across Snapdragon Chipsets
-
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs
-
Show HN: PgCortex – AI enrichment per Postgres row, zero transaction blocking
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection)
-
High Bandwidth Flash Memory Could Alleviate VRAM Constraints in Local LLM Inference
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Chinese AI Chipmaker Axera Semiconductor Plans $379 Million Hong Kong IPO for Edge Inference Hardware
-
Asus ExpertBook B3 G2 Laptop Features Ryzen AI 9 HX 470 CPU in 1.41kg Ultraportable Form Factor
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
Security Alert: Open Claw Designed for Self-Hosting, Stop Sharing Credentials
-
Critical vLLM RCE Vulnerability Allows Remote Code Execution via Video Links
-
175,000 Publicly Exposed Ollama AI Servers Discovered Across 130 Countries
-
WinClaw: Windows-Native AI Assistant with Office Automation
-
Simile AI Raises $100M Series A for Local AI Infrastructure
-
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
-
Microsoft MarkItDown: Document Preprocessing Tool for LLMs
-
175,000 Publicly Exposed Ollama Servers Create Major Security Risk
-
Building a RAG Pipeline on 2M+ Pages: EpsteinFiles-RAG Project