Tagged "enterprise"

Self-Hostable AI Agents and Internal Software Framework Released 23 March 2026
Qwen 3.5 Models: Optimal Settings and Reduced Overthinking Configuration 23 March 2026
LM Studio Releases Reworked Plugins with Fully Local Web Research 23 March 2026
Llama.cpp ROCm 7 vs Vulkan Performance Benchmarks on AMD Mi50 23 March 2026
Korea to Deploy Domestic AI Chips in Smart Cities as NPU Trials Scale Up 23 March 2026
Powerful AI Search Engine Built on Single GeForce RTX 5090 23 March 2026
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference 22 March 2026
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach 22 March 2026
Brezn – Decentralized Local Communication 22 March 2026
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide 21 March 2026
Qualcomm and Samsung's 30-Year AI Alliance Enters a New Phase as On-Device AI Chip Race Heats Up 21 March 2026
Pydantic-Deep: Production Deep Agents for Pydantic AI 21 March 2026
MacinAI Local brings functional LLM inference to classic Macintosh hardware 21 March 2026
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide 21 March 2026
Your Site Content Is Powering AI. Your Bank Account Has No Idea 21 March 2026
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090 21 March 2026
What AI Augmentation Means for Technical Leaders 21 March 2026
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis 20 March 2026
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks 20 March 2026
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation 20 March 2026
ASUS ExpertCenter PN55 Mini PC Combines AMD AI CPU and 55 TOPS NPU 20 March 2026
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It 19 March 2026
Dell Pro Max 16 Plus Launches With Enterprise-Grade Discrete NPU for On-Device AI 19 March 2026
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training 19 March 2026
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally 18 March 2026
On-Device AI: Tether's QVAC Fabric Enables Local Training 18 March 2026
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since 18 March 2026
Hugging Face Releases One-Liner for Automatic Hardware Detection and Model Selection 18 March 2026
Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware 18 March 2026
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models 16 March 2026
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions 16 March 2026
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency 16 March 2026
Apple's On-Device AI Raises Privacy Alarms Across British Parliament 16 March 2026
Show HN: Buxo.ai – Calendly alternative where LLM decides which slots to show 15 March 2026
AMD Launches Agent System Optimized for Local AI Inference With Ryzen and Radeon 15 March 2026
Achieving 2000 Tokens Per Second with QWEN 3.5 27B on RTX-5090 14 March 2026
Intel OpenVINO Backend Support Now Available in llama.cpp 14 March 2026
Fine-Tuned 14B Model Outperforms Claude Opus 4.6 on Ada Code Generation 14 March 2026
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents 14 March 2026
Nvidia Pushes Jetson as Edge Hub for Open AI Models 12 March 2026
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted) 12 March 2026
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup 12 March 2026
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI 11 March 2026
Show HN: Aver – a Language Designed for AI to Write and Humans to Review 11 March 2026
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust 10 March 2026
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026 9 March 2026
Gyro-Claw – Secure Execution Runtime for AI Agents 9 March 2026
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications 8 March 2026
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate 7 March 2026
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages 7 March 2026
Building PyTorch-Native Support for IBM Spyre Accelerator 7 March 2026
Windows 11 Notepad to Feature On-Device AI Text Generation Without Subscription 6 March 2026
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations 6 March 2026
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support 6 March 2026
Unity Showcases Manufacturing AI Workflow at Smart Factory Expo 5 March 2026
RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale 4 March 2026
On-Device AI Laptop Lineups Become Standard Across Major Manufacturers 4 March 2026
Quantifying Cost Savings with Local LLMs for Development 4 March 2026
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI 4 March 2026
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market 4 March 2026
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions 4 March 2026
Open-Source Article 12 Logging Infrastructure for the EU AI Act 3 March 2026
Claude Opus 4.6 Solves Problem Posed by Don Knuth 3 March 2026
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals 2 March 2026
AMD Expands Ryzen AI 400 Series Portfolio for Consumer and Enterprise AI PC Options 2 March 2026
RAG-Enterprise – 100% Local RAG System for Enterprise Documents 1 March 2026
ParseHive – AI-Powered Invoice Data Extraction for Windows and Mac 1 March 2026
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026 1 March 2026
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation 1 March 2026
Configure MCP Servers Once, Sync Them Everywhere 1 March 2026
AI-Native Store Research 1 March 2026
Seco Launches Edge AI System-on-Module at Embedded World 2026 27 February 2026
On-Device AI in Mobile Apps: What Should Run on the Phone vs the Cloud (A 2026 Decision Guide) 27 February 2026
Android Phones Are Getting Smarter Without Internet — Here's Why On-Device AI Is the Next Big Shift 27 February 2026
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production 26 February 2026
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting 26 February 2026
Red Hat Launches AI Enterprise for Hybrid AI Deployments 25 February 2026
Mirai Tech Raises $10 Million for On-Device AI Innovation 24 February 2026
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China 24 February 2026
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers 24 February 2026
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec 24 February 2026
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments 24 February 2026
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours 23 February 2026
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio 23 February 2026
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools 23 February 2026
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally 23 February 2026
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI 23 February 2026
Show HN: Tickr – AI Project Manager That Lives Inside Slack (Replaces Jira) 22 February 2026
At India AI Impact Summit, Intel Showcases AI PCs and Cost-Efficient Frugal AI 22 February 2026
Asus ExpertBook B3 G2 with 50 TOPS AI Sets New Enterprise Standard 22 February 2026
AI PCs Explained: 7 Critical Truths About NPUs and Privacy 22 February 2026
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM 21 February 2026
I Run Local LLMs in One of the World's Priciest Energy Markets, and I Can Barely Tell 21 February 2026
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia 21 February 2026
Claude Code Open – AI Coding Platform with Web IDE and Agents 21 February 2026
24 Simultaneous Claude Code Agents on Local Hardware 21 February 2026
TemplateFlow – Build AI Workflows, Not Prompts 20 February 2026
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro 20 February 2026
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR 20 February 2026
Ollama Production Deployment: Docker-Compose Setup Guide 20 February 2026
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge 20 February 2026
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx 20 February 2026
Self-Hosted Local LLMs for Document Management with Paperless-ngx 19 February 2026
Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI 19 February 2026
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows 19 February 2026
LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM 19 February 2026
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released 19 February 2026
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services 18 February 2026
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings 18 February 2026
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure 18 February 2026
Same INT8 Model Shows 93% to 71% Accuracy Variance Across Snapdragon Chipsets 18 February 2026
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs 18 February 2026
Show HN: PgCortex – AI enrichment per Postgres row, zero transaction blocking 17 February 2026
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter 17 February 2026
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection) 17 February 2026
High Bandwidth Flash Memory Could Alleviate VRAM Constraints in Local LLM Inference 17 February 2026
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages 17 February 2026
Chinese AI Chipmaker Axera Semiconductor Plans $379 Million Hong Kong IPO for Edge Inference Hardware 17 February 2026
Asus ExpertBook B3 G2 Laptop Features Ryzen AI 9 HX 470 CPU in 1.41kg Ultraportable Form Factor 17 February 2026
Sourdine: Open-Source macOS App for 100% Local AI Transcription 16 February 2026
Security Alert: Open Claw Designed for Self-Hosting, Stop Sharing Credentials 16 February 2026
Critical vLLM RCE Vulnerability Allows Remote Code Execution via Video Links 14 February 2026
175,000 Publicly Exposed Ollama AI Servers Discovered Across 130 Countries 14 February 2026
WinClaw: Windows-Native AI Assistant with Office Automation 13 February 2026
Simile AI Raises $100M Series A for Local AI Infrastructure 13 February 2026
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues 13 February 2026
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues 12 February 2026
Microsoft MarkItDown: Document Preprocessing Tool for LLMs 12 February 2026
175,000 Publicly Exposed Ollama Servers Create Major Security Risk 11 February 2026
Building a RAG Pipeline on 2M+ Pages: EpsteinFiles-RAG Project 11 February 2026