Tagged "hacker-news"
-
A Cinematic Landing-Page Hero for 80 Cents (GPT Image 2 and Veo 3.1)
-
Supply Chain DLP: Stop Leaked .env Files, Credentials, SSH Keys, and API Tokens
-
Good LLM Development and Usage Patterns
-
From Specialists to Builders: How AI Agentic Coding Is Reshaping Software Teams
-
Two LLM UI Patterns That Aren't Chat
-
Nvidia Enters Windows Laptop Market, Taking on Intel and AMD
-
Netflix Wiz Creates App to Slash AI Bills, Then Open Sources It
-
Fine-tuning an LLM to Write Docs Like It's 1995
-
Proveyouragent: Cryptographic Identity for AI Agents (Ed25519 and DPoP)
-
What Apple Knows About AI That Silicon Valley Won't Admit
-
Show HN: seed – Self-Modifying Webpage with On-Device LLM
-
Netflix Wiz Creates App to Slash AI Bills by Pruning Agent Instructions, Then Open-Sources It
-
Show HN: Egress WAF to Limit AI Agents and NPM Malware Based on mitmproxy
-
Why Chinese AI Labs Went Open and Will Remain Open
-
Three Flavors of Coding with AI Agents
-
Slow Journal App with AI Integration
-
Rsync 3.4.3 Features Hundreds of Claude Commits
-
Rewriting CRIU in Zig using LLM
-
The Windows Device Manager, on Linux
-
Tiny microphone on my balcony to listen for any birds passing by
-
Real-time LLM Inference on Standard GPUs: 3k tokens/s per request
-
GPUs and RAM Are in Short Supply, but the Real Bottleneck for AI Is Electricians
-
CNN sues Perplexity over alleged AI copyright theft
-
Superpowers: An Agentic Skills Framework for AI Coding Workflows
-
Money Printer Pro – Open-source AI Content Generator
-
Mistral AI Launches Mistral Vibe
-
Local-first: Rebuilding a Read-later App with PowerSync and SQLite
-
The Anatomy of an LLM
-
Show HN: I Built a Debugging Challenge for the AI Coding Age
-
AI Guardrails Stripped From Meta and Google Models in Minutes
-
Show HN: An Open-Source Interactive AI Engineering Syllabus (1,100 Papers)
-
AgentSlice – Make AI Coding Agents Ask Before They Edit
-
Why AI Hardware Is a Chip Layer Problem
-
A Maintainability Ratchet for AI-Assisted Python
-
Google Adds llms.txt Check to Chrome Lighthouse
-
Why Your Docker Container Is 1.2GB When It Should Be 80MB
-
PLLuM: Poland's Ministry of Digital Affairs Releases Open Models on HuggingFace
-
Show HN: Interactive and Stylized AI Chat Chrome Extension
-
Google Makes Gemini 3.5 Flash the Default AI Model for Billions of Users
-
The Brain vs. Deep Learning Part I: Computational Complexity Analysis
-
A/B Tested Gemini 3.1 Pro vs. Claude Opus 4.6 – Usage Quota and Quality Comparison
-
Nvidia Raises Video Encoder Limit to 12 on Consumer GPUs
-
Hardware LLM Taalas Reaches >14,000 TPS on Llama 3.1 8B
-
Auditing Apple's DifferentialPrivacy.framework: Bugs, Misconfig, Practical Risks
-
AMD's New Ryzen AI Max Pro 400 with 192GB LPDDR5X Memory
-
AI Token Streaming Isn't About SSE vs. WebSockets
-
OpenAI Agents SDK Ported to React Native for Mobile Deployment
-
Open Source Local Audio Stem Separation Tool Released
-
LLM Wiki App Chunker: Transform Documents Into Navigable Knowledge Trees
-
Bito's AI Architect Improves Claude Opus Task Success Rate by 35%
-
Safety Paradox: How RLHF Creates the AI Psychosis Problem It's Meant to Prevent
-
Ansede-static: Offline SAST Tool Demonstrates Value of Local AI Tools
-
Linux 7.1-rc4 Released: Kernel Updates Relevant to Local LLM Inference
-
The Time Bomb Went Off: AI's All-You-Can-Eat Era Just Ended in Real Time
-
The AI Layoff Receipts: Market Consolidation Accelerates Open-Source Model Adoption
-
Towards Local Plug-and-Play AI
-
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU
-
My Thoughts on AI, Part 1: Fears, Opinions, and Mental Journey
-
A Lo-Fi Rebellion Against A.I.
-
SynapseKit: A New Production Framework for Deploying LLMs
-
Offline Voice-to-Text and AI Keyboard App for Local Processing
-
n8n-MCP: AI Assistants Can Now Build and Search n8n Workflows
-
How to Train Your GPT: Comprehensive Commented Training Guide
-
Show HN: Find the best local LLM for your hardware, ranked by benchmarks
-
RelaxAI – UK sovereign LLM inference at 80% cheaper than OpenAI/Claude
-
Kog AI – Building a Real-Time Inference Stack on AMD Instinct GPUs
-
AI, open code and vulnerability risk in the public sector
-
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training
-
Claude Opus 4.7 System Prompt Leaks Raise Local Deployment Questions
-
Avocado Studio: Open-Source AI Content Editor for Next.js Sites
-
Researchers Report AI Breaking Every Benchmark for Autonomous Cyber Capability
-
Legacy System Analysis with AI Reveals Modern Architecture Under the Hood
-
What If AI Systems Weren't Chatbots?
-
Tsjilp – AI as a Silent Communication Assistant
-
Mainline Linux 6.12 on Annapurna Labs Alpine V2 (Ubiquiti UNVR, UDM-Pro)
-
Berget AI Announces Berget Code for European Teams Powered by Kimi K2.6
-
Before Upload – Check Files Locally Before Sending to AI Tools
-
Privatemode.ai – AI Provider with Confidential Computing
-
Mass NPM Supply Chain Attack Hits TanStack, Mistral AI, and 170 Packages
-
Microsoft Researchers Find AI Models and Agents Can't Handle Long-Running Tasks
-
LLM Hallucinations in the Wild
-
I Think I Figured Out What an AI IDE Looks Like
-
MDL: Endless Visual Novel Engine Powered by AI
-
Lython: Experimental Python Compiler Toolchain Based on LLVM
-
Cotypist – AI Autocomplete for Mac
-
I Built My Second Brain for Meetings. No Monthly Subscription
-
All Those A.I. Note Takers? They're Making Lawyers Nervous
-
Mlx-serve: Run LLMs Natively on Your Mac
-
LibreOffice 26.4 Beta Integrates Local AI Writing Features
-
EU AI Act Article 50: Transparency Rules Impact on Local Deployments
-
Quest to Becoming AI Independent: Local Deployment Movement
-
Discussion: Including New Mathematical Proofs in LLM Training Data for Rediscovery
-
Dikaletus: Open-Source Meeting Recording and Transcription Using Mistral AI
-
Anthropic Develops Tool to Detect When Claude Recognizes It's Being Tested
-
Bun's Experimental Rust Rewrite Achieves 99.8% Test Compatibility on Linux
-
Show HN: A Local-First Agentic Knowledge Manager
-
Google Removes Privacy Assurances After Stuffing Devices With Their AI Model
-
Show HN: Runs AI Coding Agents Inside Isolated Docker Containers
-
Airplane AI – Local NDA Safe AI Powered by Gemma
-
0ctx – Local-First Project Memory for AI Workflows
-
How to make SSE token streams resumable, cancellable, and multi-device
-
Ask HN: Real life autonomous AI Agents
-
I got prompt-injected asking Claude on iOS to recommend a cycling route app
-
Locked, stocked, and losing budget: AI vendor lock-in bites back
-
Zed Editor Integrates AI Features with Local Deployment Focus
-
Enterprise Workplace AI: Questions on Standardizing Local vs Cloud Models
-
NHS England Withdraws AI Software Over Security and Hacking Concerns
-
Improving Code Quality with Local Claude and Codex Models
-
Agentic AI Community Focus: Building Local Agents in 2026
-
US State Dept Orders Global Warning About Alleged AI Thefts by DeepSeek
-
A 49-Line Physics Classifier That Beats kNN on 76% of Benchmarks
-
NHS to Close-Source GitHub Repos Over AI and Security Concerns
-
Show HN: Memex, Claude Memory via Local RAG with MCP and Offline Embeddings
-
Show HN: Claude Relay – Local Claude Code Sessions Message Each Other
-
Ruflo: Multi-Agent AI Orchestration for Claude Code
-
Daintree: A Delegation Environment for Orchestrating AI Coding Agents
-
Building a Jira Alternative with Claude in 8 Days
-
Control AI Risk with Pre-Built Frameworks and Ready-to-Run Evaluations
-
Thoth – Open-Source Local-First AI Assistant
-
NIST's CAISI Evaluation of DeepSeek V4 Pro Finds It On Par with GPT-5
-
Show HN: Kit – Editor, Browser, Terminal, Mail with AI Agents Sharing Context
-
Show HN: Enoch – Control Plane for Autonomous AI Research
-
How to Test AI Agents When They Never Give the Same Answer Twice
-
ScopeGuard 0.0.7: Go Linter with Model Context Protocol Support
-
Show HN: Filling PDF Forms with AI Using Client-Side Tool Calling
-
AMD Posts HDMI 2.1 FRL Patches for Amdgpu Linux Driver
-
Study: AI Models That Consider User Feelings Are More Likely to Make Errors
-
AI Coding Tools Are Silently Disagreeing with Each Other
-
Xmemory: Benchmarking Structured AI Memory Against RAG and Hybrid RAG
-
Ubuntu is Going All In on Generative AI and Other Linux Distros Might Follow
-
Meta Just Killed Open-Source AI
-
96.8% of MCP Tool Descriptions Don't Warn the Agent About Destructive Behaviour
-
Private LLM vs. ChatGPT: When It Makes Sense for Business
-
How Much "Brain Damage" Can an LLM Tolerate?
-
Estimating Black-Box LLM Parameter Counts via Factual Capacity
-
Chrome LLM Prompt API Raises Local Deployment Questions
-
Show HN: Arkloop – Open-Source, Local-First Agent Client
-
Why the Same LLM Gives Different Answers in Different Environments
-
Show HN: Minimal Linux Sandboxes to Manage AI-Generated Code with Ease
-
An Update on GitHub Availability: Infrastructure Lessons for Hosted LLM Tools
-
Economic Implications of AI Adoption: Why Local Deployment Matters for Cost Control
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
-
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad
-
75% of US Health Systems Are Using AI. Only 18% of That Deployment Is Governed
-
SiGit Code: Local-First Coding Agent
-
Rust Open-Source Headless Browser for AI Agents and Web Scraping
-
LLMs Consume 5.4x Less Mobile Energy Than Ad-Supported Web Search
-
Show HN: A Karpathy-Style LLM Wiki Your Agents Maintain
-
Fixing Hallucination in LLM Prediction With Only One 48GB GPU
-
Seed3D 2.0
-
Netherlands Reaches Deal to Cut Reliance on U.S. Cloud Tech
-
Mathesar 0.10.0
-
How to Make Sense of AI
-
AI Agent Designs a RISC-V CPU Core from Scratch
-
Show HN: We built an OCR server that can process 270 dense images/s on a 5090
-
I Cancelled Codex Two Months Ago. Opus 4.7 Brought Me Back
-
Local LLM for Private Companies
-
Cortex Auth – Rust secrets vault for AI agents (exec-based injection)
-
Tesseron: New API Framework for AI Agents with Developer-Defined Configuration
-
My AI Workflow: Practical Guide to Using AI Without Skill Atrophy
-
go-AI: New Inference API Library for Go Released
-
AI Licensing Marketplaces: A Guide for Publishers and Content Creators
-
ZeusHammer: Built an AI Agent That Thinks Locally
-
Controlling the Secondary Fan on Minisforum AI Pro HX 370
-
Bun v1.3.13
-
The AI-Ready Product Data Framework for B2B Commerce
-
AI Quota Inflation Is No Token Effort. It's Baked In
-
Web Agent Bridge: Open-Source OS for AI Agents
-
Waterloo's Live AI-Goose Tracker: Real-Time Edge Vision
-
PCMind: Local AI Analysis of Docs, Audio, Video and Images
-
Memjar: Uncompromising Local-First Second Brain
-
Llama.cpp Robot Wars
-
Show HN: I Can't Write Python. It Works Anyway – Local LLM Automation
-
Sorting 1M u64 KV-Pairs in 20ms on i9-13980HX Using Branchless Rust Implementation
-
BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism
-
When Should AI Step Aside?: Teaching Agents When Humans Want to Intervene
-
Show HN: An MCP server that lets AI compose music on a hardware synth
-
Community Computer: Collaborative Autoresearch on a Peer-to-Peer Network
-
Building a Voice AI Wearable in a Casio F91W with Whisper and BLE
-
Project Glasswing and the ASF: Open-Source's Chance to Win the AI Era
-
LLM Personalization Breaks Down in High-Stakes Finance
-
Book Translator: Two-Pass Local Translation with Self-Reflection via Ollama
-
Bonsai 1.7B in the Browser: A 290MB 1-bit LLM on WebGPU
-
Slop-scan – Detect AI Code Slop Patterns in Your Repo
-
SigMap – Shrink AI Coding Context 97% with Auto-Scaling Token Budget
-
GBrain – System to Make Your AI Agent Better Reflect You
-
DotLLM – Building an LLM Inference Engine in C#
-
Talking to a Local LLM in the Firefox Sidebar
-
Sovereign AI: Why the Next GPT Will Be Born in Our Living Rooms
-
Qwen 3.5 Small – On-Device Multimodal Models Released
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
Copilot Rate-Limiting Issues Highlight Cloud AI Service Limitations
-
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026
-
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills
-
Defender – Local Prompt Injection Detection for AI Agents
-
Learn LLM Internals
-
AI Conditionally Allowed in the Linux Kernel
-
Universal Knowledge Store and Grounding Layer for AI Reasoning Engines
-
A Deep Dive into Tinygrad AI Compiler
-
MiniMax M2.7 Is Now Open Source
-
Rapidly Scaffold Agents, MCP Servers, APIs, Websites on AWS
-
I Gave My AI Shell Access and Felt Uneasy – So I Sandboxed It
-
AIYO Wisper: Local Voice-to-Text for macOS Using WhisperKit
-
AI Workflow Evolution: From Prompts to Near-Autonomous Systems
-
Self-Installing Skill Manager for AI Agents
-
Warp Decode vs. vLLM's Triton Kernel: Performance Crossover Analysis
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
5 Open-Source Projects Running Transformers on CPUs to GPUs in Pure Java
-
AI Scans 400k Reddit Posts to Flag Overlooked GLP-1 Side Effects
-
Energy Consumption: The Final Frontier for AI and Local Inference
-
Running a 1.7B-Parameter LLM on an Apple Watch
-
Mano-P: Open-Source On-Device GUI Agent, #1 on OSWorld Benchmark
-
Ask HN: Local-First Meetings Recorder and Transcriber
-
Gemini-CLI, Llama.cpp, and Qwen3.5 Running on NVIDIA Jetson TK1
-
Privilege Escalation Attacks on GPUs Using Rowhammer
-
Show HN: Willitrun – Check if Any ML Model Runs on Any Device (Benchmark-Backed)
-
StyleSeed – Design Rules That Make AI Coding Tools Produce Professional UI
-
Quansloth Using Google's TurboQuant Breaks the VRAM Wall for Local LLMs
-
MemPalace, the Highest-Scoring AI Memory System Ever Benchmarked
-
CricketBrain: Neuromorphic Signal Processor in Rust (0.175us/step, 944 bytes)
-
VLA Learns How to Act. S2S Decides Whether the Motion Is Physically Trustworthy
-
Verbatim 140W GaN: One of the First Chargers With USB PD 3.2 AVS (SPR) Support
-
GPU Memory for LLM Inference (Part 1)
-
Show HN: Turn Photos Into Wordle Puzzles with AI That Runs 100% in Your Browser
-
Vektor – Local-First Associative Memory for AI Agents
-
Qwen 3.6 Free Model Available via OpenRouter
-
Microsoft Quantum Development Kit Ported to Rust: 100x Faster and Smaller
-
Nex Life Logger: Local Activity Tracker with AI Agent Integration
-
Mixed Precision Quantization on MLX with TurboQuant Implementation
-
GPUs vs. TPUs: Decoding the Powerhouses of AI
-
Free AI Video Clipper Using Scene and Speech-Based Segmentation
-
Autonet: Decentralized AI Training with Constitutional Governance
-
SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions
-
OpenUMA – Apple-Style Unified Memory for x86 AI Inference
-
April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini
-
Building Cross-Platform Ollama Dashboards with 95% Shared Code
-
Gemma 4 Makes Local AI Agents Practical
-
Apfel – The Free AI Already on Your Mac
-
Men Are Ditching TV for YouTube as AI Usage and Social Media Fatigue Grow
-
git11 Is an AI Workspace for GitHub Engineering Teams
-
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI
-
Chinese Chipmakers Claim Nearly Half of Local Market as Nvidia's Lead Shrinks
-
Satcove – Query 5 AI Models Simultaneously and Get Structured Verdicts
-
If Your AI Agent Ran NPM Install During the Axios Attack, You're Compromised
-
Gemini CLI – Open-Source AI Agent for Terminal Integration
-
Is Anyone Working on an AI Operating System?
-
Orca – Executable skills and capabilities for AI agent workflows
-
I built an O(1) physics engine to stop LLM hallucinations in construction
-
Closed Source AI = Neofeudalism
-
Ask HN: What do you use for local embeddings?
-
Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces
-
Miasma: A Tool to Protect Data from AI Web Scrapers
-
Lat.md: Agent Lattice – A Knowledge Graph for Your Codebase in Markdown
-
ESP32-S31: 320MHz 2-Core Microcontroller with 512KB SRAM and Networking
-
DaVinci-MagiHuman: Open-Source AI Model for Realistic Video Generation
-
Qwen3 512k Context via TurboQuant on Mac mini
-
Introduction to Nyreth v1.0
-
Forensic Beats Mem0 with 90.1% on LOCOMO Benchmark
-
Reverse-Engineering the Apollo 11 Code with AI
-
Why Your AI Agents Will Turn Against You
-
mlx-Code: Run Claude Code Locally with MLX-LM
-
Hold on to Your Hardware: Implications for Local LLM Deployment
-
Book on AI Agents for the Layman: Understanding Agent-Based Systems
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
Why Responsible AI Is the Bedrock of AI-Powered Applications
-
Meta Releases HyperAgents: Self-Improving AI
-
MCP-Manticore: Let Your AI Assistant Write Manticore Queries for You
-
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost
-
Operating Systems. One USB. ZFS on Root. AI-Powered. Free
-
Running an Open-Weight LLM Locally on an Apple Watch
-
Show HN: Open Agent Spec – Treat AI Agents Like Typed Functions, Not Prompt Chains
-
AI Slop or Quality Storytelling? – Dune Themed MCP Gateway Tutorial
-
Council: A Structured Deliberation Protocol Across Diverse AI Models
-
.APKs Are Just .ZIPs: Semi-Legally Hacking Software for Orphaned Hardware