Tagged "open-source"
-
N8n, Dify, and Ollama Might Be the Best Self-Hosted AI Automation Stack Right Now
-
Pbgopy v0.4.0: Simple Cross-Device Clipboard with History for Local Networks
-
NVIDIA Nemotron 3 Nano Omni Powers Multimodal Agent Reasoning in a Single Efficient Open Model
-
Llama.cpp Runs on SGI Power Challenge from 1995 with MIPS R8000 Kernel
-
Grokfeed: Terminal Feed Reader for HN, Reddit, and Lobste.rs Using Claude Code
-
GraphOS: Visual Runtime and Debugger for AI Agents with Local-First Execution
-
Stop Guessing: Open-Source Tool Predicts Which Local LLMs Run on Your PC
-
Show HN: Minimal Linux Sandboxes to Manage AI-Generated Code with Ease
-
An Update on GitHub Availability: Infrastructure Lessons for Hosted LLM Tools
-
Economic Implications of AI Adoption: Why Local Deployment Matters for Cost Control
-
Unsloth's Custom Kernels Make LLM Fine-Tuning Viable on Consumer GPUs
-
Pocket LLM v1.5.0 Brings Multimodal AI to Android with No Cloud Required
-
The New Linux Kernel AI Bot Uncovering Bugs Is A Local LLM On Framework Desktop + AMD Ryzen AI Max
-
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad
-
NVIDIA Adds Day-0 DeepSeek V4 Blackwell Support
-
SiGit Code: Local-First Coding Agent
-
Rust Open-Source Headless Browser for AI Agents and Web Scraping
-
Show HN: A Karpathy-Style LLM Wiki Your Agents Maintain
-
Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions
-
Seed3D 2.0
-
Hackers Exploit Ollama Model Uploads to Leak Server Data
-
Mathesar 0.10.0
-
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
Building Real-World On-Device AI with LiteRT and NPU
-
AI Agent Designs a RISC-V CPU Core from Scratch
-
Cortex Auth – Rust secrets vault for AI agents (exec-based injection)
-
Tesseron: New API Framework for AI Agents with Developer-Defined Configuration
-
Sarvam Edge: India's Offline AI Model Runs on Phones and Laptops Without Internet
-
go-AI: New Inference API Library for Go Released
-
Cursor-Autoresearch: AI Research Automation Port for Local Workflows
-
AI Licensing Marketplaces: A Guide for Publishers and Content Creators
-
The Open-Source AI Ecosystem Keeps Treating llama.cpp Like a Second-Class Citizen
-
ZeusHammer: Built an AI Agent That Thinks Locally
-
Running DeepSeek R1 Locally: Your Complete Setup Guide
-
Bun v1.3.13
-
Web Agent Bridge: Open-Source OS for AI Agents
-
PCMind: Local AI Analysis of Docs, Audio, Video and Images
-
Memjar: Uncompromising Local-First Second Brain
-
Laimark – 8B LLM That Self-Improves on Consumer GPUs
-
Show HN: I Can't Write Python. It Works Anyway – Local LLM Automation
-
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
-
BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw at It
-
The Case for Out-of-Process Enforcement for AI Agents
-
After Two Months of Open WebUI Updates, I'd Pick It Over ChatGPT's Interface for Local LLMs
-
The 'Ollama' Tool Has Numerous Problems, and Some Argue That Llama.cpp Is Better
-
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful
-
Community Computer: Collaborative Autoresearch on a Peer-to-Peer Network
-
ChatMCP – Connect your AI browser chats to your coding agents
-
Researcher Discovers 221 Bugs in vLLM Stemming From Single Root Cause
-
Project Glasswing and the ASF: Open-Source's Chance to Win the AI Era
-
Open WebUI Emerges as Superior Interface for Local LLMs After Two Months of Active Development
-
N8n, Dify, and Ollama Emerge as Leading Self-Hosted AI Automation Stack
-
Book Translator: Two-Pass Local Translation with Self-Reflection via Ollama
-
Slop-scan – Detect AI Code Slop Patterns in Your Repo
-
Self-Hosted LLMs Transform Personal Knowledge Management Systems
-
Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference
-
DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend
-
Sovereign AI: Why the Next GPT Will Be Born in Our Living Rooms
-
Qwen 3.5 Small – On-Device Multimodal Models Released
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
oMLX Framework Implements DFlash Attention for Optimized Inference
-
MiniMax Clarifies Restrictive License, Signals Policy Update for Regular Users
-
Abliterated Local LLM Models Show Distinct Behavioral Characteristics Compared to Standard Variants
-
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026
-
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills
-
MiniMax M2.7 Open-Sources Globally as Industry's First Self-Improving Model
-
Defender – Local Prompt Injection Detection for AI Agents
-
Learn LLM Internals
-
AI Conditionally Allowed in the Linux Kernel
-
Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite
-
Universal Knowledge Store and Grounding Layer for AI Reasoning Engines
-
MiniMax M2.7 Released: New Model Available for Local Deployment
-
MiniMax M2.7 Is Now Open Source
-
Google's Gemini Nano 4 Offers Faster, Smarter Local Inference Capabilities
-
GLM 5.1 Dominates Agentic Benchmarks, Outperforming Most Models at 1/3 Opus Cost
-
DMax: New Parallel Decoding Paradigm for Diffusion Language Models
-
AIYO Wisper: Local Voice-to-Text for macOS Using WhisperKit
-
Aisbf (AI Should Be Free) Proxy 0.99.18 Released
-
Self-Installing Skill Manager for AI Agents
-
Tether Launches QVAC SDK for Cross-Platform Local AI Development
-
Local Small LLMs Match Enterprise Model Performance on Vulnerability Detection
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
5 Open-Source Projects Running Transformers on CPUs to GPUs in Pure Java
-
Community Reverse Engineers Gemma 4 Multi-Token Prediction Capability
-
VoxCPM2: New Open-Source TTS Model with Voice Cloning and Design
-
Hugging Face Moves Safetensors Under PyTorch Foundation
-
Mano-P: Open-Source On-Device GUI Agent, #1 on OSWorld Benchmark
-
Ask HN: Local-First Meetings Recorder and Transcriber
-
Gemma 4 Support Stabilized in Llama.cpp
-
EXAONE 4.5 33B Model Released with Multiple Quantization Formats
-
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS
-
StyleSeed – Design Rules That Make AI Coding Tools Produce Professional UI
-
PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring
-
Octopoda: Open Source Memory Layer for Fully Offline AI Agents
-
Google Launches Offline AI Dictation App for iOS with Gemma
-
AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs
-
METATRON: Open-Source AI Penetration Testing with Local LLMs
-
Show HN: Lightweight LLM Tracing Tool with CLI
-
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4
-
Apple Brings Enhanced On-Device AI Features to iPhone
-
Show HN: Turn Photos Into Wordle Puzzles with AI That Runs 100% in Your Browser
-
Vektor – Local-First Associative Memory for AI Agents
-
Unpaved: Audit Toolkit for AI Developer Tool Bias in Global South Contexts
-
Satsgate: Monetize AI Agents and APIs with Lightning L402 Protocol
-
Qwen 3.6 Free Model Available via OpenRouter
-
Microsoft Quantum Development Kit Ported to Rust: 100x Faster and Smaller
-
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
-
Apple Research Shows Self-Distillation Significantly Improves Local Code Generation
-
YC-Bench: GLM-5 Matches Claude Opus 4.6 at 11× Lower Cost
-
Nex Life Logger: Local Activity Tracker with AI Agent Integration
-
Netflix Open-Sources VOID Model for Video Object Deletion
-
Google Launches Gemma 4 For Advanced On-Device AI
-
Gemma 4 31B Outperforms GLM 5.1 in Real-World Testing
-
Gemma 4 KV Cache Memory Issues Fixed in llama.cpp
-
Free AI Video Clipper Using Scene and Speech-Based Segmentation
-
Autonet: Decentralized AI Training with Constitutional Governance
-
SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions
-
OpenUMA – Apple-Style Unified Memory for x86 AI Inference
-
Gemma 4 Shows Strong Reasoning Performance with Thinking Tokens
-
Google Launches Gemma 4 Open Models for Local On-Device AI
-
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
-
Apfel – The Free AI Already on Your Mac
-
Apple Silicon Macs Run Local AI Faster with Ollama's New MLX Support
-
Show HN: Memsearch – Persistent, Cross-Agent, Cross-Session Memory for AI Agents
-
git11 Is an AI Workspace for GitHub Engineering Teams
-
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI
-
ROCm Integration in Ubuntu 26.04 Advances Linux GPU Inference
-
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3
-
If Your AI Agent Ran NPM Install During the Axios Attack, You're Compromised
-
Local AI Ecosystem Extends Far Beyond Ollama
-
Gemini CLI – Open-Source AI Agent for Terminal Integration
-
Claude Code Source Leaked: Community Extracts Multi-Agent Orchestration Framework
-
Orca – Executable skills and capabilities for AI agent workflows
-
Ollama Launches Pi: The Minimal Coding Agent That Powers OpenClaw Is Now Yours to Customize
-
Intel's $949 GPU has 32GB of VRAM for local AI, but the software is why Nvidia keeps winning
-
I built an O(1) physics engine to stop LLM hallucinations in construction
-
Closed Source AI = Neofeudalism
-
Ask HN: What do you use for local embeddings?
-
DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026
-
DeepSeek-R1 Chain-of-Thought Debugging: A Developer's Guide
-
Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces
-
Miasma: A Tool to Protect Data from AI Web Scrapers
-
Lat.md: Agent Lattice – A Knowledge Graph for Your Codebase in Markdown
-
Converting a Home Server Into a Production AI Appliance
-
DaVinci-MagiHuman: Open-Source AI Model for Realistic Video Generation
-
Unsloth Studio Beta Ships 50+ New Features for Local Model Training and Inference
-
Qwen3 512k Context via TurboQuant on Mac mini
-
Introduction to Nyreth v1.0
-
GLM-5.1 Model Weights Launching Early April for Local Deployment
-
Forensic Beats Mem0 with 90.1% on LOCOMO Benchmark
-
Reverse-Engineering the Apollo 11 Code with AI
-
Why Your AI Agents Will Turn Against You
-
This Self-Hosted Tool Makes My Local LLMs Feel Exactly Like ChatGPT, but Nothing Leaves My Network
-
RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra
-
Coding Implementation to Run Qwen3.5 Reasoning Models Distilled With Claude-Style Thinking Using GGUF and 4-Bit Quantization
-
Quantization Reveals Outliers Impacting LLM Accuracy
-
Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
-
Meta Releases HyperAgents: Self-Improving AI
-
Operating Systems. One USB. ZFS on Root. AI-Powered. Free
-
Real-World Benchmark: DeepSeek-V3 Matches Claude Sonnet on Routine Coding Tasks
-
Running an Open-Weight LLM Locally on an Apple Watch
-
Show HN: Open Agent Spec – Treat AI Agents Like Typed Functions, Not Prompt Chains
-
OmniCoder v2 Released: Improved Code Generation for Local Deployment
-
New Open-Weight Models Released: GigaChat-3.1-Ultra and Lightning Variants
-
Private Brain LLM Setup on Windows PC Eliminates Need for Paid Cloud Services
-
Critical: LiteLLM Supply Chain Attack Detected, Bifrost Alternative Released
-
Council: A Structured Deliberation Protocol Across Diverse AI Models
-
I built Rubric, an open source Sentry for AI. Looking for beta testers
-
Open-Source AI Text-to-Speech Models You Can Run Locally for Natural Voice
-
Open-Source Tool Helps Determine Which Local LLMs Run on Your PC
-
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
-
llm-d Joins the Cloud Native Computing Foundation
-
Chinese LLM Ecosystem Landscape: ByteDance Doubao, Alibaba, and Open-Source Competition
-
Self-Hostable AI Agents and Internal Software Framework Released
-
MiniMax M2.7 Model to Be Released as Open Weights
-
Alibaba Commits to Continuous Open-Sourcing of Qwen and Wan Models
-
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives
-
Qwen 3.5 122B Uncensored (Aggressive) Released with New K_P Quantisations
-
Nvidia Nemotron Cascade 2 30B Emerges as Powerful Alternative to Qwen Models
-
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Careless Whisper – Personal Local Speech to Text
-
BrowserOS 0.44.0 Release: Advances in Local AI Integration for Web-Based Applications
-
Brezn – Decentralized Local Communication
-
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization
-
AI Playground for Developers Built in Vite and Python
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
Cursor's Composer 2 model attribution dispute highlights open-source licensing concerns
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Atuin v18.13 – Better Search, a PTY Proxy, and AI for Your Shell
-
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis
-
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
-
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services
-
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options
-
NVIDIA Nemotron Cascade 2 30B Delivers 120B-Class Performance in Compact Form Factor
-
NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU
-
LMCache Dramatically Accelerates LLM Inference on Oracle Data Science Platform
-
Llamafile 0.10 Released with GPU Support and Rebuilt Core
-
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation
-
Cursor's Composer 2 Model Analysis – Fine-Tuned Variant of Kimi K2.5
-
Claude Code Permissions Hook – Delegate Permission Approval to LLM
-
AI's Impact on Mathematics Analogous to Car's Impact on Cities
-
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
-
Skills Manager – manage AI agent skills across Claude, Cursor, Copilot
-
My Dinner with AI
-
LucidShark – Local-first, open-source quality and security gate
-
Show HN: Process Mining for AI Agent Systems
-
Qwen 3.5 4B Outperforms Nvidia Nemotron 3 4B in Local Benchmarks
-
Mistral Small 4 119B Released with NVFP4 Quantisation Support
-
Mistral Releases Small 4 Open-Source Model Under Apache 2.0
-
Local Qwen Models Master Browser Automation Through Iterative Replanning
-
How I Used Lima for an AI Coding Agent Sandbox
-
Mistral Releases Leanstral: First Open-Source Code Agent for Lean 4 Proof Assistant
-
Kimi Introduces Attention Residuals: 1.25x Compute Performance at <2% Overhead
-
The Moment AI Agents Stopped Being a Feature and Started Becoming a System
-
How AI Agents Should Pay for API Calls: X402 and USDC Verification on Base
-
OpenClaw Isn't the Only Raspberry Pi AI Tool—Here Are 4 Others You Can Try This Week
-
Qwen 3.5 122B Demonstrates Exceptional Reasoning for Local Deployment
-
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models
-
OmniCoder-9B: Efficient Coding Model for 8GB GPUs
-
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions
-
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency
-
LoKI – Local AI Assistant for Linux and WSL
-
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local)
-
Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One
-
Apple's On-Device AI Raises Privacy Alarms Across British Parliament
-
Show HN: Voice-tracked teleprompter using on-device ASR in the browser
-
StepFun Releases SFT Dataset Used to Train Step 3.5 Flash for Community Fine-Tuning
-
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms
-
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment
-
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot
-
Hybrid AI Desktop Layer Combining DOM-Automation and API-Integrations
-
Open-Source GreenBoost Driver Augments NVIDIA GPU VRAM With System RAM and NVMe Storage
-
I made Karpathy's Autoresearch work on CPU
-
Intel OpenVINO Backend Support Now Available in llama.cpp
-
Local Manga Translator: Production LLM Pipeline with YOLO, OCR, and Inpainting
-
Show HN: Intake API – An Inbox for AI Coding Agents
-
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework
-
Best Local LLM Models 2026: Developer Comparison
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM
-
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models
-
How to Install OpenClaw with Ollama (Step-by-Step Tutorial)
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
Qwodel – An Open-Source Unified Pipeline for LLM Quantization
-
Nvidia Pushes Jetson as Edge Hub for Open AI Models
-
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI
-
NVIDIA Jetson Brings Open Models to Life at the Edge
-
LMF – LLM Markup Format
-
Llama.cpp Celebrates Major Milestone: From Leak to Industry Standard
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code
-
PhotoPrism AI-Powered Photos App Brings Better Ollama Integration
-
Mnemos: Persistent Memory System for Local AI Agents
-
.ispec: Runtime Specification Validation for AI System Consistency
-
Google Delivers On-Device AI Features in New Chromebook Plus Model
-
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
-
FreeBSD 14.4 Released: Implications for Local LLM Deployment
-
Fish Audio Open-Sources S2: Expressive Text-to-Speech with Natural Language Control and 100ms Latency
-
Bash-Based Claude Code Agent: Lightweight Local AI Coding Assistant
-
Community Survey: AI Content Automation Stacks in 2026
-
VS Code Agent Kanban – Task Management for AI-Assisted Development
-
Qwen 3.5 Small Expands On-Device AI to Phones and IoT with Offline Support
-
Qwen 3.5 Derestricted Model Available for Local Deployment
-
Gyro-Claw – Secure Execution Runtime for AI Agents
-
FretBench – Testing 14 LLMs on Reading Guitar Tabs Reveals Performance Gaps
-
Engram – Open-Source Persistent Memory for AI Agents
-
commitgen-cc – Generate Conventional Commit Messages Locally with Ollama
-
VoiceShelf: Fully Offline Android Audiobook Reader Using Kokoro TTS
-
Reverse engineering a DOS game with no source code using Codex 5.4
-
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds
-
OpenSpec: Spec-driven development (SDD) for AI coding assistants
-
Mistral AI Prepares Workflows Integration for Le Chat
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
Show HN: Ivy – the first proactive, offline AI tutor
-
Windows 11 Notepad Gets On-Device AI Text Generation Without Subscription
-
Self-Hosted Paperless-ngx With Optional Local AI Integration
-
Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Qwen3-Coder-Next Achieves Top Ranking on SWE-bench at Pass@5
-
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support
-
Llama.cpp Merges Automatic Parser Generator to Mainline
-
Jse v2.0 AI Output Specification
-
IBM Granite 4.0 1B Speech Model Released for Multilingual Speech Recognition
-
Show HN: Asterode – Multi-Model AI App with Memory and Power Features
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Show HN: TLDR – Free Chrome Extension for AI-Powered Article Summarization
-
llama.cpp Merges Agentic Loop and MCP Client Support
-
Imrobot – Reverse-CAPTCHA for Verifying AI Agents, Not Humans
-
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment
-
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations
-
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust
-
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard
-
OpenWrt 25.12.0 – Stable Release
-
Incrmd: Incremental AI Coding by Editing PROJECT.md
-
Glyph – A Local-First Markdown Notes App for macOS Built With Rust
-
Apple M5 Pro and M5 Max: 4× Faster LLM Processing
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
VibeWhisper – macOS Voice-to-Text with 100% Local Processing Option
-
Qwen 3.5 0.8B Successfully Deployed on 7-Year-Old Samsung S10E Using llama.cpp
-
Qwen 3.5 0.8B Running in Browser with WebGPU via Transformers.js
-
Open-Source Article 12 Logging Infrastructure for the EU AI Act
-
Continuum – CI Drift Guard for LLM Workflows
-
Jan Releases Code-Tuned 4B Model for Efficient Local Code Generation and Development Tasks
-
GitDelivr: A Free CDN for Git Clones Built on Cloudflare Workers and R2
-
C7: Pipe Up-to-Date Library Docs Into Any LLM From the Terminal
-
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
4 Free Tools to Run Powerful AI on Your PC Without a Subscription
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
Configure MCP Servers Once, Sync Them Everywhere
-
AgentLens – Open-Source Observability for AI Agents
-
Qwen 3.5-35B Unsloth Dynamic GGUFs Achieve SOTA Quantisation Benchmarks
-
We Audited the Security of 7 Open-Source AI Agents – Here Is What We Found
-
LLmFit: Terminal Tool for Right-Sizing LLM Models to Your Hardware
-
LLmFit: One-Command Hardware-Aware Model Selection Across 497 Models and 133 Providers
-
Krasis: Hybrid CPU/GPU MoE Runtime Achieves 3,324 Tokens/Second Prefill on RTX 5080
-
Krasis Hybrid MoE Runtime Achieves 3,324 tok/s Prefill on Single RTX 5080
-
On-Device Function Calling in Google AI Edge Gallery
-
Show HN: Caret – Tab to Complete at Any App on Your Mac
-
Arduino and Qualcomm Bring On-Device AI Learning to Indian Schools
-
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required
-
DeepSeek Paper – DualPath: Breaking the Bandwidth Bottleneck in LLM Inference
-
Agent System – 7 specialized AI agents that plan, build, verify, and ship code
-
Red Hat Launches AI Enterprise for Hybrid AI Deployments
-
Qwen3.5 Series Releases Comprehensive Model Lineup Across All Tiers
-
Qwen3.5-35B-A3B Emerges as Game-Changer for Agentic Coding Tasks
-
PyTorch Foundation Announces New Members as Agentic AI Demand Grows
-
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices
-
Show HN: A Ground Up TLS 1.3 Client Written in C
-
Meta's OpenClaw Release Raises Questions About Open-Source Model Safety and Alignment
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China
-
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy
-
Anthropic Reveals Industrial-Scale Distillation Attacks by Chinese AI Labs
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Making Wolfram Technology Available as Foundation Tool for LLM Systems
-
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
-
How Do You Know Which SKILL.md Is Good?
-
Qwen3's Voice Embeddings Enable Local Voice Cloning and Mathematical Voice Manipulation
-
Qwen3-Coder-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
-
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding
-
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities
-
A Tool to Tell You What LLMs Can Run on Your Machine
-
Open-Source llama.cpp Finds Long-Term Home at Hugging Face
-
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
-
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark
-
Gix: Go CLI for AI-Generated Commit Messages
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Show HN: The Only CLI Your AI Agent Will Need
-
AI-Powered Reverse-Engineering of Rosetta 2 for Linux
-
Security Alert: Fraudulent Shade Software Plagiarized from Heretic Project
-
Ollama 0.17 Released With Improved OpenClaw Onboarding
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration
-
GGML Joins Hugging Face: What This Means for Local Model Optimization
-
DietPi Released a New Version v10.1
-
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours
-
Vellium v0.3.5: Major Writing Mode Overhaul and Native KoboldCpp Support
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
Open-Source + AI: ggml Joins Hugging Face, llama.cpp Stays Open—Local AI's Long-Term Home
-
GGML.AI Acquired by Hugging Face
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro
-
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Kitten TTS V0.8 Released: New State-of-the-Art Super-Tiny TTS Model Under 25 MB
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows
-
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
-
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released
-
Why My Country's AI Scene Is Built on Sand
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Ask HN: What is the best bang for buck budget AI coding?
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
InitRunner: YAML-Based AI Agent Framework with RAG and Memory
-
Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
-
GNOME's AI Assistant Newelle Adds llama.cpp Support and Command Execution
-
ByteDance Releases Seed2.0 LLM with Complex Real-World Task Improvements
-
WinClaw: Windows-Native AI Assistant with Office Automation
-
GitHub Announces Support for Open Source AI Project Maintainers
-
MiniMax M2.5: 230B Parameter MoE Model Coming to HuggingFace
-
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free
-
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
-
Godot MCP Gives AI Assistants Full Access to Game Engine Editor
-
DeepSeek Launches Model Update with 1M Context Window
-
Anthropic Releases Claude Opus 4.6 Sabotage Risk Assessment
-
Community Member Builds 144GB VRAM Local LLM Powerhouse