Tagged "open-source"
-
I built Rubric, an open source Sentry for AI. Looking for beta testers
-
Open-Source AI Text-to-Speech Models You Can Run Locally for Natural Voice
-
Open-Source Tool Helps Determine Which Local LLMs Run on Your PC
-
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
-
llm-d Joins the Cloud Native Computing Foundation
-
Chinese LLM Ecosystem Landscape: ByteDance Doubao, Alibaba, and Open-Source Competition
-
Self-Hostable AI Agents and Internal Software Framework Released
-
MiniMax M2.7 Model to Be Released as Open Weights
-
Alibaba Commits to Continuous Open-Sourcing of Qwen and Wan Models
-
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives
-
Qwen 3.5 122B Uncensored (Aggressive) Released with New K_P Quantisations
-
Nvidia Nemotron Cascade 2 30B Emerges as Powerful Alternative to Qwen Models
-
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Careless Whisper – Personal Local Speech to Text
-
BrowserOS 0.44.0 Release: Advances in Local AI Integration for Web-Based Applications
-
Brezn – Decentralized Local Communication
-
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization
-
AI Playground for Developers Built in Vite and Python
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
Cursor's Composer 2 model attribution dispute highlights open-source licensing concerns
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Atuin v18.13 – Better Search, a PTY Proxy, and AI for Your Shell
-
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis
-
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
-
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services
-
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options
-
NVIDIA Nemotron Cascade 2 30B Delivers 120B-Class Performance in Compact Form Factor
-
NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU
-
LMCache Dramatically Accelerates LLM Inference on Oracle Data Science Platform
-
Llamafile 0.10 Released with GPU Support and Rebuilt Core
-
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation
-
Cursor's Composer 2 Model Analysis – Fine-Tuned Variant of Kimi K2.5
-
Claude Code Permissions Hook – Delegate Permission Approval to LLM
-
AI's Impact on Mathematics Analogous to Car's Impact on Cities
-
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
-
Skills Manager – manage AI agent skills across Claude, Cursor, Copilot
-
My Dinner with AI
-
LucidShark – Local-first, open-source quality and security gate
-
Show HN: Process Mining for AI Agent Systems
-
Qwen 3.5 4B Outperforms Nvidia Nemotron 3 4B in Local Benchmarks
-
Mistral Small 4 119B Released with NVFP4 Quantisation Support
-
Mistral Releases Small 4 Open-Source Model Under Apache 2.0
-
Local Qwen Models Master Browser Automation Through Iterative Replanning
-
How I Used Lima for an AI Coding Agent Sandbox
-
Mistral Releases Leanstral: First Open-Source Code Agent for Lean 4 Proof Assistant
-
Kimi Introduces Attention Residuals: 1.25x Compute Performance at <2% Overhead
-
The Moment AI Agents Stopped Being a Feature and Started Becoming a System
-
How AI Agents Should Pay for API Calls: X402 and USDC Verification on Base
-
OpenClaw Isn't the Only Raspberry Pi AI Tool—Here Are 4 Others You Can Try This Week
-
Qwen 3.5 122B Demonstrates Exceptional Reasoning for Local Deployment
-
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models
-
OmniCoder-9B: Efficient Coding Model for 8GB GPUs
-
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions
-
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency
-
LoKI – Local AI Assistant for Linux and WSL
-
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local)
-
Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One
-
Apple's On-Device AI Raises Privacy Alarms Across British Parliament
-
Show HN: Voice-tracked teleprompter using on-device ASR in the browser
-
StepFun Releases SFT Dataset Used to Train Step 3.5 Flash for Community Fine-Tuning
-
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms
-
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment
-
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot
-
Hybrid AI Desktop Layer Combining DOM-Automation and API-Integrations
-
Open-Source GreenBoost Driver Augments NVIDIA GPU VRAM With System RAM and NVMe Storage
-
I made Karpathy's Autoresearch work on CPU
-
Intel OpenVINO Backend Support Now Available in llama.cpp
-
Local Manga Translator: Production LLM Pipeline with YOLO, OCR, and Inpainting
-
Show HN: Intake API – An Inbox for AI Coding Agents
-
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework
-
Best Local LLM Models 2026: Developer Comparison
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM
-
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models
-
How to Install OpenClaw with Ollama (Step-by-Step Tutorial)
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
Qwodel – An Open-Source Unified Pipeline for LLM Quantization
-
Nvidia Pushes Jetson as Edge Hub for Open AI Models
-
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
NVIDIA Jetson Brings Open Models to Life at the Edge
-
LMF – LLM Markup Format
-
Llama.cpp Celebrates Major Milestone: From Leak to Industry Standard
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code
-
PhotoPrism AI-Powered Photos App Brings Better Ollama Integration
-
Mnemos: Persistent Memory System for Local AI Agents
-
.ispec: Runtime Specification Validation for AI System Consistency
-
Google Delivers On-Device AI Features in New Chromebook Plus Model
-
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
-
FreeBSD 14.4 Released: Implications for Local LLM Deployment
-
Fish Audio Open-Sources S2: Expressive Text-to-Speech with Natural Language Control and 100ms Latency
-
Bash-Based Claude Code Agent: Lightweight Local AI Coding Assistant
-
Community Survey: AI Content Automation Stacks in 2026
-
VS Code Agent Kanban – Task Management for AI-Assisted Development
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
Qwen 3.5 Small Expands On-Device AI to Phones and IoT with Offline Support
-
Qwen 3.5 Derestricted Model Available for Local Deployment
-
Gyro-Claw – Secure Execution Runtime for AI Agents
-
FretBench – Testing 14 LLMs on Reading Guitar Tabs Reveals Performance Gaps
-
Engram – Open-Source Persistent Memory for AI Agents
-
commitgen-cc – Generate Conventional Commit Messages Locally with Ollama
-
VoiceShelf: Fully Offline Android Audiobook Reader Using Kokoro TTS
-
Reverse engineering a DOS game with no source code using Codex 5.4
-
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds
-
OpenSpec: Spec-driven development (SDD) for AI coding assistants
-
Mistral AI Prepares Workflows Integration for Le Chat
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
Show HN: Ivy – the first proactive, offline AI tutor
-
Windows 11 Notepad Gets On-Device AI Text Generation Without Subscription
-
Self-Hosted Paperless-ngx With Optional Local AI Integration
-
Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Qwen3-Coder-Next Achieves Top Ranking on SWE-bench at Pass@5
-
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support
-
Llama.cpp Merges Automatic Parser Generator to Mainline
-
Jse v2.0 AI Output Specification
-
IBM Granite 4.0 1B Speech Model Released for Multilingual Speech Recognition
-
Show HN: Asterode – Multi-Model AI App with Memory and Power Features
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Show HN: TLDR – Free Chrome Extension for AI-Powered Article Summarization
-
llama.cpp Merges Agentic Loop and MCP Client Support
-
Imrobot – Reverse-CAPTCHA for Verifying AI Agents, Not Humans
-
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment
-
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations
-
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust
-
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard
-
OpenWrt 25.12.0 – Stable Release
-
Incrmd: Incremental AI Coding by Editing PROJECT.md
-
Glyph – A Local-First Markdown Notes App for macOS Built With Rust
-
Apple M5 Pro and M5 Max: 4× Faster LLM Processing
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
VibeWhisper – macOS Voice-to-Text with 100% Local Processing Option
-
Qwen 3.5 0.8B Successfully Deployed on 7-Year-Old Samsung S10E Using llama.cpp
-
Qwen 3.5 0.8B Running in Browser with WebGPU via Transformers.js
-
Open-Source Article 12 Logging Infrastructure for the EU AI Act
-
Continuum – CI Drift Guard for LLM Workflows
-
Jan Releases Code-Tuned 4B Model for Efficient Local Code Generation and Development Tasks
-
GitDelivr: A Free CDN for Git Clones Built on Cloudflare Workers and R2
-
C7: Pipe Up-to-Date Library Docs Into Any LLM From the Terminal
-
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
4 Free Tools to Run Powerful AI on Your PC Without a Subscription
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
Configure MCP Servers Once, Sync Them Everywhere
-
AgentLens – Open-Source Observability for AI Agents
-
Qwen 3.5-35B Unsloth Dynamic GGUFs Achieve SOTA Quantisation Benchmarks
-
We Audited the Security of 7 Open-Source AI Agents – Here Is What We Found
-
LLmFit: Terminal Tool for Right-Sizing LLM Models to Your Hardware
-
LLmFit: One-Command Hardware-Aware Model Selection Across 497 Models and 133 Providers
-
Krasis: Hybrid CPU/GPU MoE Runtime Achieves 3,324 Tokens/Second Prefill on RTX 5080
-
Krasis Hybrid MoE Runtime Achieves 3,324 tok/s Prefill on Single RTX 5080
-
On-Device Function Calling in Google AI Edge Gallery
-
Show HN: Caret – Tab to Complete at Any App on Your Mac
-
Arduino and Qualcomm Bring On-Device AI Learning to Indian Schools
-
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required
-
DeepSeek Paper – DualPath: Breaking the Bandwidth Bottleneck in LLM Inference
-
Agent System – 7 specialized AI agents that plan, build, verify, and ship code
-
Red Hat Launches AI Enterprise for Hybrid AI Deployments
-
Qwen3.5 Series Releases Comprehensive Model Lineup Across All Tiers
-
Qwen3.5-35B-A3B Emerges as Game-Changer for Agentic Coding Tasks
-
PyTorch Foundation Announces New Members as Agentic AI Demand Grows
-
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices
-
Show HN: A Ground Up TLS 1.3 Client Written in C
-
Meta's OpenClaw Release Raises Questions About Open-Source Model Safety and Alignment
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China
-
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy
-
Anthropic Reveals Industrial-Scale Distillation Attacks by Chinese AI Labs
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Making Wolfram Technology Available as Foundation Tool for LLM Systems
-
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
-
How Do You Know Which SKILL.md Is Good?
-
Qwen3's Voice Embeddings Enable Local Voice Cloning and Mathematical Voice Manipulation
-
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
-
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding
-
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities
-
A Tool to Tell You What LLMs Can Run on Your Machine
-
Open-Source llama.cpp Finds Long-Term Home at Hugging Face
-
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
-
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark
-
Gix: Go CLI for AI-Generated Commit Messages
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: The Only CLI Your AI Agent Will Need
-
AI-Powered Reverse-Engineering of Rosetta 2 for Linux
-
Security Alert: Fraudulent Shade Software Plagiarized from Heretic Project
-
Ollama 0.17 Released With Improved OpenClaw Onboarding
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration
-
GGML Joins Hugging Face: What This Means for Local Model Optimization
-
DietPi Released a New Version v10.1
-
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours
-
Vellium v0.3.5: Major Writing Mode Overhaul and Native KoboldCpp Support
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
Open-Source + AI: ggml Joins Hugging Face, llama.cpp Stays Open—Local AI's Long-Term Home
-
GGML.AI Acquired by Hugging Face
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro
-
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Kitten TTS V0.8 Released: New State-of-the-Art Super-Tiny TTS Model Under 25 MB
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows
-
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
-
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released
-
Why My Country's AI Scene Is Built on Sand
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Ask HN: What is the best bang for buck budget AI coding?
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
InitRunner: YAML-Based AI Agent Framework with RAG and Memory
-
Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
-
GNOME's AI Assistant Newelle Adds llama.cpp Support and Command Execution
-
ByteDance Releases Seed2.0 LLM with Complex Real-World Task Improvements
-
WinClaw: Windows-Native AI Assistant with Office Automation
-
GitHub Announces Support for Open Source AI Project Maintainers
-
MiniMax M2.5: 230B Parameter MoE Model Coming to HuggingFace
-
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free
-
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
-
Godot MCP Gives AI Assistants Full Access to Game Engine Editor
-
DeepSeek Launches Model Update with 1M Context Window
-
Anthropic Releases Claude Opus 4.6 Sabotage Risk Assessment
-
Community Member Builds 144GB VRAM Local LLM Powerhouse