Tagged "open-source"

I built Rubric, an open source Sentry for AI. Looking for beta testers 24 March 2026
Open-Source AI Text-to-Speech Models You Can Run Locally for Natural Voice 24 March 2026
Open-Source Tool Helps Determine Which Local LLMs Run on Your PC 24 March 2026
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant 24 March 2026
llm-d Joins the Cloud Native Computing Foundation 24 March 2026
Chinese LLM Ecosystem Landscape: ByteDance Doubao, Alibaba, and Open-Source Competition 24 March 2026
Self-Hostable AI Agents and Internal Software Framework Released 23 March 2026
MiniMax M2.7 Model to Be Released as Open Weights 23 March 2026
Alibaba Commits to Continuous Open-Sourcing of Qwen and Wan Models 23 March 2026
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives 22 March 2026
Qwen 3.5 122B Uncensored (Aggressive) Released with New K_P Quantisations 22 March 2026
Nvidia Nemotron Cascade 2 30B Emerges as Powerful Alternative to Qwen Models 22 March 2026
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference 22 March 2026
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach 22 March 2026
Careless Whisper – Personal Local Speech to Text 22 March 2026
BrowserOS 0.44.0 Release: Advances in Local AI Integration for Web-Based Applications 22 March 2026
Brezn – Decentralized Local Communication 22 March 2026
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization 22 March 2026
AI Playground for Developers Built in Vite and Python 22 March 2026
Pydantic-Deep: Production Deep Agents for Pydantic AI 21 March 2026
Cursor's Composer 2 model attribution dispute highlights open-source licensing concerns 21 March 2026
Your Site Content Is Powering AI. Your Bank Account Has No Idea 21 March 2026
Atuin v18.13 – Better Search, a PTY Proxy, and AI for Your Shell 21 March 2026
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis 20 March 2026
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks 20 March 2026
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services 20 March 2026
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options 20 March 2026
NVIDIA Nemotron Cascade 2 30B Delivers 120B-Class Performance in Compact Form Factor 20 March 2026
NVIDIA Nemotron 3 Nano 4B Enables On-Device Inference Directly in Web Browsers via WebGPU 20 March 2026
LMCache Dramatically Accelerates LLM Inference on Oracle Data Science Platform 20 March 2026
Llamafile 0.10 Released with GPU Support and Rebuilt Core 20 March 2026
Cybersecurity Skills for AI Agents – agentskills.io Standard Implementation 20 March 2026
Cursor's Composer 2 Model Analysis – Fine-Tuned Variant of Kimi K2.5 20 March 2026
Claude Code Permissions Hook – Delegate Permission Approval to LLM 20 March 2026
AI's Impact on Mathematics Analogous to Car's Impact on Cities 20 March 2026
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet 19 March 2026
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It 19 March 2026
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training 19 March 2026
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally 18 March 2026
Skills Manager – manage AI agent skills across Claude, Cursor, Copilot 18 March 2026
My Dinner with AI 18 March 2026
LucidShark – Local-first, open-source quality and security gate 18 March 2026
Show HN: Process Mining for AI Agent Systems 18 March 2026
Qwen 3.5 4B Outperforms Nvidia Nemotron 3 4B in Local Benchmarks 17 March 2026
Mistral Small 4 119B Released with NVFP4 Quantisation Support 17 March 2026
Mistral Releases Small 4 Open-Source Model Under Apache 2.0 17 March 2026
Local Qwen Models Master Browser Automation Through Iterative Replanning 17 March 2026
How I Used Lima for an AI Coding Agent Sandbox 17 March 2026
Mistral Releases Leanstral: First Open-Source Code Agent for Lean 4 Proof Assistant 17 March 2026
Kimi Introduces Attention Residuals: 1.25x Compute Performance at <2% Overhead 17 March 2026
The Moment AI Agents Stopped Being a Feature and Started Becoming a System 17 March 2026
How AI Agents Should Pay for API Calls: X402 and USDC Verification on Base 17 March 2026
OpenClaw Isn't the Only Raspberry Pi AI Tool—Here Are 4 Others You Can Try This Week 16 March 2026
Qwen 3.5 122B Demonstrates Exceptional Reasoning for Local Deployment 16 March 2026
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models 16 March 2026
OmniCoder-9B: Efficient Coding Model for 8GB GPUs 16 March 2026
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions 16 March 2026
Nota Added to Three Technology and Growth ETFs in a Row – Market Recognition for AI Efficiency 16 March 2026
LoKI – Local AI Assistant for Linux and WSL 16 March 2026
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local) 16 March 2026
Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One 16 March 2026
Apple's On-Device AI Raises Privacy Alarms Across British Parliament 16 March 2026
Show HN: Voice-tracked teleprompter using on-device ASR in the browser 15 March 2026
StepFun Releases SFT Dataset Used to Train Step 3.5 Flash for Community Fine-Tuning 15 March 2026
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms 15 March 2026
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment 15 March 2026
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot 15 March 2026
Hybrid AI Desktop Layer Combining DOM-Automation and API-Integrations 15 March 2026
Open-Source GreenBoost Driver Augments NVIDIA GPU VRAM With System RAM and NVMe Storage 15 March 2026
I made Karpathy's Autoresearch work on CPU 15 March 2026
Intel OpenVINO Backend Support Now Available in llama.cpp 14 March 2026
Local Manga Translator: Production LLM Pipeline with YOLO, OCR, and Inpainting 14 March 2026
Show HN: Intake API – An Inbox for AI Coding Agents 14 March 2026
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework 14 March 2026
Best Local LLM Models 2026: Developer Comparison 14 March 2026
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents 14 March 2026
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM 13 March 2026
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models 13 March 2026
How to Install OpenClaw with Ollama (Step-by-Step Tutorial) 13 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 12 March 2026
Qwodel – An Open-Source Unified Pipeline for LLM Quantization 12 March 2026
Nvidia Pushes Jetson as Edge Hub for Open AI Models 12 March 2026
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment 12 March 2026
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted) 12 March 2026
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI 11 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 11 March 2026
NVIDIA Jetson Brings Open Models to Life at the Edge 11 March 2026
LMF – LLM Markup Format 11 March 2026
Llama.cpp Celebrates Major Milestone: From Leak to Industry Standard 11 March 2026
Show HN: Aver – a Language Designed for AI to Write and Humans to Review 11 March 2026
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code 11 March 2026
PhotoPrism AI-Powered Photos App Brings Better Ollama Integration 10 March 2026
Mnemos: Persistent Memory System for Local AI Agents 10 March 2026
.ispec: Runtime Specification Validation for AI System Consistency 10 March 2026
Google Delivers On-Device AI Features in New Chromebook Plus Model 10 March 2026
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust 10 March 2026
FreeBSD 14.4 Released: Implications for Local LLM Deployment 10 March 2026
Fish Audio Open-Sources S2: Expressive Text-to-Speech with Natural Language Control and 100ms Latency 10 March 2026
Bash-Based Claude Code Agent: Lightweight Local AI Coding Assistant 10 March 2026
Community Survey: AI Content Automation Stacks in 2026 10 March 2026
VS Code Agent Kanban – Task Management for AI-Assisted Development 9 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 9 March 2026
Qwen 3.5 Small Expands On-Device AI to Phones and IoT with Offline Support 9 March 2026
Qwen 3.5 Derestricted Model Available for Local Deployment 9 March 2026
Gyro-Claw – Secure Execution Runtime for AI Agents 9 March 2026
FretBench – Testing 14 LLMs on Reading Guitar Tabs Reveals Performance Gaps 9 March 2026
Engram – Open-Source Persistent Memory for AI Agents 9 March 2026
commitgen-cc – Generate Conventional Commit Messages Locally with Ollama 9 March 2026
VoiceShelf: Fully Offline Android Audiobook Reader Using Kokoro TTS 9 March 2026
Reverse engineering a DOS game with no source code using Codex 5.4 8 March 2026
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds 8 March 2026
OpenSpec: Spec-driven development (SDD) for AI coding assistants 8 March 2026
Mistral AI Prepares Workflows Integration for Le Chat 8 March 2026
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications 8 March 2026
Show HN: Ivy – the first proactive, offline AI tutor 8 March 2026
Windows 11 Notepad Gets On-Device AI Text Generation Without Subscription 7 March 2026
Self-Hosted Paperless-ngx With Optional Local AI Integration 7 March 2026
Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch 7 March 2026
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages 7 March 2026
Qwen3-Coder-Next Achieves Top Ranking on SWE-bench at Pass@5 7 March 2026
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support 7 March 2026
Llama.cpp Merges Automatic Parser Generator to Mainline 7 March 2026
Jse v2.0 AI Output Specification 7 March 2026
IBM Granite 4.0 1B Speech Model Released for Multilingual Speech Recognition 7 March 2026
Show HN: Asterode – Multi-Model AI App with Memory and Power Features 7 March 2026
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support 7 March 2026
Show HN: TLDR – Free Chrome Extension for AI-Powered Article Summarization 6 March 2026
llama.cpp Merges Agentic Loop and MCP Client Support 6 March 2026
Imrobot – Reverse-CAPTCHA for Verifying AI Agents, Not Humans 6 March 2026
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment 6 March 2026
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations 6 March 2026
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust 4 March 2026
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard 4 March 2026
OpenWrt 25.12.0 – Stable Release 4 March 2026
Incrmd: Incremental AI Coding by Editing PROJECT.md 4 March 2026
Glyph – A Local-First Markdown Notes App for macOS Built With Rust 4 March 2026
Apple M5 Pro and M5 Max: 4× Faster LLM Processing 4 March 2026
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions 4 March 2026
VibeWhisper – macOS Voice-to-Text with 100% Local Processing Option 3 March 2026
Qwen 3.5 0.8B Successfully Deployed on 7-Year-Old Samsung S10E Using llama.cpp 3 March 2026
Qwen 3.5 0.8B Running in Browser with WebGPU via Transformers.js 3 March 2026
Open-Source Article 12 Logging Infrastructure for the EU AI Act 3 March 2026
Continuum – CI Drift Guard for LLM Workflows 3 March 2026
Jan Releases Code-Tuned 4B Model for Efficient Local Code Generation and Development Tasks 2 March 2026
GitDelivr: A Free CDN for Git Clones Built on Cloudflare Workers and R2 2 March 2026
C7: Pipe Up-to-Date Library Docs Into Any LLM From the Terminal 2 March 2026
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills 2 March 2026
RAG-Enterprise – 100% Local RAG System for Enterprise Documents 1 March 2026
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026 1 March 2026
4 Free Tools to Run Powerful AI on Your PC Without a Subscription 1 March 2026
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation 1 March 2026
Configure MCP Servers Once, Sync Them Everywhere 1 March 2026
AgentLens – Open-Source Observability for AI Agents 1 March 2026
Qwen 3.5-35B Unsloth Dynamic GGUFs Achieve SOTA Quantisation Benchmarks 28 February 2026
We Audited the Security of 7 Open-Source AI Agents – Here Is What We Found 28 February 2026
LLmFit: Terminal Tool for Right-Sizing LLM Models to Your Hardware 28 February 2026
LLmFit: One-Command Hardware-Aware Model Selection Across 497 Models and 133 Providers 28 February 2026
Krasis: Hybrid CPU/GPU MoE Runtime Achieves 3,324 Tokens/Second Prefill on RTX 5080 28 February 2026
Krasis Hybrid MoE Runtime Achieves 3,324 tok/s Prefill on Single RTX 5080 28 February 2026
On-Device Function Calling in Google AI Edge Gallery 27 February 2026
Show HN: Caret – Tab to Complete at Any App on Your Mac 27 February 2026
Arduino and Qualcomm Bring On-Device AI Learning to Indian Schools 27 February 2026
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required 26 February 2026
DeepSeek Paper – DualPath: Breaking the Bandwidth Bottleneck in LLM Inference 26 February 2026
Agent System – 7 specialized AI agents that plan, build, verify, and ship code 26 February 2026
Red Hat Launches AI Enterprise for Hybrid AI Deployments 25 February 2026
Qwen3.5 Series Releases Comprehensive Model Lineup Across All Tiers 25 February 2026
Qwen3.5-35B-A3B Emerges as Game-Changer for Agentic Coding Tasks 25 February 2026
PyTorch Foundation Announces New Members as Agentic AI Demand Grows 25 February 2026
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices 25 February 2026
Show HN: A Ground Up TLS 1.3 Client Written in C 24 February 2026
Meta's OpenClaw Release Raises Questions About Open-Source Model Safety and Alignment 24 February 2026
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search 24 February 2026
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP 24 February 2026
The Real AI Competition Is Closed-Source vs Open-Source, Not America vs China 24 February 2026
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy 24 February 2026
Anthropic Reveals Industrial-Scale Distillation Attacks by Chinese AI Labs 24 February 2026
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec 24 February 2026
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments 24 February 2026
Making Wolfram Technology Available as Foundation Tool for LLM Systems 23 February 2026
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours 23 February 2026
How Do You Know Which SKILL.md Is Good? 23 February 2026
Qwen3's Voice Embeddings Enable Local Voice Cloning and Mathematical Voice Manipulation 23 February 2026
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio 23 February 2026
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding 23 February 2026
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export 23 February 2026
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools 23 February 2026
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities 23 February 2026
A Tool to Tell You What LLMs Can Run on Your Machine 23 February 2026
Open-Source llama.cpp Finds Long-Term Home at Hugging Face 23 February 2026
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally 23 February 2026
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark 23 February 2026
Gix: Go CLI for AI-Generated Commit Messages 23 February 2026
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI 23 February 2026
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search 23 February 2026
Show HN: The Only CLI Your AI Agent Will Need 23 February 2026
AI-Powered Reverse-Engineering of Rosetta 2 for Linux 23 February 2026
Security Alert: Fraudulent Shade Software Plagiarized from Heretic Project 22 February 2026
Ollama 0.17 Released With Improved OpenClaw Onboarding 22 February 2026
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer 22 February 2026
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration 22 February 2026
GGML Joins Hugging Face: What This Means for Local Model Optimization 22 February 2026
DietPi Released a New Version v10.1 22 February 2026
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours 22 February 2026
Vellium v0.3.5: Major Writing Mode Overhaul and Native KoboldCpp Support 21 February 2026
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM 21 February 2026
Open-Source + AI: ggml Joins Hugging Face, llama.cpp Stays Open—Local AI's Long-Term Home 21 February 2026
GGML.AI Acquired by Hugging Face 21 February 2026
Claude Code Open – AI Coding Platform with Web IDE and Agents 21 February 2026
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro 20 February 2026
PaddleOCR-VL Now Integrated into llama.cpp for Multilingual OCR 20 February 2026
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx 20 February 2026
Kitten TTS V0.8 Released: New State-of-the-Art Super-Tiny TTS Model Under 25 MB 20 February 2026
Self-Hosted Local LLMs for Document Management with Paperless-ngx 19 February 2026
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows 19 February 2026
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB 19 February 2026
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released 19 February 2026
Why My Country's AI Scene Is Built on Sand 18 February 2026
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings 18 February 2026
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter 17 February 2026
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages 17 February 2026
Ask HN: What is the best bang for buck budget AI coding? 17 February 2026
Sourdine: Open-Source macOS App for 100% Local AI Transcription 16 February 2026
InitRunner: YAML-Based AI Agent Framework with RAG and Memory 16 February 2026
Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release 16 February 2026
GNOME's AI Assistant Newelle Adds llama.cpp Support and Command Execution 14 February 2026
ByteDance Releases Seed2.0 LLM with Complex Real-World Task Improvements 14 February 2026
WinClaw: Windows-Native AI Assistant with Office Automation 13 February 2026
GitHub Announces Support for Open Source AI Project Maintainers 13 February 2026
MiniMax M2.5: 230B Parameter MoE Model Coming to HuggingFace 13 February 2026
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free 12 February 2026
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts 11 February 2026
Godot MCP Gives AI Assistants Full Access to Game Engine Editor 11 February 2026
DeepSeek Launches Model Update with 1M Context Window 11 February 2026
Anthropic Releases Claude Opus 4.6 Sabotage Risk Assessment 11 February 2026
Community Member Builds 144GB VRAM Local LLM Powerhouse 11 February 2026