Tagged "showcase"
-
Qwen3.5 Thinking Mode Can Be Disabled for Production Inference Optimization
-
Qwen3.5-27B Identified as Sweet Spot for Mid-Range Local Deployment
-
Show HN: Pluckr – LLM-Powered HTML Scraper That Caches Selectors and Auto-Heals
-
Show HN: MCP-Enabled File Storage for AI Agents, Auth via Ethereum Wallet
-
Show HN: 100% LLM Accuracy, No Fine-Tuning, JSON Only
-
Show HN: A Human-Curated, CLI-Driven Context Layer for AI Agents
-
Show HN: A Ground-Up TLS 1.3 Client Written in C
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
-
Qwen3's Voice Embeddings Enable Local Voice Cloning and Mathematical Voice Manipulation
-
Qwen3-Code-Next Proves Practical for Local Development: Real-World Coding Tasks on Mac Studio
-
Custom Portable Workstation Optimized for Local AI Inference Builds
-
Open-Source Framework Achieves Gemini 3 Deep Think-Level Performance Through Local Model Scaffolding
-
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities
-
A Tool to Tell You What LLMs Can Run on Your Machine
-
Gix: Go CLI for AI-Generated Commit Messages
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Elastic Introduces Best-in-Class Embedding Models for High-Performance Semantic Search
-
Show HN: The Only CLI Your AI Agent Will Need
-
Show HN: Tickr – AI Project Manager That Lives Inside Slack (Replaces Jira)
-
O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture
-
At India AI Impact Summit, Intel Showcases AI PCs and Cost-Efficient Frugal AI
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours
-
Vellium v0.3.5: Major Writing Mode Overhaul and Native KoboldCpp Support
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
Qwen3 Coder Next Remains Effective at Aggressive Quantization Levels
-
Apple Researchers Develop On-Device AI Agent That Interacts With Apps for You
-
24 Simultaneous Claude Code Agents on Local Hardware
-
Sarvam Brings AI to Feature Phones, Cars, and Smart Glasses
-
Running Local LLMs and VLMs on Arduino UNO Q with yzma
-
Enhanced Quantization Visualization Methods for Understanding LLM Compression Trade-offs
-
Complete Offline AI System: Voice Control and Smart Home via Local LLM and Radio Without Internet
-
Local Vision-Language Models for Document OCR and PII Detection in Privacy-Critical Workflows
-
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
-
Clipthesis: Free Local App for Video Tagging and Search Across Drives
-
Aegis.rs: Open Source Rust-Based LLM Security Proxy Released
-
Show HN: Shiro.computer Static Page, Unix/NPM Shimmed to Host Claude Code
-
Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
-
OpenClaw Refactored in Go, Runs on $10 Hardware
-
Matmul-Free Language Model Trained on CPU in 1.2 Hours
-
Cloudflare Releases Agents SDK v0.5.0 with Rust-Powered Infire Engine for Edge Inference
-
Qwen3-Next 80B MoE Achieves 39 Tokens/Second on RTX 5070/5060 Ti Dual-GPU Setup
-
Show HN: PgCortex – AI enrichment per Postgres row, zero transaction blocking
-
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection)
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
InitRunner: YAML-Based AI Agent Framework with RAG and Memory
-
GPU-Accelerated DataFrame Library for Local Inference Workloads
-
WinClaw: Windows-Native AI Assistant with Office Automation
-
First Vibecoded AI Operating System for Local Deployment
-
Running Mistral-7B on Intel NPU Achieves 12.6 Tokens/Second
-
Qwen Coder Next Shows Specialized Agent Performance
-
Microsoft MarkItDown: Document Preprocessing Tool for LLMs
-
Memio Launches AI-Powered Knowledge Hub for Android with Local Processing
-
New Header-Only C++ Benchmark Tool for Predictive Models on Raw Binary Streams
-
NAS System Achieves 18 tok/s with 80B LLM Using Only Integrated Graphics
-
Building a RAG Pipeline on 2M+ Pages: EpsteinFiles-RAG Project
-
Community Member Builds 144GB VRAM Local LLM Powerhouse