Tagged "tutorial"
- Building a Local AI Stack: Five Docker Containers to Replace ChatGPT Subscriptions
- Run a Local LLM Server on Raspberry Pi with Remote Access Capabilities
- Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions
- Using a Local LLM as a Zero-Shot Classifier
- I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
- How to Make Sense of AI
- Llama 4 Scout on MLX: The Complete Apple Silicon Guide (2026)
- 10GB VRAM Local LLM: The Complete Setup Guide (2026)
- My AI Workflow: Practical Guide to Using AI Without Skill Atrophy
- 16 Ways to Make a Small Language Model Think Bigger
- Controlling the Secondary Fan on Minisforum AI Pro HX 370
- Running DeepSeek R1 Locally: Your Complete Setup Guide
- Web Agent Bridge: Open-Source OS for AI Agents
- I Built a Local AI Stack with 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
- BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism
- Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma4
- Building Practical Local Coding Assistants: A Working Stack for Editor Integration
- GPU Passthrough to LXCs in Proxmox Simplifies Local Inference Infrastructure
- DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend
- Talking to a Local LLM in the Firefox Sidebar
- Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026
- Learn LLM Internals
- The Best Local AI Model for Home Assistant Isn't Always the Biggest One
- I Gave My AI Shell Access and Felt Uneasy – So I Sandboxed It
- Aisbf (AI Should Be Free) Proxy 0.99.18 Released
- Run Qwen3.5 on an Old Laptop: A Lightweight Local Agentic AI Setup Guide
- Running AI Natively on Windows 11 Using an eGPU
- GPU Memory for LLM Inference (Part 1)
- Unpaved: Audit Toolkit for AI Developer Tool Bias in Global South Contexts
- Run AutoGEN with Ollama and LiteLLM in Simple Steps
- 5 Useful Docker Containers for Agentic Developers
- April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini
- Building Cross-Platform Ollama Dashboards with 95% Shared Code
- VRAM Optimization Technique Cuts Gemma 4 Memory Usage by 3x
- How to Integrate VS Code with Ollama for Local AI Assistance
- A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
- Running AI on a Raspberry Pi, Part 2: Running AI on a Pi in Under 5 minutes
- I built an O(1) physics engine to stop LLM hallucinations in construction
- DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026
- DeepSeek-R1 Chain-of-Thought Debugging: A Developer's Guide
- GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
- AI Slop or Quality Storytelling? – Dune Themed MCP Gateway Tutorial
- .APKs Are Just .ZIPs: Semi-Legally Hacking Software for Orphaned Hardware
- A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
- How to Build a Self-Hosted AI Server with LM Studio: Step-by-Step Guide
- Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment
- Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization
- Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide
- Pydantic-Deep: Production Deep Agents for Pydantic AI
- Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue
- Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
- Community Converges on Optimal KV Cache Quantization Strategies for Qwen 3.5 Models
- You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM
- Run LLMs Locally with Llama.cpp
- How I Used Lima for an AI Coding Agent Sandbox
- Practical Fix for Qwen 3.5 Overthinking in llama.cpp
- Show HN: Voice-tracked teleprompter using on-device ASR in the browser
- Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel
- I made Karpathy's Autoresearch work on CPU
- Local LLMs on Apple Silicon Mac 2026: M1 M2 M3 Guide
- How to Run Local LLMs in 2026: The Complete Developer's Guide
- How to Install OpenClaw with Ollama (Step-by-Step Tutorial)
- Quantization Explained: Q4_K_M vs AWQ vs FP16 for Local LLMs
- The $1,500 Local AI Setup: DeepSeek-R1 on Consumer Hardware
- Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup
- 8 Local LLM Settings Most People Never Touch That Fixed My Worst AI Problems
- How to Run Your Own Local LLM — 2026 Edition
- Llama.cpp Prompt Processing Optimization: Ubatch Size Configuration Guide
- Self-Hosted Paperless-ngx With Optional Local AI Integration
- Turning Your Linux Terminal into a Local AI Assistant
- Jse v2.0 AI Output Specification
- How to Run High-Performance LLMs Locally on the Arduino UNO Q
- 5 Useful Docker Containers for Agentic Developers
- Accuracy vs. Speed in Local LLMs: Finding Your Sweet Spot
- 5 Useful Docker Containers for Agentic Developers
- Running LLMs on Raspberry Pi and Edge Devices: A Practical Guide
- Every agent framework has the same bug – prompt decay. Here's a fix
- Building a Privacy-Preserving RAG System in the Browser
- Ollama for JavaScript Developers: Building AI Apps Without API Keys
- The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production
- Qwen3.5-27B Identified as Sweet Spot for Mid-Range Local Deployment
- The Complete Stack for Local Autonomous Agents: From GGML to Orchestration
- Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
- I Thought I Needed a GPU to Run AI Until I Learned About These Models
- Ollama Production Deployment: Docker-Compose Setup Guide
- AI Integration in Sublime Text: Practical Local LLM Editor Enhancement
- Running Local LLMs and VLMs on Arduino UNO Q with yzma
- Local-First RAG: Vector Search in SQLite with Hamming Distance
- Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong?
- Self-Hosted AI: A Complete Roadmap for Beginners
- Qwen3-Next 80B MoE Achieves 39 Tokens/Second on RTX 5070/5060 Ti Dual-GPU Setup
- InitRunner: YAML-Based AI Agent Framework with RAG and Memory
- Switching From Ollama and LM Studio to llama.cpp: Performance Benefits
- Optimal llama.cpp Settings Found for Qwen3 Coder Next Loop Issues
- Running Your Own AI Assistant for €19/Month: Complete Self-Hosting Guide
- OpenClaw with vLLM Running for Free on AMD Developer Cloud
- 5 Practical Ways to Use Local LLMs with MCP Tools