Tagged "tutorial"

Supply Chain DLP: Stop Leaked .env Files, Credentials, SSH Keys, and API Tokens 2 June 2026
Good LLM Development and Usage Patterns 2 June 2026
How to Run LLM Locally Without Falling for the Hype 1 June 2026
Fine-tuning an LLM to Write Docs Like It's 1995 1 June 2026
Real-time LLM Inference on Standard GPUs: 3k tokens/s per request 29 May 2026
Tweaking Local Language Model Settings with Ollama 29 May 2026
The Infrastructure Behind Making Local LLM Agents Actually Useful 29 May 2026
The Anatomy of an LLM 28 May 2026
Local LLM Setup: How to Use RAG and an Embedding Model to Stop Wasting Context 27 May 2026
Developer Builds Local AI Coding Setup with Editor Integration, Zero Cloud Dependency 24 May 2026
Why Your Docker Container Is 1.2GB When It Should Be 80MB 24 May 2026
How to Self-Host LibreChat with Docker 23 May 2026
Deploying Hermes Agent for Free on AMD Developer Cloud with Open Models and vLLM 22 May 2026
How to Train Your GPT: Comprehensive Commented Training Guide 16 May 2026
Kog AI – Building a Real-Time Inference Stack on AMD Instinct GPUs 15 May 2026
Arm and Google Collaborate on On-Device AI Optimization Techniques 15 May 2026
Running AI Models Locally on M4 Processors with 24GB Memory 14 May 2026
How I Used a Local LLM to Organize the Store on My NAS 13 May 2026
Running a Local LLM on a 12-Year-Old Raspberry Pi: Practical Edge Inference 12 May 2026
One LM Studio Setting Change Makes Local LLMs Competitive With Cloud Models 11 May 2026
DFlash Speculative Decoding Delivers 8.5x Speed Improvement for LLM Inference 11 May 2026
Deploying Frigate & Ollama On A Minisforum MS-A2 Server 11 May 2026
Qwen3-Coder-Next Local Deployment: Complete Developer Guide for 2026 10 May 2026
Continue.dev for Developers: Complete Local AI Coding Assistant Setup 10 May 2026
Claude Code with Local LLM Running Offline: The Hybrid Setup You Didn't Know You Needed 10 May 2026
How to Run LLMs Locally on Your Laptop for Free: A Beginner's Guide 9 May 2026
Show HN: Runs AI Coding Agents Inside Isolated Docker Containers 8 May 2026
How to make SSE token streams resumable, cancellable, and multi-device 7 May 2026
Claude Code with a Local LLM Running Offline Is the Hybrid Setup I Didn't Know I Needed 7 May 2026
Improving Code Quality with Local Claude and Codex Models 6 May 2026
5 Things I Wish Someone Had Told Me Before I Tried Self-Hosting a Local LLM 5 May 2026
A 49-Line Physics Classifier That Beats kNN on 76% of Benchmarks 5 May 2026
How to Test AI Agents When They Never Give the Same Answer Twice 3 May 2026
How to Make SSE Token Streams Resumable, Cancellable, and Multi-Device 1 May 2026
Building a Remote-Accessible Local LLM Server on Raspberry Pi 30 April 2026
Building a Local AI Stack: Five Docker Containers to Replace ChatGPT Subscriptions 28 April 2026
Run a Local LLM Server on Raspberry Pi with Remote Access Capabilities 25 April 2026
Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions 25 April 2026
Using a Local LLM as a Zero-Shot Classifier 24 April 2026
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again 24 April 2026
How to Make Sense of AI 24 April 2026
Llama 4 Scout on MLX: The Complete Apple Silicon Guide (2026) 23 April 2026
10GB VRAM Local LLM: The Complete Setup Guide (2026) 23 April 2026
My AI Workflow: Practical Guide to Using AI Without Skill Atrophy 22 April 2026
16 Ways to Make a Small Language Model Think Bigger 21 April 2026
Controlling the Secondary Fan on Minisforum AI Pro HX 370 20 April 2026
Running DeepSeek R1 Locally: Your Complete Setup Guide 20 April 2026
Web Agent Bridge: Open-Source OS for AI Agents 19 April 2026
I Built a Local AI Stack with 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again 18 April 2026
BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism 18 April 2026
Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma4 15 April 2026
Building Practical Local Coding Assistants: A Working Stack for Editor Integration 15 April 2026
GPU Passthrough to LXCs in Proxmox Simplifies Local Inference Infrastructure 15 April 2026
DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend 15 April 2026
Talking to a Local LLM in the Firefox Sidebar 14 April 2026
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026 13 April 2026
Learn LLM Internals 13 April 2026
The Best Local AI Model for Home Assistant Isn't Always the Biggest One 12 April 2026
I Gave My AI Shell Access and Felt Uneasy – So I Sandboxed It 12 April 2026
Aisbf (AI Should Be Free) Proxy 0.99.18 Released 11 April 2026
Run Qwen3.5 on an Old Laptop: A Lightweight Local Agentic AI Setup Guide 9 April 2026
Running AI Natively on Windows 11 Using an eGPU 7 April 2026
GPU Memory for LLM Inference (Part 1) 6 April 2026
Unpaved: Audit Toolkit for AI Developer Tool Bias in Global South Contexts 5 April 2026
Run AutoGEN with Ollama and LiteLLM in Simple Steps 5 April 2026
5 Useful Docker Containers for Agentic Developers 4 April 2026
April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini 3 April 2026
Building Cross-Platform Ollama Dashboards with 95% Shared Code 3 April 2026
VRAM Optimization Technique Cuts Gemma 4 Memory Usage by 3x 3 April 2026
How to Integrate VS Code with Ollama for Local AI Assistance 2 April 2026
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant 2 April 2026
Running AI on a Raspberry Pi, Part 2: Running AI on a Pi in Under 5 minutes 31 March 2026
I built an O(1) physics engine to stop LLM hallucinations in construction 31 March 2026
DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026 30 March 2026
DeepSeek-R1 Chain-of-Thought Debugging: A Developer's Guide 30 March 2026
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment 28 March 2026
AI Slop or Quality Storytelling? – Dune Themed MCP Gateway Tutorial 25 March 2026
.APKs Are Just .ZIPs: Semi-Legally Hacking Software for Orphaned Hardware 25 March 2026
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant 24 March 2026
How to Build a Self-Hosted AI Server with LM Studio: Step-by-Step Guide 23 March 2026
Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment 22 March 2026
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization 22 March 2026
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide 21 March 2026
Pydantic-Deep: Production Deep Agents for Pydantic AI 21 March 2026
Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue 21 March 2026
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090 21 March 2026
Community Converges on Optimal KV Cache Quantization Strategies for Qwen 3.5 Models 20 March 2026
You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM 18 March 2026
Run LLMs Locally with Llama.cpp 17 March 2026
How I Used Lima for an AI Coding Agent Sandbox 17 March 2026
Practical Fix for Qwen 3.5 Overthinking in llama.cpp 16 March 2026
Show HN: Voice-tracked teleprompter using on-device ASR in the browser 15 March 2026
Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel 15 March 2026
I made Karpathy's Autoresearch work on CPU 15 March 2026
Local LLMs on Apple Silicon Mac 2026: M1 M2 M3 Guide 14 March 2026
How to Run Local LLMs in 2026: The Complete Developer's Guide 14 March 2026
How to Install OpenClaw with Ollama (Step-by-Step Tutorial) 13 March 2026
Quantization Explained: Q4_K_M vs AWQ vs FP16 for Local LLMs 12 March 2026
The $1,500 Local AI Setup: DeepSeek-R1 on Consumer Hardware 12 March 2026
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup 12 March 2026
8 Local LLM Settings Most People Never Touch That Fixed My Worst AI Problems 10 March 2026
How to Run Your Own Local LLM — 2026 Edition 9 March 2026
Llama.cpp Prompt Processing Optimization: Ubatch Size Configuration Guide 8 March 2026
Self-Hosted Paperless-ngx With Optional Local AI Integration 7 March 2026
Turning Your Linux Terminal into a Local AI Assistant 7 March 2026
Jse v2.0 AI Output Specification 7 March 2026
How to Run High-Performance LLMs Locally on the Arduino UNO Q 1 March 2026
5 Useful Docker Containers for Agentic Developers 28 February 2026
Accuracy vs. Speed in Local LLMs: Finding Your Sweet Spot 28 February 2026
5 Useful Docker Containers for Agentic Developers 27 February 2026
Running LLMs on Raspberry Pi and Edge Devices: A Practical Guide 26 February 2026
Every agent framework has the same bug – prompt decay. Here's a fix 26 February 2026
Building a Privacy-Preserving RAG System in the Browser 26 February 2026
Ollama for JavaScript Developers: Building AI Apps Without API Keys 26 February 2026
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production 26 February 2026
Qwen3.5-27B Identified as Sweet Spot for Mid-Range Local Deployment 25 February 2026
The Complete Stack for Local Autonomous Agents: From GGML to Orchestration 23 February 2026
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference 23 February 2026
I Thought I Needed a GPU to Run AI Until I Learned About These Models 21 February 2026
Ollama Production Deployment: Docker-Compose Setup Guide 20 February 2026
AI Integration in Sublime Text: Practical Local LLM Editor Enhancement 19 February 2026
Running Local LLMs and VLMs on Arduino UNO Q with yzma 19 February 2026
Local-First RAG: Vector Search in SQLite with Hamming Distance 19 February 2026
Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong? 18 February 2026
Self-Hosted AI: A Complete Roadmap for Beginners 17 February 2026
Qwen3-Next 80B MoE Achieves 39 Tokens/Second on RTX 5070/5060 Ti Dual-GPU Setup 17 February 2026
InitRunner: YAML-Based AI Agent Framework with RAG and Memory 16 February 2026
Switching From Ollama and LM Studio to llama.cpp: Performance Benefits 13 February 2026
Optimal llama.cpp Settings Found for Qwen3 Coder Next Loop Issues 13 February 2026
Running Your Own AI Assistant for €19/Month: Complete Self-Hosting Guide 12 February 2026
OpenClaw with vLLM Running for Free on AMD Developer Cloud 12 February 2026
5 Practical Ways to Use Local LLMs with MCP Tools 11 February 2026