Tagged "rlocalllama"

Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma4 15 April 2026
MiniMax M2.7 GGUF Investigation Reveals NaN Issues Affecting 21-38% of Hugging Face Conversions 15 April 2026
OpenClaw at 250K GitHub Stars: Community Explores Practical Limitations Beyond News Digests 14 April 2026
MiniMax M2.7 Released: New Model Available for Local Deployment 12 April 2026
Critical Unsloth Gemma-4 Chat Template Updates for Tool Calling 11 April 2026
Building Offline AI Companions on Severely Constrained Hardware (8GB RAM) 10 April 2026
Gemma 4 Template Improvements Enhance Tool Use and Dialog Compliance 10 April 2026
Community Reverse Engineers Gemma 4 Multi-Token Prediction Capability 10 April 2026
Hugging Face Moves Safetensors Under PyTorch Foundation 9 April 2026
Gemma 4 GGUF Models Updated with Critical Quantization Fixes 9 April 2026
Comprehensive Benchmark: 37 LLMs Tested on MacBook Air M5 With Open-Source Tool 7 April 2026
Context Window Optimization: Extending Gemma 4 Context Length Through Efficient Projection Quantization 6 April 2026
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware 5 April 2026
Netflix Open-Sources VOID Model for Video Object Deletion 4 April 2026
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference 4 April 2026
VRAM Optimization Technique Cuts Gemma 4 Memory Usage by 3x 3 April 2026
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3 1 April 2026
TurboQuant: Understanding the Quantization Breakthrough 29 March 2026
Mixed KV Cache Quantization: Performance Risks and Pitfalls 29 March 2026
Qwen 3.5 27B Achieves 1.1M Tokens/Second on B200 GPUs with Optimized vLLM Config 27 March 2026
Real-World Benchmark: DeepSeek-V3 Matches Claude Sonnet on Routine Coding Tasks 26 March 2026
OmniCoder v2 Released: Improved Code Generation for Local Deployment 25 March 2026
New Open-Weight Models Released: GigaChat-3.1-Ultra and Lightning Variants 25 March 2026
Llama.cpp Benchmark: RTX 5090 vs Enterprise Systems Compared 25 March 2026
Critical: LiteLLM Supply Chain Attack Detected, Bifrost Alternative Released 25 March 2026