Tagged "gguf"

llama.cpp GGUF Parser Flaws: Critical Integer Overflow Enables Arbitrary Reads in Every Local AI Stack 27 May 2026
Ollama Out-of-Bounds Read Vulnerability Allows Remote Process Memory Leak 11 May 2026
Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers 21 April 2026
MiniMax M2.7 GGUF Investigation Reveals NaN Issues Affecting 21-38% of Hugging Face Conversions 15 April 2026
Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release 14 April 2026
Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite 12 April 2026
ByteShape Releases Qwen 3.5 9B Quantisations with Hardware-Matched Tuning Guide 1 April 2026
Coding Implementation to Run Qwen3.5 Reasoning Models Distilled With Claude-Style Thinking Using GGUF and 4-Bit Quantization 27 March 2026
Qwen 3.5 122B Uncensored (Aggressive) Released with New K_P Quantisations 22 March 2026
Qwen 3.5-35B Uncensored GGUF Models Now Available 11 March 2026
Final Qwen3.5 Unsloth GGUF Update with Improved Size/Quality Tradeoffs 6 March 2026
Qwen 3.5-27B Q4 Quantization Comparison and Analysis 4 March 2026
Unsloth Dynamic 2.0 GGUFs 28 February 2026
Qwen3.5-35B Unsloth Dynamic GGUFs Achieve SOTA Across Nearly All Quantisation Levels 28 February 2026
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export 23 February 2026
Ouro 2.6B Thinking Model GGUFs Released with Q8_0 and Q4_K_M Quantization 22 February 2026