Tagged "gguf"
- Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers
- MiniMax M2.7 GGUF Investigation Reveals NaN Issues Affecting 21-38% of Hugging Face Conversions
- Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release
- Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite
- ByteShape Releases Qwen 3.5 9B Quantisations with Hardware-Matched Tuning Guide
- Coding Implementation to Run Qwen3.5 Reasoning Models Distilled With Claude-Style Thinking Using GGUF and 4-Bit Quantization
- Qwen 3.5 122B Uncensored (Aggressive) Released with New K_P Quantisations
- Qwen 3.5-35B Uncensored GGUF Models Now Available
- Final Qwen3.5 Unsloth GGUF Update with Improved Size/Quality Tradeoffs
- Qwen 3.5-27B Q4 Quantization Comparison and Analysis
- Unsloth Dynamic 2.0 GGUFs
- Qwen3.5-35B Unsloth Dynamic GGUFs Achieve SOTA Across Nearly All Quantisation Levels
- nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
- Ouro 2.6B Thinking Model GGUFs Released with Q8_0 and Q4_K_M Quantization