Tagged "hugging-face"
- Hugging Face Releases One-Liner for Automatic Hardware Detection and Model Selection
- Mistral Small 4 119B Released with NVFP4 Quantisation Support
- OmniCoder-9B: Efficient Coding Model for 8GB GPUs
- Cicikus v3 Prometheus 4.4B – An Experimental Franken-Merge for Edge Reasoning
- Qwen 3.5 Derestricted Model Available for Local Deployment
- Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch
- Local LLM Performance Improvements: A Year of Progress Since the DeepSeek R1 Moment
- Open-Source llama.cpp Finds Long-Term Home at Hugging Face
- O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture
- GGML Joins Hugging Face: What This Means for Local Model Optimization
- Open-Source + AI: ggml Joins Hugging Face, llama.cpp Stays Open—Local AI's Long-Term Home
- GGML.AI Acquired by Hugging Face
- Matmul-Free Language Model Trained on CPU in 1.2 Hours
- Qwen 3.5-397B-A17B Now Available for Local Inference with Aggressive Quantisation
- GPT-OSS 20B Now Runs 100% Locally in Browser via WebGPU
- MiniMax M2.5: 230B Parameter MoE Model Coming to Hugging Face