Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard

1 min read
r/LocalLLaMApublisher

Qwen's latest 35B parameter model has achieved a landmark 37.8% score on SWE-bench Verified Hard, nearly matching Claude Opus 4.6's 40% performance with the right verification strategy. This represents a major breakthrough for local model deployment, as it demonstrates that reasonably-sized open models can now compete with enterprise closed-source solutions on complex software engineering tasks.

For practitioners, this means you can now run a self-hosted coding assistant that performs at near-state-of-the-art levels without relying on API calls or subscriptions. The 35B size is deployable on mid-range GPUs (24GB+ VRAM), making it practical for development teams seeking cost savings and data privacy. This benchmark validates that local LLMs are no longer just hobbyist tools—they're viable for production engineering workflows.


Source: r/LocalLLaMA · Relevance: 9/10