Tagged "training"
- Chinese LLM Ecosystem Landscape: ByteDance Doubao, Alibaba, and Open-Source Competition
- Building a Production AI Receptionist: Practical Local LLM Deployment Case Study
- Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue
- Cursor's Composer 2 Model Attribution Dispute Highlights Open-Source Licensing Concerns
- Your Site Content Is Powering AI. Your Bank Account Has No Idea
- Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
- NVIDIA Nemotron Cascade 2 30B Delivers 120B-Class Performance in Compact Form Factor
- Cursor's Composer 2 Model Analysis – Fine-Tuned Variant of Kimi K2.5
- Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
- Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
- On-Device AI: Tether's QVAC Fabric Enables Local Training
- You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM
- Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One
- StepFun Releases SFT Dataset Used to Train Step 3.5 Flash for Community Fine-Tuning
- Fine-Tuned 14B Model Outperforms Claude Opus 4.6 on Ada Code Generation
- Experiment: 0.8B Model Self-Improvement on MacBook Air Yields Surprising Results
- Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code
- Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch
- Mojo: Creating a Programming Language for an AI World with Chris Lattner
- Change Intent Records: The Missing Artifact in AI-Assisted Development
- C7: Pipe Up-to-Date Library Docs Into Any LLM From the Terminal
- Apple Neural Engine Reverse-Engineered for Local Model Training on Mac Mini M4
- Google Research Finds Longer Chain-of-Thought Correlates Negatively With Accuracy
- Arduino, Qualcomm Bring On-Device AI and Robotics Learning to Indian School Systems
- Show HN: 100% LLM Accuracy – No Fine-Tuning, JSON Only
- No, Local LLMs Can't Replace ChatGPT or Gemini — I Tried
- Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy
- Anthropic Reveals Industrial-Scale Distillation Attacks by Chinese AI Labs
- Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
- How Do You Know Which SKILL.md Is Good?
- nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
- CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours
- Why AI Models Fail at Iterative Reasoning and What Could Fix It
- GLM-5 Technical Report: DSA Innovation Reduces Training and Inference Costs
- Matmul-Free Language Model Trained on CPU in 1.2 Hours
- Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
- GPT-OSS 120B Uncensored Model Released in Native MXFP4 Precision
- Optimal llama.cpp Settings Found for Qwen3 Coder Next Loop Issues
- Student Releases Dhi-5B: Multimodal Model Trained for Just $1,200