Tagged "fine-tuning"

Nvidia Enters Windows Laptop Market, Taking on Intel and AMD 1 June 2026
Fine-tuning an LLM to Write Docs Like It's 1995 1 June 2026
CNN sues Perplexity over alleged AI copyright theft 29 May 2026
AI Guardrails Stripped From Meta and Google Models in Minutes 25 May 2026
Show HN: An Open-Source Interactive AI Engineering Syllabus (1,100 Papers) 25 May 2026
From Source Code to LLM Constraints: A Semantic Extractor for Python, SwiftUI, Lua 24 May 2026
User Migration from LM Studio/Ollama to llama.cpp Shows Growing Preference 22 May 2026
AMD's New Ryzen AI Max Pro 400 with 192GB LPDDR5X Memory 21 May 2026
On-Device AI to Be in 80% of Wearables by 2032 19 May 2026
Safety Paradox: How RLHF Creates the AI Psychosis Problem It's Meant to Prevent 18 May 2026
Local LLMs Offer Unique Advantages That Cloud AI Services Cannot Match 18 May 2026
The AI Layoff Receipts: Market Consolidation Accelerates Open-Source Model Adoption 18 May 2026
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU 17 May 2026
How to Train Your GPT: Comprehensive Commented Training Guide 16 May 2026
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training 14 May 2026
Claude Opus 4.7 System Prompt Leaks Raise Local Deployment Questions 14 May 2026
Avocado Studio: Open-Source AI Content Editor for Next.js Sites 14 May 2026
Legacy System Analysis with AI Reveals Modern Architecture Under the Hood 14 May 2026
Discussion: Including New Mathematical Proofs in LLM Training Data for Rediscovery 9 May 2026
Local LLM Rewrites Resume Better Than ChatGPT, and It's Not Even Close 8 May 2026
I Replaced ChatGPT and Claude With This Powerful Local LLM and Saved Over $20 a Month While Gaining Full Control 5 May 2026
NHS to Close-Source GitHub Repos Over AI and Security Concerns 5 May 2026
Study: AI Models That Consider User Feelings Are More Likely to Make Errors 2 May 2026
AI Coding Tools Are Silently Disagreeing with Each Other 2 May 2026
IBM Introduces Granite 4.1 Family of Models for Local Deployment 30 April 2026
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful 28 April 2026
Unsloth's Custom Kernels Make LLM Fine-Tuning Viable on Consumer GPUs 27 April 2026
Fixing Hallucination in LLM Prediction With Only One 48GB GPU 25 April 2026
Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions 25 April 2026
I Replaced My Local LLM With a Model Half Its Size and Got Better Results 24 April 2026
Mathesar 0.10.0 24 April 2026
Using a Local LLM as a Zero-Shot Classifier 24 April 2026
AI Licensing Marketplaces: A Guide for Publishers and Content Creators 22 April 2026
Laimark – 8B LLM That Self-Improves on Consumer GPUs 18 April 2026
BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism 18 April 2026
When Should AI Step Aside?: Teaching Agents When Humans Want to Intervene 17 April 2026
The Case for Out-of-Process Enforcement for AI Agents 17 April 2026
LLM Personalization Breaks Down in High-Stakes Finance 16 April 2026
GBrain – System to Make Your AI Agent Better Reflect You 15 April 2026
Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release 14 April 2026
Minisforum N5 MAX AI NAS Delivers 126 TOPS with 200TB Storage for Local LLM Workloads 14 April 2026
Developer Shares Golden Stack for Local Coding Assistant Integration Directly Inside Code Editors 14 April 2026
Abliterated Local LLM Models Show Distinct Behavioral Characteristics Compared to Standard Variants 14 April 2026
MiniMax-M2.7 Delivers Exceptional Performance on Consumer Hardware 13 April 2026
MiniMax M2.7 Open-Sources Globally as Industry's First Self-Improving Model 13 April 2026
MiniMax M2.7 Released: New Model Available for Local Deployment 12 April 2026
MiniMax M2.7 Is Now Open Source 12 April 2026
Self-Hosted LLMs Transform Personal Knowledge Management Systems 11 April 2026
5 Open-Source Projects Running Transformers on CPUs to GPUs in Pure Java 10 April 2026
Quansloth Using Google's Turboquant Breaks the VRAM Wall for Local LLMs 7 April 2026
Apple Research Shows Self-Distillation Significantly Improves Local Code Generation 5 April 2026
Google Launches Gemma 4 For Advanced On-Device AI 4 April 2026
Autonet: Decentralized AI Training with Constitutional Governance 4 April 2026
Local AI Ecosystem Extends Far Beyond Ollama 1 April 2026
Does RAG Help AI Coding Tools? 31 March 2026
Unsloth Studio Beta Ships 50+ New Features for Local Model Training and Inference 28 March 2026
Qwen3.5-27B Emerges as Sweet Spot for Single-GPU Local Deployment 24 March 2026
Building a Production AI Receptionist: Practical Local LLM Deployment Case Study 23 March 2026
Llama 8B Matches 70B Performance on Multi-Hop QA Using Structured Prompting 22 March 2026
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach 22 March 2026
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide 21 March 2026
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks 20 March 2026
Cursor's Composer 2 Model Analysis – Fine-Tuned Variant of Kimi K2.5 20 March 2026
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training 19 March 2026
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally 18 March 2026
On-Device AI: Tether's QVAC Fabric Enables Local Training 18 March 2026
Mistral Releases Small 4 Open-Source Model Under Apache 2.0 17 March 2026
Mistral Releases Leanstral: First Open-Source Code Agent for Lean 4 Proof Assistant 17 March 2026
Researcher Discovers Universal "Danger Zone" in Transformer Model Architecture at 50% Depth 17 March 2026
KAIST Develops World's First Hyper-Personalized On-Device AI Chip 17 March 2026
NVIDIA Updates Nemotron 3 122B License, Removes Deployment Restrictions 16 March 2026
Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One 16 March 2026
StepFun Releases SFT Dataset Used to Train Step 3.5 Flash for Community Fine-Tuning 15 March 2026
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms 15 March 2026
Fine-Tuned 14B Model Outperforms Claude Opus 4.6 on Ada Code Generation 14 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 12 March 2026
Experiment: 0.8B Model Self-Improvement on MacBook Air Yields Surprising Results 11 March 2026
Texas Instruments Launches NPU-Powered MCUs for Low-Power Edge AI 11 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 11 March 2026
LMF – LLM Markup Format 11 March 2026
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code 11 March 2026
.ispec: Runtime Specification Validation for AI System Consistency 10 March 2026
Fish Audio Open-Sources S2: Expressive Text-to-Speech with Natural Language Control and 100ms Latency 10 March 2026
Fine-Tuned Qwen SLMs (0.6–8B) Demonstrate Competitive Performance Against Frontier LLMs on Specialized Tasks 10 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 9 March 2026
FretBench – Testing 14 LLMs on Reading Guitar Tabs Reveals Performance Gaps 9 March 2026
Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference 8 March 2026
Sarvam AI Releases 30B and 105B Open-Source Models Trained from Scratch 7 March 2026
Show HN: BoardMint – A PCB Review Tool That Avoids AI Hallucinations 6 March 2026
Incrmd: Incremental AI Coding by Editing PROJECT.md 4 March 2026
Qwen 3.5 vs Qwen 3 Benchmark Analysis: Generational Performance Improvements Visualized 3 March 2026
Jan Releases Code-Tuned 4B Model for Efficient Local Code Generation and Development Tasks 2 March 2026
Change Intent Records: The Missing Artifact in AI-Assisted Development 2 March 2026
Apple Neural Engine Reverse-Engineered for Local Model Training on Mac Mini M4 2 March 2026
Nummi – AI Companion with Memory and Daily Guidance 1 March 2026
Google Research Finds Longer Chain-of-Thought Correlates Negatively With Accuracy 1 March 2026
Extracting 100K Concepts from an 8B LLM 27 February 2026
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required 26 February 2026
Show HN: 100% LLM Accuracy–No Fine-Tuning, JSON Only 25 February 2026
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy 24 February 2026
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec 24 February 2026
Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours 23 February 2026
nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export 23 February 2026
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities 23 February 2026
O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture 22 February 2026
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours 22 February 2026
Matmul-Free Language Model Trained on CPU in 1.2 Hours 18 February 2026
Can We Leverage AI/LLMs for Self-Learning? 18 February 2026
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages 17 February 2026
GPU-Accelerated DataFrame Library for Local Inference Workloads 16 February 2026
Developer Creates Custom Local AI Headshot Generator After Commercial Solutions Fail 11 February 2026