Tagged "edge-device"
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
-
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad
-
75% of US Health Systems Are Using AI. Only 18% of That Deployment Is Governed
-
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Blueprint: AI Hardware Design
-
Rust Open-Source Headless Browser for AI Agents and Web Scraping
-
Run a Local LLM Server on Raspberry Pi with Remote Access Capabilities
-
LLMs Consume 5.4x Less Mobile Energy Than Ad-Supported Web Search
-
Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops
-
I Replaced My Local LLM With a Model Half Its Size and Got Better Results
-
Using a Local LLM as a Zero-Shot Classifier
-
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
Building Real-World On-Device AI with LiteRT and NPU
-
AI Agent Designs a RISC-V CPU Core from Scratch
-
Anker Unveils 'Thus' Chip to Bring On-Device AI Across Product Line
-
Tesseron: New API Framework for AI Agents with Developer-Defined Configuration
-
Sarvam Edge: India's Offline AI Model Runs on Phones and Laptops Without Internet
-
Developer Turns Phone Into Local LLM Server with Vision, Voice, and Tool Calling Capabilities
-
16 Ways to Make a Small Language Model Think Bigger
-
Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers
-
Gemma 4 Just Replaced My Whole Local LLM Stack
-
DeepX and Hyundai Motor Group Robotics LAB Partner to Develop Next-Generation Physical AI Compute Platform
-
ZeusHammer: Built an AI Agent That Thinks Locally
-
Controlling the Secondary Fan on Minisforum AI Pro HX 370
-
llama.cpp Merges Speculative Checkpointing for Major Inference Speed Boost
-
Intel Extends AI PC Reach With New Core Ultra Series 3 Launch
-
Bun v1.3.13
-
Waterloo's Live AI-Goose Tracker: Real-Time Edge Vision
-
Minisforum Launches N5 Max AI NAS with OpenClaw
-
Llama.cpp Robot Wars
-
Unweight: Lossless MLP Weight Compression for LLM Inference
-
115 TOPS in 0.67L: CHUWI AuBox X Packs On-Device AI Power Into a Palm-Sized Mini PC
-
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
-
The 'Ollama' Tool Has Numerous Problems, and Some Argue That Llama.cpp Is Better
-
Show HN: An MCP server that lets AI compose music on a hardware synth
-
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful
-
Community Computer: Collaborative Autoresearch on a Peer-to-Peer Network
-
Building a Voice AI Wearable in a Casio F91W with Whisper and BLE
-
Bonsai 1.7B in the Browser: A 290MB 1-bit LLM on WebGPU
-
Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma 4
-
SigMap – Shrink AI Coding Context 97% with Auto-Scaling Token Budget
-
Self-Hosted LLMs Transform Personal Knowledge Management Systems
-
Running Gemma 4 on an iPhone 13 Pro
-
Ubiquiti UniFi G6 Turret 4K Camera Features On-Device AI Processing at $199 Price Point
-
Sovereign AI: Why the Next GPT Will Be Born in Our Living Rooms
-
Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release
-
Qwen 3.5 Small – On-Device Multimodal Models Released
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
oMLX Framework Implements DFlash Attention for Optimized Inference
-
Minisforum N5 MAX AI NAS Delivers 126 TOPS with 200TB Storage for Local LLM Workloads
-
Local LLM Connected to Home Assistant via MCP Now Enables Autonomous Smart Home Management
-
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills
-
Defender – Local Prompt Injection Detection for AI Agents
-
Learn LLM Internals
-
Researchers Achieve 1-Bit Quantization of OLMo-3 7B Using Distillation
-
ASUS Malaysia to Bring UGen300 USB AI Accelerator in Q2 for Portable On-Device AI Inferencing
-
Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite
-
A Deep Dive into Tinygrad AI Compiler
-
MiniMax M2.7 Released: New Model Available for Local Deployment
-
MiniMax M2.7 Is Now Open Source
-
Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device
-
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference
-
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon
-
The Best Local AI Model for Home Assistant Isn't Always the Biggest One
-
Qualcomm Snapdragon XR Powers Next-Generation AI Glasses with Local Inference
-
Google's Gemini Nano 4 Offers Faster, Smarter Local Inference Capabilities
-
DMax: New Parallel Decoding Paradigm for Diffusion Language Models
-
ASUS ExpertBook P1 Integrates On-Device AI for Enterprise Collaboration
-
AI PC Market Projected to Reach $235B by 2032, Driven by On-Device Computing Adoption
-
Self-Installing Skill Manager for AI Agents
-
Warp Decode vs. vLLM's Triton Kernel: Performance Crossover Analysis
-
Tether Launches QVAC SDK for Cross-Platform Local AI Development
-
Samsung Integrates On-Device AI Features into Galaxy A-Series Smartphones
-
Building Offline AI Companions on Severely Constrained Hardware (8GB RAM)
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI
-
Energy Consumption: The Final Frontier for AI and Local Inference
-
Speculative Decoding Made My Local LLM Actually Usable
-
Running a 1.7B-Parameter LLM on an Apple Watch
-
Run Qwen3.5 on an Old Laptop: A Lightweight Local Agentic AI Setup Guide
-
Gemini-CLI, Llama.cpp, and Qwen3.5 Running on NVIDIA Jetson TK1
-
Gemma 4 Support Stabilized in Llama.cpp
-
Gemma 4 GGUF Models Updated with Critical Quantization Fixes
-
Google AI Edge Gallery Showcases Offline Inference with Gemma 4
-
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS
-
Quansloth Using Google's TurboQuant Breaks the VRAM Wall for Local LLMs
-
PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring
-
MemPalace, the Highest-Scoring AI Memory System Ever Benchmarked
-
CricketBrain: Neuromorphic Signal Processor in Rust (0.175us/step, 944 bytes)
-
VLA Learns How to Act. S2S Decides Whether the Motion Is Physically Trustworthy
-
Verbatim 140W GaN: One of the First Chargers With USB PD 3.2 AVS (SPR) Support
-
TurboQuant in Llama.cpp Achieves 6X Smaller KV Cache
-
Show HN: Lightweight LLM Tracing Tool with CLI
-
Lenovo Korea Launches AI-Powered Industrial Edge Solutions
-
HunyuanOCR 1B: High-Quality OCR Now Viable on Budget Consumer Hardware
-
GPU Memory for LLM Inference (Part 1)
-
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4
-
Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment
-
Apple Brings Enhanced On-Device AI Features to iPhone
-
Vektor – Local-First Associative Memory for AI Agents
-
Satsgate: Monetize AI Agents and APIs with Lightning L402 Protocol
-
Qualcomm Snapdragon Innovations Enable Advanced On-Device AI for Wearables
-
Microsoft Quantum Development Kit Ported to Rust: 100x Faster and Smaller
-
Google Previews Gemini Nano 4 for Android AICore with On-Device Capabilities
-
GMKtec NucBox K17 Launches with 97 TOPS AI Performance for Local Inference
-
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
-
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware
-
Run AutoGen with Ollama and LiteLLM in Simple Steps
-
Nex Life Logger: Local Activity Tracker with AI Agent Integration
-
Netflix Open-Sources VOID Model for Video Object Deletion
-
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference
-
GPUs vs. TPUs: Decoding the Powerhouses of AI
-
Google Launches Gemma 4 For Advanced On-Device AI
-
Gemma 4 KV Cache Memory Issues Fixed in llama.cpp
-
SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions
-
Google Gemma 4 Released with GGUF Quantizations
-
Google Launches Gemma 4 Open Models for Local On-Device AI
-
Gemma 4 Makes Local AI Agents Practical
-
Gemma 4 2B Successfully Runs on Raspberry Pi 5
-
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
-
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs
-
SmolLM2-360M Running on Samsung Galaxy Watch 4 with 74% Memory Reduction
-
Qwen 3.6-Plus Released
-
Show HN: Memsearch – Persistent, Cross-Agent, Cross-Session Memory for AI Agents
-
Lotte Innovate and DeepX Collaborate on Mass Production of Domestic AI Semiconductors
-
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
-
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI
-
Bonsai 1-Bit Models Deliver Exceptional Local Inference Performance
-
If Your AI Agent Ran NPM Install During the Axios Attack, You're Compromised
-
Local AI Ecosystem Extends Far Beyond Ollama
-
Claw64 – Full Agentic Loop in <4KB on Commodore 64
-
PrismML Announces 1-Bit Bonsai: First Commercially Viable 1-Bit LLMs
-
Running AI on a Raspberry Pi, Part 2: Running AI on a Pi in Under 5 Minutes
-
Dell Technologies Unveils 10 AI PC Models for Business, from Ultralight Laptops to Ultracompact Desktops
-
TurboQuant: Understanding the Quantization Breakthrough
-
Google's TurboQuant Shows Memory Constraints Remain Critical for Local LLM Inference
-
OLED Emerges as the Display Standard for Energy-Efficient AI Systems
-
IBM Granite 4.0 3B Vision: Compact Enterprise-Grade Document AI
-
ESP32-S31: 320MHz 2-Core Microcontroller with 512KB SRAM and Networking
-
HP Launches Copilot+ PCs in India with On-Device AI Capabilities for Local Inference
-
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
-
CERN Embeds Tiny AI Models in Silicon Chips for Real-Time LHC Data Filtering
-
This Wearable Runs an On-Device AI With 2-Week Battery Life
-
Comparison of Two Frameworks: 40% Token Efficiency Improvement
-
Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
-
Apple Gets Full Gemini Access and Uses Distillation to Build Lightweight On-Device AI
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
Samsung Galaxy A37 and A57 5G Launch with On-Device AI Capabilities in India
-
RF-DETR Nano and YOLO26 Enable On-Device Object Detection on Smartphones
-
Why Responsible AI Is the Bedrock of AI-Powered Applications
-
NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
-
Nota AI and SiMa.ai Partner on Physical AI Technology for Local Deployment
-
Meta Releases HyperAgents: Self-Improving AI
-
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost
-
Operating Systems. One USB. ZFS on Root. AI-Powered. Free
-
Google's TurboQuant: The Unsexy AI Breakthrough Worth Watching
-
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features
-
Google TurboQuant: Extreme Compression for Local LLM Deployment
-
Running an Open-Weight LLM Locally on an Apple Watch
-
New Open-Weight Models Released: GigaChat-3.1-Ultra and Lightning Variants
-
Lemonade 10.0.1 Improves Setup Process For Using AMD Ryzen AI NPUs On Linux
-
.APKs Are Just .ZIPs: Semi-Legally Hacking Software for Orphaned Hardware
-
Ultra-Large 400B-Class LLM Runs on iPhone in Test