Tagged "edge-device"
-
A Cinematic Landing-Page Hero for 80 Cents (GPT Image 2 and Veo 3.1)
-
Tether AI Upgrades QVAC SDK With TurboQuant for Data Center-Sized Memory on Everyday Devices
-
Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Hermes Agent
-
JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks
-
Good LLM Development and Usage Patterns
-
Qualcomm Reveals Snapdragon C with Advanced On-Device AI Engine
-
Nvidia Enters Windows Laptop Market, Taking on Intel and AMD
-
NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark
-
Netflix Wiz Creates App to Slash AI Bills, Then Open Sources It
-
Fine-tuning an LLM to Write Docs Like It's 1995
-
What Apple Knows About AI That Silicon Valley Won't Admit
-
Snapdragon C Specs Revealed: 6nm Process, On-Device AI Engine for Budget Laptops
-
Liquid AI Launches Edge-Focused LFM2.5 Model to Power On-Device AI Agents
-
Chrome Quietly Downloads 4GB AI Model Without User Permission
-
Zoho-Backed Netrasemi Launches 12nm AI Chip, Mass Production Begins This Year
-
Snapdragon C Debuts with 6nm Process and Dedicated On-Device AI Engine
-
Slow Journal App with AI Integration
-
MediaTek Dimensity 7500 Brings On-Device AI and Enhanced Power Efficiency to Mid-Range Phones
-
Apple Doubles Down on On-Device AI at WWDC 2026, Setting Privacy-First Strategy
-
The Windows Device Manager, on Linux
-
Tiny microphone on my balcony to listen for any birds passing by
-
MediaTek Launches Dimensity 8550 4nm SoC with Integrated On-Device AI Focus
-
Liquid AI Unveils Edge-Focused LFM2.5 Model for On-Device AI Agents
-
The Infrastructure Behind Making Local LLM Agents Actually Useful
-
Google Launches Tiny Board for Running Gemma 3 Locally
-
Privacy-Focused Raspberry Pi Zero 2W DIY Security Camera with On-Device AI and End-to-End Encryption
-
Mistral AI Launches Mistral Vibe
-
Local-first: Rebuilding a Read-later App with PowerSync and SQLite
-
MediaTek Dimensity 8550 Shifts Focus to Gemini Nano V3 and On-Device AI on Phones
-
The Anatomy of an LLM
-
Alibaba Cloud Joins PyTorch Foundation as Platinum Member
-
OpenBMB Runs Local Agents with MiniCPM5-1B – Efficient LLM for Edge Deployment
-
Local LLM Setup: How to Use RAG and an Embedding Model to Stop Wasting Context
-
Meet EAGLE 3.1: The Speculative Decoding Algorithm That Fixes Attention Drift in LLM Inference
-
Samsung's Exynos 2800 Brings HBM Memory to Mobile AI, Enabling Faster Local Model Inference
-
Developer Switches from LM Studio to llama.cpp, Reports No Performance Downgrade
-
Anker Soundcore Liberty 5 Pro Earbuds Feature Dedicated On-Device AI Chip with Touch Screen
-
LM Studio 0.4 Introduces Headless Deployment for Local LLM APIs
-
Users Report Superior Performance Switching from LM Studio to llama.cpp
-
Maker Demonstrates Portable AI with Suitcase-Integrated Jetson Orin Setup
-
Gemma 4: A New Budget-Focused Model in Posit AI
-
From Source Code to LLM Constraints: A Semantic Extractor for Python, SwiftUI, Lua
-
Qualcomm's AI-Device Strategy Reflects Growing Market Momentum in On-Device Intelligence
-
MCP Servers Transform Local LLM Stack, Replacing $249 Paid Tools
-
A Maintainability Ratchet for AI-Assisted Python
-
Why Your Docker Container Is 1.2GB When It Should Be 80MB
-
New 8B Local LLM Design Marks Biggest Shift Since DeepSeek R1
-
AMD Unveils Ryzen AI Halo Developer Platform for On-Device AI Workloads
-
llama.cpp MTP Leak Fix Stabilizes Local AI Agents
-
Show HN: Interactive and Stylized AI Chat Chrome Extension
-
Google Makes Gemini 3.5 Flash the Default AI Model for Billions of Users
-
The Brain vs. Deep Learning Part I: Computational Complexity Analysis
-
Benchmarking a Portable AI Workstation: Lenovo ThinkPad P16 Gen 3, Part 2
-
Hardware LLM Taalas Reaches >14,000 TPS on Llama 3.1 8B
-
Google's Cormac Brick on Tiny LLMs for On-Device Agents
-
Auditing Apple's DifferentialPrivacy.framework: Bugs, Misconfig, Practical Risks
-
AI Token Streaming Isn't About SSE vs. WebSockets
-
Meta Plans Agentic AI on Smartphones and Wearables by 2026
-
Google Tensor SDK Beta with LiteRT Enables Efficient On-Device AI
-
Google and Synaptics Partner on Coralboard for Immersive Edge AI Experiences
-
Google's Offline AI App Gets Three Major Feature Upgrades
-
Samsung's Exynos 2800 Could Be the First Mobile Chip to Use HBM for Powerful On-Device AI
-
OpenAI Agents SDK Ported to React Native for Mobile Deployment
-
Open Source Local Audio Stem Separation Tool Released
-
On-Device AI to Be in 80% of Wearables by 2032
-
llama.cpp Adds Multi-Token Prediction, Doubles Qwen 3.6B Throughput for Local Inference
-
Chrome Is Quietly Downloading a 4GB AI Model Without Your Permission
-
Running Large Language Models on Single-Board Computer Clusters: Creative Edge Deployment
-
Samsung's Exynos 2800 Brings Significant On-Device AI Capabilities
-
Ansede-static: Offline SAST Tool Demonstrates Value of Local AI Tools
-
Local LLMs Enable Intelligent Smart Camera Control Without Cloud Dependency
-
Linux 7.1-rc4 Released: Kernel Updates Relevant to Local LLM Inference
-
The Time Bomb Went Off: AI's All-You-Can-Eat Era Just Ended in Real Time
-
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU
-
Local LLM Takes Control of Video Doorbell—The Future of Smart Cameras
-
Maker Builds Offline Jetson-Powered Chatbot Suitcase
-
HP's On-Device AI Needs More If It Is Going to Compete With Copilot
-
Google Limits Gemini Intelligence to New Flagships—Hardware Requirements for Local Deployment
-
SynapseKit: A New Production Framework for Deploying LLMs
-
Orthrus Reshapes Economics of Local AI Inference with New Optimization Approach
-
Offline Voice-to-Text and AI Keyboard App for Local Processing
-
DwarfStar 4: Native Inference Engine Optimized for DeepSeek V4 Flash
-
Chrome Silently Downloads 4GB Gemini Nano Model Without User Consent
-
Show HN: Find the best local LLM for your hardware, ranked by benchmarks
-
Arm and Google Collaborate on On-Device AI Optimization Techniques
-
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training
-
Chrome Automatically Downloads 4GB AI Model for Local Processing
-
Researchers Report AI Breaking Every Benchmark for Autonomous Cyber Capability
-
Tsjilp – AI as a Silent Communication Assistant
-
Running a Local LLM on a 12-Year-Old Raspberry Pi
-
Mainline Linux 6.12 on Annapurna Labs Alpine V2 (Ubiquiti UNVR, UDM-Pro)
-
Lucebox Brings Faster Local AI Inference to AMD Strix Halo
-
How I Used a Local LLM to Organize the Store on My NAS
-
BT Explainer: Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Running a Local LLM on a 12-Year-Old Raspberry Pi: Practical Edge Inference
-
AMD's vLLM-ATOM Plugin Supercharges DeepSeek-R1 and Kimi-K2 Inference on MI350/MI400
-
MDL: Endless Visual Novel Engine Powered by AI
-
Lython: Experimental Python Compiler Toolchain Based on LLVM
-
Deploying Frigate & Ollama On A Minisforum MS-A2 Server
-
Cotypist – AI Autocomplete for Mac
-
I Built My Second Brain for Meetings. No Monthly Subscription
-
All Those A.I. Note Takers? They're Making Lawyers Nervous
-
DistillFast: AI Cost Optimization Tool for Model Efficiency
-
Dikaletus: Open-Source Meeting Recording and Transcription Using Mistral AI
-
Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally
-
Chrome's On-Device AI Features Consuming 4GB of Storage for Gemini Nano
-
Chrome Is Secretly Downloading 4GB Gemini Nano Model Without User Consent
-
Lemonade Gives AMD Startups a Wider Path to Local Inference
-
Perplexity Brings On-Device AI Workflow to Macs with 'Personal Computer' Feature
-
Show HN: A Local-First Agentic Knowledge Manager
-
Google Releases Gemma 4 Multi-Token Prediction Drafters To Accelerate AI Inference
-
Running Espressif's OpenClaw-Inspired AI Agent on ESP32 with Self-Hosted LLM Works in Practice
-
How to make SSE token streams resumable, cancellable, and multi-device
-
Nota AI Partners with Mobilint to Accelerate On-Device AI on Domestic NPU Infrastructure
-
Locked, stocked, and losing budget: AI vendor lock-in bites back
-
Microsoft VibeVoice C++ Port Enables Local Voice AI on CPU and GPU Without Python
-
Sarvam Edge: Indian-Built AI Models Run Offline on Phones and Laptops Without Internet
-
Improving Code Quality with Local Claude and Codex Models
-
Agentic AI Community Focus: Building Local Agents in 2026
-
5 Things I Wish Someone Had Told Me Before I Tried Self-Hosting a Local LLM
-
A 49-Line Physics Classifier That Beats kNN on 76% of Benchmarks
-
Show HN: Memex, Claude Memory via Local RAG with MCP and Offline Embeddings
-
llama.cpp Now Supports Multi-Token Prediction in Beta
-
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Major Smartphone Brands Introduce Advanced On-Device AI Features
-
Ruflo: Multi-Agent AI Orchestration for Claude Code
-
NordVPN Adds On-Device AI Voice Detector to Chrome Extension to Identify Synthetic Audio
-
Google Explains Why AICore Storage Requirements Are Increasing on Android
-
Gemma 4 Just Replaced My Whole Local LLM Stack
-
Anker's Thus Chip Puts AI On-Device, Promising Faster Responses And Better Privacy
-
I Put a Local LLM on My Phone and Stopped Needing Cloud AI for Most Tasks
-
Show HN: Kit – Editor, Browser, Terminal, Mail with AI Agents Sharing Context
-
PFlash Claims 10x Prefill Speedup Over llama.cpp
-
Google Drops COSMO: Experimental On-Device AI Assistant for Android
-
Anker's New 'Thus' Chip Brings 150x AI Power to Earbuds
-
Xmemory: Benchmarking Structured AI Memory Against RAG and Hybrid RAG
-
Ubuntu is Going All In on Generative AI and Other Linux Distros Might Follow
-
Building a Raspberry Pi-Based Local LLM Server for Remote Access
-
Home Assistant's Local LLM Support Outperforms Gemini for Home Automation
-
Running Capable Local LLMs Without Expensive GPU Hardware
-
IBM Introduces Granite 4.1 Family of Models for Local Deployment
-
How Much "Brain Damage" Can an LLM Tolerate?
-
Google's Gemma 4 Brings Powerful AI Capabilities to Phones and Laptops
-
Building a Remote-Accessible Local LLM Server on Raspberry Pi
-
Why the Same LLM Gives Different Answers in Different Environments
-
Show HN: Minimal Linux Sandboxes to Manage AI-Generated Code with Ease
-
Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop
-
Pocket LLM v1.5.0 Brings Multimodal AI to Android with No Cloud Required
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
-
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad
-
75% of US Health Systems Are Using AI. Only 18% of That Deployment Is Governed
-
Blueprint: AI Hardware Design
-
Rust Open-Source Headless Browser for AI Agents and Web Scraping
-
Run a Local LLM Server on Raspberry Pi with Remote Access Capabilities
-
LLMs Consume 5.4x Less Mobile Energy Than Ad-Supported Web Search
-
Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops
-
I Replaced My Local LLM With a Model Half Its Size and Got Better Results
-
Using a Local LLM as a Zero-Shot Classifier
-
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
Building Real-World On-Device AI with LiteRT and NPU
-
AI Agent Designs a RISC-V CPU Core from Scratch
-
Anker Unveils 'Thus' Chip to Bring On-Device AI Across Product Line
-
Tesseron: New API Framework for AI Agents with Developer-Defined Configuration
-
Sarvam Edge: India's Offline AI Model Runs on Phones and Laptops Without Internet
-
Developer Turns Phone Into Local LLM Server with Vision, Voice, and Tool Calling Capabilities
-
16 Ways to Make a Small Language Model Think Bigger
-
Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers
-
DeepX and Hyundai Motor Group Robotics LAB Partner to Develop Next-Generation Physical AI Compute Platform
-
ZeusHammer: Built an AI Agent That Thinks Locally
-
Controlling the Secondary Fan on Minisforum AI Pro HX 370
-
llama.cpp Merges Speculative Checkpointing for Major Inference Speed Boost
-
Intel Extends AI PC Reach With New Core Ultra Series 3 Launch
-
Bun v1.3.13
-
Waterloo's Live AI-Goose Tracker: Real-Time Edge Vision
-
Minisforum Launches N5 Max AI NAS with OpenClaw
-
LlaMa.cpp Robot Wars
-
Unweight: Lossless MLP Weight Compression for LLM Inference
-
115 TOPS in 0.67L: CHUWI AuBox X Packs On-Device AI Power Into a Palm-Sized Mini PC
-
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
-
The 'Ollama' Tool Has Numerous Problems, and Some Argue That Llama.cpp Is Better
-
Show HN: An MCP server that lets AI compose music on a hardware synth
-
Local AI Isn't Just Ollama—Here's the Ecosystem That Actually Makes It Useful
-
Community Computer: Collaborative Autoresearch on a Peer-to-Peer Network
-
Building a Voice AI Wearable in a Casio F91W with Whisper and BLE
-
Bonsai 1.7B in the Browser: A 290MB 1-bit LLM on WebGPU
-
Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma4
-
SigMap – Shrink AI Coding Context 97% with Auto-Scaling Token Budget
-
Self-Hosted LLMs Transform Personal Knowledge Management Systems
-
Running Gemma 4 on an iPhone 13 Pro
-
Ubiquiti UniFi G6 Turret 4K Camera Features On-Device AI Processing at $199 Price Point
-
Sovereign AI: Why the Next GPT Will Be Born in Our Living Rooms
-
Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release
-
Qwen 3.5 Small – On-Device Multimodal Models Released
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
oMLX Framework Implements DFlash Attention for Optimized Inference
-
Minisforum N5 MAX AI NAS Delivers 126 TOPS with 200TB Storage for Local LLM Workloads
-
Local LLM Connected to Home Assistant via MCP Now Enables Autonomous Smart Home Management
-
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills
-
Defender – Local Prompt Injection Detection for AI Agents
-
Learn LLM Internals
-
Researchers Achieve 1-Bit Quantization of OLMo-3 7B Using Distillation
-
ASUS Malaysia to Bring UGen300 USB AI Accelerator in Q2 for Portable On-Device AI Inferencing
-
Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite
-
A Deep Dive into Tinygrad AI Compiler
-
MiniMax M2.7 Released: New Model Available for Local Deployment
-
MiniMax M2.7 Is Now Open Source
-
Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device
-
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference
-
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon
-
The Best Local AI Model for Home Assistant Isn't Always the Biggest One
-
Qualcomm Snapdragon XR Powers Next-Generation AI Glasses with Local Inference
-
Google's Gemini Nano 4 Offers Faster, Smarter Local Inference Capabilities
-
DMax: New Parallel Decoding Paradigm for Diffusion Language Models
-
ASUS ExpertBook P1 Integrates On-Device AI for Enterprise Collaboration
-
AI PC Market Projected to Reach $235B by 2032, Driven by On-Device Computing Adoption
-
Self-Installing Skill Manager for AI Agents
-
Warp Decode vs. vLLM's Triton Kernel: Performance Crossover Analysis
-
Tether Launches QVAC SDK for Cross-Platform Local AI Development
-
Samsung Integrates On-Device AI Features into Galaxy A-Series Smartphones
-
Building Offline AI Companions on Severely Constrained Hardware (8GB RAM)
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI
-
Energy Consumption: The Final Frontier for AI and Local Inference
-
Speculative Decoding Made My Local LLM Actually Usable
-
Running a 1.7B Parameters LLM on an Apple Watch
-
Run Qwen3.5 on an Old Laptop: A Lightweight Local Agentic AI Setup Guide
-
Gemini-CLI, Llama.cpp, and Qwen3.5 Running on NVIDIA Jetson TK1
-
Gemma 4 Support Stabilized in Llama.cpp
-
Gemma 4 GGUF Models Updated with Critical Quantization Fixes
-
Google AI Edge Gallery Showcases Offline Inference with Gemma 4
-
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS
-
Quansloth Using Google's Turboquant Breaks the VRAM Wall for Local LLMs
-
PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring
-
MemPalace, the Highest-Scoring AI Memory System Ever Benchmarked
-
CricketBrain: Neuromorphic Signal Processor in Rust (0.175us/step, 944 bytes)
-
VLA Learns How to Act. S2S Decides Whether the Motion Is Physically Trustworthy
-
Verbatim 140W GAN: One of the First Chargers With USB PD 3.2 AVS (SPR) Support
-
TurboQuant in Llama.cpp Achieves 6X Smaller KV Cache
-
Show HN: Lightweight LLM Tracing Tool with CLI
-
Lenovo Korea Launches AI-Powered Industrial Edge Solutions
-
HunyuanOCR 1B: High-Quality OCR Now Viable on Budget Consumer Hardware
-
GPU Memory for LLM Inference (Part 1)
-
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4
-
Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment
-
Apple Brings Enhanced On-Device AI Features to iPhone
-
Vektor – Local-First Associative Memory for AI Agents
-
Satsgate: Monetize AI Agents and APIs with Lightning L402 Protocol
-
Qualcomm Snapdragon Innovations Enable Advanced On-Device AI for Wearables
-
Microsoft Quantum Development Kit Ported to Rust: 100x Faster and Smaller
-
Google Previews Gemini Nano 4 for Android AICore with On-Device Capabilities
-
GMKtec NucBox K17 Launches with 97 TOPS AI Performance for Local Inference
-
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
-
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware
-
Run AutoGEN with Ollama and LiteLLM in Simple Steps
-
Nex Life Logger: Local Activity Tracker with AI Agent Integration
-
Netflix Open-Sources VOID Model for Video Object Deletion
-
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference
-
GPUs vs. TPUs: Decoding the Powerhouses of AI
-
Google Launches Gemma 4 For Advanced On-Device AI
-
Gemma 4 KV Cache Memory Issues Fixed in llama.cpp
-
SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions
-
Google Gemma 4 Released with GGUF Quantizations
-
Google Launches Gemma 4 Open Models for Local On-Device AI
-
Gemma 4 Makes Local AI Agents Practical
-
Gemma 4 2B Successfully Runs on Raspberry Pi 5
-
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
-
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs
-
SmolLM2-360M Running on Samsung Galaxy Watch 4 with 74% Memory Reduction
-
Qwen 3.6-Plus Released
-
Show HN: Memsearch – Persistent, Cross-Agent, Cross-Session Memory for AI Agents
-
Lotte Innovate and DeepX Collaborate on Mass Production of Domestic AI Semiconductors
-
A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
-
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI
-
Bonsai 1-Bit Models Deliver Exceptional Local Inference Performance
-
If Your AI Agent Ran NPM Install During the Axios Attack, You're Compromised
-
Local AI Ecosystem Extends Far Beyond Ollama
-
Claw64 – Full Agentic Loop in <4KB on Commodore 64
-
PrismML Announces 1-Bit Bonsai: First Commercially Viable 1-Bit LLMs
-
Running AI on a Raspberry Pi, Part 2: Running AI on a Pi in Under 5 minutes
-
Dell Technologies Unveils 10 AI PC Models for Business, from Ultralight Laptops to Ultracompact Desktops
-
TurboQuant: Understanding the Quantization Breakthrough
-
Google's TurboQuant Shows Memory Constraints Remain Critical for Local LLM Inference
-
OLED Emerges as the Display Standard for Energy-Efficient AI Systems
-
IBM Granite 4.0 3B Vision: Compact Enterprise-Grade Document AI
-
ESP32-S31: 320MHz 2-Core Microcontroller with 512KB SRAM and Networking
-
HP Launches Copilot+ PCs in India with On-Device AI Capabilities for Local Inference
-
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
-
CERN Embeds Tiny AI Models in Silicon Chips for Real-Time LHC Data Filtering
-
This Wearable Runs an On-Device AI With 2-Week Battery Life
-
Comparison of Two Frameworks: 40% Token Efficiency Improvement
-
Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
-
Apple Gets Full Gemini Access and Uses Distillation to Build Lightweight On-Device AI
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
Samsung Galaxy A37 and A57 5G Launch with On-Device AI Capabilities in India
-
RF-DETR Nano and YOLO26 Enable On-Device Object Detection on Smartphones
-
Why Responsible AI Is the Bedrock of AI-Powered Applications
-
NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
-
Nota AI and SiMa.ai Partner on Physical AI Technology for Local Deployment
-
Meta Releases HyperAgents: Self-Improving AI
-
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost
-
Operating Systems. One USB. ZFS on Root. AI-Powered. Free
-
Google's TurboQuant: The Unsexy AI Breakthrough Worth Watching
-
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features
-
Google TurboQuant: Extreme Compression for Local LLM Deployment
-
Running an Open-Weight LLM Locally on an Apple Watch
-
New Open-Weight Models Released: GigaChat-3.1-Ultra and Lightning Variants
-
Lemonade 10.0.1 Improves Setup Process For Using AMD Ryzen AI NPUs On Linux
-
.APKs Are Just .ZIPs: Semi-Legally Hacking Software for Orphaned Hardware
-
Ultra-Large 400B-Class LLM Runs on iPhone in Test