Tagged "edge-deployment"
-
New Era of On-Device AI Driven by High-Speed UFS 5.0 Storage
-
Red Hat Launches AI Enterprise for Hybrid AI Deployments
-
PyTorch Foundation Announces New Members as Agentic AI Demand Grows
-
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices
-
Show HN: MCP-Enabled File Storage for AI Agents, Auth via Ethereum Wallet
-
Show HN: 100% LLM Accuracy–No Fine-Tuning, JSON Only
-
What Breaks When AI Agent Frameworks Are Forced Into <1MB RAM and Sub-ms Startup
-
Show HN: A Ground Up TLS 1.3 Client Written in C
-
Mirai Tech Raises $10 Million for On-Device AI Innovation
-
No, Local LLMs Can't Replace ChatGPT or Gemini — I Tried
-
Kioxia Sampling UFS 5.0 Embedded Flash Memory for Next-Generation Mobile Applications
-
Enhanced Interface Speed Enables High-Performance On-Device AI Features in Smartphones
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Apple Accelerates U.S. Manufacturing with Mac Mini Production
-
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy
-
Comparing Manual vs. AI Requirements Gathering: 2 Sentences vs. 127-Point Spec
-
Which Web Frameworks Are Most Token-Efficient for AI Agents?
-
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
-
South Korea to Launch $687 Million Project to Develop On-Device AI Semiconductors
-
Qwen3's Voice Embeddings Enable Local Voice Cloning and Mathematical Voice Manipulation
-
Custom Portable Workstation Optimized for Local AI Inference Builds
-
Nvidia Could Launch Its First Laptops With Its Own Processors
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities
-
Open-Source llama.cpp Finds Long-Term Home at Hugging Face
-
GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
-
Gix: Go CLI for AI-Generated Commit Messages
-
Future of Mobile AI: What On-Device Intelligence Means for App Developers
-
Future of Mobile AI: What On-Device Intelligence Means for App Developers
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
-
Yet Another Fix Coming for Older AMD GPUs on Linux – Thanks to Valve Developer
-
AI Is Stress Testing Processor Architectures and RISC-V Fits the Moment
-
Ollama 0.17 Released With Improved OpenClaw Onboarding
-
How Slow Local LLMs Are on My Framework 13 AMD Strix Point
-
At India AI Impact Summit, Intel Showcases AI PCs and Cost-Efficient Frugal AI
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration
-
GGML Joins Hugging Face: What This Means for Local Model Optimization
-
DietPi Released a New Version v10.1
-
CPU-Trained Language Model Outperforms GPU Baseline After 40 Hours
-
Asus ExpertBook B3 G2 with 50 TOPS AI Sets New Enterprise Standard
-
AI PCs Explained: 7 Critical Truths About NPUs and Privacy
-
Vellium v0.3.5: Major Writing Mode Overhaul and Native KoboldCpp Support
-
Taalas Etches AI Models onto Transistors to Rocket Boost Inference
-
I Run Local LLMs in One of the World's Priciest Energy Markets, and I Can Barely Tell
-
Qwen3 Coder Next Remains Effective at Aggressive Quantization Levels
-
[Release] Ouro-2.6B-Thinking: ByteDance's Recurrent Model Now Runnable Locally
-
At India AI Impact Summit, Intel Showcases Its AI PCs and Cost-Efficient Frugal AI
-
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia
-
Open-Source + AI: ggml Joins Hugging Face, llama.cpp Stays Open—Local AI's Long-Term Home
-
GGML.AI Acquired by Hugging Face
-
Apple Researchers Develop On-Device AI Agent That Interacts With Apps for You
-
Sarvam Brings AI to Feature Phones, Cars, and Smart Glasses
-
Running Local LLMs and VLMs on Arduino UNO Q with yzma
-
Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI
-
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
-
Clipthesis: Free Local App for Video Tagging and Search Across Drives
-
Why My Country's AI Scene Is Built on Sand
-
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services
-
Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
-
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure
-
GLM-5 Technical Report: DSA Innovation Reduces Training and Inference Costs
-
Matmul-Free Language Model Trained on CPU in 1.2 Hours
-
Cloudflare Releases Agents SDK v0.5.0 with Rust-Powered Infire Engine for Edge Inference
-
Can We Leverage AI/LLMs for Self-Learning?
-
Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong?
-
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs
-
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Chinese AI Chipmaker Axera Semiconductor Plans $379 Million Hong Kong IPO for Edge Inference Hardware
-
ASUS Zenbook 14 Launches in India with AI-Capable Hardware, Starting at Rs 1,15,990
-
Asus ExpertBook B3 G2 Laptop Features Ryzen AI 9 HX 470 CPU in 1.41kg Ultraportable Form Factor
-
Ask HN: What is the best bang for buck budget AI coding?
-
I broke into my own AI system in 10 minutes. I built it
-
Sourdine: Open-Source macOS App for 100% Local AI Transcription
-
Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
-
Simile AI Raises $100M Series A for Local AI Infrastructure
-
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
-
Samsung's REAM: Alternative Model Compression Technique
-
Running Mistral-7B on Intel NPU Achieves 12.6 Tokens/Second
-
Memio Launches AI-Powered Knowledge Hub for Android with Local Processing
-
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
-
Energy-Based Models Compared Against Frontier AI for Sudoku Solving
-
Arm SME2 Technology Expands CPU Capabilities for On-Device AI