Tagged "google"

Tether AI Upgrades QVAC SDK With TurboQuant for Data Center-Sized Memory on Everyday Devices 2 June 2026
Phison and Intel Roll Out aiDAPTIV to Boost Local AI on Intel AI PC Platforms 2 June 2026
NVIDIA and Microsoft Team Up to Bring Secure On-Device AI Agents to Windows PCs 2 June 2026
Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Hermes Agent 2 June 2026
JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks 2 June 2026
Chrome Quietly Downloads 4GB AI Model for Local Processing 1 June 2026
Chrome Quietly Downloads 4GB AI Model Without User Permission 31 May 2026
Zoho-Backed Netrasemi Launches 12nm AI Chip, Mass Production Begins This Year 30 May 2026
Snapdragon C Debuts with 6nm Process and Dedicated On-Device AI Engine 30 May 2026
MediaTek Dimensity 7500 Brings On-Device AI and Enhanced Power Efficiency to Mid-Range Phones 30 May 2026
Chrome Silently Downloads 4GB AI Model for Local Inference Without User Consent 30 May 2026
Apple Doubles Down on On-Device AI at WWDC 2026, Setting Privacy-First Strategy 30 May 2026
Tweaking Local Language Model Settings with Ollama 29 May 2026
MediaTek Launches Dimensity 8550 4nm SoC with Integrated On-Device AI Focus 29 May 2026
Liquid AI Unveils Edge-Focused LFM2.5 Model for On-Device AI Agents 29 May 2026
The Infrastructure Behind Making Local LLM Agents Actually Useful 29 May 2026
Google Launches Tiny Board for Running Gemma 3 Locally 29 May 2026
Privacy-Focused Raspberry Pi Zero 2W DIY Security Camera with On-Device AI and End-to-End Encryption 28 May 2026
MCP Security Flaws Are Turning AI Infrastructure Into a Supply-Chain Risk 28 May 2026
Lenovo Bets on On-Device AI to Lift Business PC Upgrades 28 May 2026
MediaTek Dimensity 8550 Shifts Focus to Gemini Nano V3 and On-Device AI on Phones 28 May 2026
Alibaba Cloud Joins PyTorch Foundation as Platinum Member 28 May 2026
Dell Launches 14 Plus Laptop with Intel Core Ultra 9 and 32GB RAM at $1,499.99, Enabling Local Model Inference 26 May 2026
DeepSeek's Flagship V4 Pro Model Drops to 75% Lower Pricing, Increasing Competitive Pressure on Local Inference Economics 26 May 2026
Anker Soundcore Liberty 5 Pro Earbuds Feature Dedicated On-Device AI Chip with Touch Screen 26 May 2026
Gemma 4: A New Budget-Focused Model in Posit AI 25 May 2026
AI Guardrails Stripped From Meta and Google Models in Minutes 25 May 2026
Google Adds llms.txt Check to Chrome Lighthouse 24 May 2026
Google Chrome Raises Privacy Questions with 4GB AI Model Download 24 May 2026
110 Tokens/Second on RTX 4070 Super with Qwen 3.6 35B 22 May 2026
llama.cpp MTP Leak Fix Stabilizes Local AI Agents 22 May 2026
llama.cpp Checkpoint Fix Accelerates Local Coding Agents 22 May 2026
Google Makes Gemini 3.5 Flash the Default AI Model for Billions of Users 22 May 2026
Google's Cormac Brick on Tiny LLMs for On-Device Agents 21 May 2026
Occupy Wall Street Co-Founder Builds Offline-Running AI Organizing Mentor 20 May 2026
Meta Plans Agentic AI on Smartphones and Wearables by 2026 20 May 2026
Google Tensor SDK Beta with LiteRT Enables Efficient On-Device AI 20 May 2026
Google and Synaptics Partner on Coralboard for Immersive Edge AI Experiences 20 May 2026
Google's Offline AI App Gets Three Major Feature Upgrades 20 May 2026
I Stopped Trying to Replace My Cloud LLMs, and Local Models Finally Made Sense 19 May 2026
Samsung's Exynos 2800 Could Be the First Mobile Chip to Use HBM for Powerful On-Device AI 19 May 2026
On-Device AI to Be in 80% of Wearables by 2032 19 May 2026
llama.cpp Adds Multi-Token Prediction, Doubles Qwen 3.6B Throughput for Local Inference 19 May 2026
Chrome Is Quietly Downloading a 4GB AI Model Without Your Permission 19 May 2026
Running Large Language Models on Single-Board Computer Clusters: Creative Edge Deployment 18 May 2026
Samsung's Exynos 2800 Brings Significant On-Device AI Capabilities 18 May 2026
Local LLMs Offer Unique Advantages That Cloud AI Services Cannot Match 18 May 2026
Local LLMs Enable Intelligent Smart Camera Control Without Cloud Dependency 18 May 2026
AMD's Lemonade SDK Advances macOS Support for Local AI Inference with ROCm 7.13 18 May 2026
Google Limits Gemini Intelligence to New Flagships—Hardware Requirements for Local Deployment 17 May 2026
Chrome Quietly Downloads 4GB AI Model Without User Permission 17 May 2026
Orthrus Reshapes Economics of Local AI Inference with New Optimization Approach 16 May 2026
Local LLM Integration Enables Replacement of Paid Subscription Services 16 May 2026
DwarfStar 4: Native Inference Engine Optimized for DeepSeek V4 Flash 16 May 2026
Chrome Silently Downloads 4GB Gemini Nano Model Without User Consent 16 May 2026
Apple's M5 MacBook Air Advances On-Device AI with Redesigned Hardware 16 May 2026
Arm and Google Collaborate on On-Device AI Optimization Techniques 15 May 2026
Chrome Automatically Downloads 4GB AI Model for Local Processing 14 May 2026
BT Explainer: Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 13 May 2026
Gemma 4 Replaces Entire Local LLM Stack for Many Practitioners 12 May 2026
Chrome Silently Installs 4GB AI Model Without User Permission 12 May 2026
Chrome's On-Device AI Features Consuming 4GB of Storage for Gemini Nano 9 May 2026
Chrome Is Secretly Downloading 4GB Gemini Nano Model Without User Consent 9 May 2026
Perplexity Brings On-Device AI Workflow to Macs with 'Personal Computer' Feature 8 May 2026
Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally 8 May 2026
Local LLM Rewrites Resume Better Than ChatGPT, and It's Not Even Close 8 May 2026
Google Removes Privacy Assurances After Stuffing Devices With Their AI Model 8 May 2026
Google Releases Gemma 4 Multi-Token Prediction Drafters To Accelerate AI Inference 8 May 2026
Running Espressif's OpenClaw-Inspired AI Agent on ESP32 with Self-Hosted LLM Works in Practice 8 May 2026
Airplane AI – Local NDA Safe AI Powered by Gemma 8 May 2026
Google Chrome Downloads 4GB Gemini Nano Model Silently Without User Consent 7 May 2026
Microsoft VibeVoice C++ Port Enables Local Voice AI on CPU and GPU Without Python 6 May 2026
Sarvam Edge: Indian-Built AI Models Run Offline on Phones and Laptops Without Internet 6 May 2026
On-Device AI Market Poised for Explosive Growth as Major Tech Companies Invest Heavily 6 May 2026
Google Accelerates Gemma 4 Inference Speed 3x With Multi-Token Prediction Drafters 6 May 2026
Supercharging LLM Inference on Google TPUs: Achieving 3X Speedups With Diffusion-Style Speculative Decoding 5 May 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 5 May 2026
Major Smartphone Brands Introduce Advanced On-Device AI Features 4 May 2026
NordVPN Adds On-Device AI Voice Detector to Chrome Extension to Identify Synthetic Audio 4 May 2026
Google Explains Why AICore Storage Requirements Are Increasing on Android 4 May 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 4 May 2026
Anker's Thus Chip Puts AI On-Device, Promising Faster Responses And Better Privacy 4 May 2026
Home Assistant's Local LLM Support Outperforms Gemini for Home, and Google Knows It 3 May 2026
Google Drops COSMO: Experimental On-Device AI Assistant for Android 2 May 2026
Home Assistant's Local LLM Support Outperforms Gemini for Home Automation 1 May 2026
Google's Gemma 4 Brings Powerful AI Capabilities to Phones and Laptops 30 April 2026
Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop 28 April 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 27 April 2026
Plugable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations 26 April 2026
NVIDIA Adds Day-0 DeepSeek V4 Blackwell Support 26 April 2026
Elastic KV Cache Memory Breakthrough Enables Efficient Bursty LLM Serving and GPU Sharing 26 April 2026
Can IBM's RITS Platform and vLLM Reset the Bar for Enterprise AI Access? 26 April 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 26 April 2026
Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops 25 April 2026
Building Real-World On-Device AI with LiteRT and NPU 24 April 2026
Google's Gemma 4 Finally Makes Local LLM Deployment Compelling for Practitioners 22 April 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 21 April 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 19 April 2026
Google's Gemma 4: The Most Practical Local LLM Despite Not Being The Smartest 16 April 2026
Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference 15 April 2026
Running Gemma 4 on an iPhone 13 Pro 15 April 2026
Self-Hosted LLM Took Personal Knowledge Management System to the Next Level 13 April 2026
On-Device AI Inference Emerges as New Security Blind Spot for CISOs 13 April 2026
MiniMax M2.7 Open-Sources Globally as Industry's First Self-Improving Model 13 April 2026
Running Same Prompts Through Claude and Local LLM Revealed Unexpected Results 13 April 2026
ASUS Malaysia to Bring UGen300 USB AI Accelerator in Q2 for Portable On-Device AI Inferencing 13 April 2026
Self-Hosted LLM Elevates Personal Knowledge Management Systems to New Levels 12 April 2026
On-Device AI: Achieving Powerful AI Capabilities Without Internet Connectivity 12 April 2026
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications 12 April 2026
Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device 12 April 2026
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference 12 April 2026
The Best Local AI Model for Home Assistant Isn't Always the Biggest One 12 April 2026
Critical Unsloth Gemma-4 Chat Template Updates for Tool Calling 11 April 2026
Google's Gemini Nano 4 Offers Faster, Smarter Local Inference Capabilities 11 April 2026
LiteLLM Integrates with Ollama to Simplify Running 100+ Models Locally 8 April 2026
Google AI Edge Gallery Showcases Offline Inference with Gemma 4 8 April 2026
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS 8 April 2026
Docsie Launches On-Premise AI Platform for Regulated Industries 8 April 2026
Running AI Natively on Windows 11 Using an eGPU 7 April 2026
Quansloth Using Google's TurboQuant Breaks the VRAM Wall for Local LLMs 7 April 2026
PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring 7 April 2026
Your Next Assistant is Your PC: How On-Device AI is Transforming Work, One Workflow at a Time 7 April 2026
Google Launches Offline AI Dictation App for iOS with Gemma 7 April 2026
Gemma 4 26B Achieves Impressive Local Performance With Proper Configuration 7 April 2026
AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs 7 April 2026
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4 6 April 2026
Gemma 4 31B Achieves Exceptional Performance on Local Hardware 6 April 2026
Google Previews Gemini Nano 4 for Android AICore with On-Device Capabilities 5 April 2026
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models 5 April 2026
Samsung Launches Galaxy Book6 Series with NVIDIA RTX 5070 and On-Device AI 4 April 2026
NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment 4 April 2026
Google Launches Gemma 4 For Advanced On-Device AI 4 April 2026
5 Useful Docker Containers for Agentic Developers 4 April 2026
AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs 4 April 2026
NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs 3 April 2026
Google Gemma 4 Released with GGUF Quantizations 3 April 2026
Google Launches Gemma 4 Open Models for Local On-Device AI 3 April 2026
Gemma 4 Makes Local AI Agents Practical 3 April 2026
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment 3 April 2026
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs 3 April 2026
Gemini CLI – Open-Source AI Agent for Terminal Integration 1 April 2026
Google's TurboQuant Shows Memory Constraints Remain Critical for Local LLM Inference 29 March 2026
Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces 29 March 2026
TurboQuant KV Cache Compression Achieves 22.8% Faster Decoding at 32K Context 28 March 2026
Samsung Galaxy Book6 Series Brings Intel Core Ultra Chips for On-Device LLM Inference 28 March 2026
Prompt Security Challenges Emerge as Critical Concern for Local LLM Deployments 28 March 2026
HP Launches Copilot+ PCs in India with On-Device AI Capabilities for Local Inference 28 March 2026
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment 28 March 2026
Acer TravelMate AI Laptops Launch in UAE for Business On-Device Inference 28 March 2026
TurboQuant Benchmarked in llama.cpp: Google's Extreme Compression Research Tested in Practice 27 March 2026
RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra 27 March 2026
Samsung Galaxy A37 and A57 5G Launch with On-Device AI Capabilities in India 26 March 2026
Plugable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations 26 March 2026
Nota AI and SiMa.ai Partner on Physical AI Technology for Local Deployment 26 March 2026
Google's TurboQuant: The Unsexy AI Breakthrough Worth Watching 26 March 2026
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features 26 March 2026
Google TurboQuant: Extreme Compression for Local LLM Deployment 25 March 2026
Ultra-Large 400B-Class LLM Runs on iPhone in Test 25 March 2026
Open-Source AI Text-to-Speech Models You Can Run Locally for Natural Voice 24 March 2026
Korea to Deploy Domestic AI Chips in Smart Cities as NPU Trials Scale Up 23 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 12 March 2026
Quantization Explained: Q4_K_M vs AWQ vs FP16 for Local LLMs 12 March 2026
Nvidia Pushes Jetson as Edge Hub for Open AI Models 12 March 2026
The $1,500 Local AI Setup: DeepSeek-R1 on Consumer Hardware 12 March 2026
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup 12 March 2026
Google Delivers On-Device AI Features in New Chromebook Plus Model 10 March 2026
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust 10 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 9 March 2026
Qwen 3.5 Small Expands On-Device AI to Phones and IoT with Offline Support 9 March 2026
When Running Ollama on Your PC for Local AI, One Thing Matters More Than Most 9 March 2026
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026 9 March 2026
Snapdragon Wear Elite Unveiled at MWC 2026, Advancing Wearable AI Inference 8 March 2026
Samsung Opens Registration for Vision AI QLED and OLED Television Integration 8 March 2026
Mistral AI Prepares Workflows Integration for Le Chat 8 March 2026
HP Refreshes Lineup with AI-Focused Workstations 8 March 2026
Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference 8 March 2026
Windows 11 Notepad Gets On-Device AI Text Generation Without Subscription 7 March 2026
Self-Hosted Paperless-ngx With Optional Local AI Integration 7 March 2026
Building PyTorch-Native Support for IBM Spyre Accelerator 7 March 2026
Turning Your Linux Terminal into a Local AI Assistant 7 March 2026
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support 7 March 2026
Windows 11 Notepad to Feature On-Device AI Text Generation Without Subscription 6 March 2026
Building PyTorch-Native Support for IBM Spyre Accelerator 6 March 2026
OPPO and MediaTek Highlight On-Device AI Innovations at MWC 2026 6 March 2026
HyperExcel Seeks 150 Billion Won Series B to Scale LPU and Verda in Korea 6 March 2026
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support 6 March 2026
Unity Showcases Manufacturing AI Workflow at Smart Factory Expo 5 March 2026
MediaTek Advances Omni Model for Efficient Smartphone Inference 5 March 2026
Kakao Launches Kanana AI for On-Device Schedule and Recommendation Management 5 March 2026
Apple Unveils MacBook Pro with M5 Pro and M5 Max Featuring On-Device AI 5 March 2026
RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale 4 March 2026
Qualcomm Snapdragon Wear Elite Brings On-Device AI to Smartwatches 4 March 2026
On-Device AI Laptop Lineups Become Standard Across Major Manufacturers 4 March 2026
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI 4 March 2026
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market 4 March 2026
Qualcomm Snapdragon Wear Elite: 2B Parameter NPU for Personal AI Wearables 3 March 2026
Intel Arc Pro B70 Workstation GPU Confirmed via vLLM AI Release Notes 3 March 2026
Apple M4 iPad Air Targets AI Users with Double M1 Speed Performance 3 March 2026
AMD Ryzen AI 400 Series Desktop Processors Launch with Integrated 60 TOPS NPU 3 March 2026
Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17 3 March 2026
Running Local AI Models on Mac Studio 128GB: 4B, 20B & 120B Tested 2 March 2026
Qualcomm Launches Snapdragon Wear Elite for On-Device AI on Wearables 2 March 2026
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals 2 March 2026
AMD Expands Ryzen AI 400 Series Portfolio for Consumer and Enterprise AI PC Options 2 March 2026
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills 2 March 2026
Google Research Finds Longer Chain-of-Thought Correlates Negatively With Accuracy 1 March 2026
Apple Intelligence, Galaxy AI, Gemini: Why Your AI-Powered Phone Is Worth Repairing 1 March 2026
Snapdragon 8 Elite Gen 5 for Galaxy Official: 5 Key Improvements that Push the Boundaries 27 February 2026
Seco Launches Edge AI System-on-Module at Embedded World 2026 27 February 2026
Snapdragon 8 Elite Gen 5 Powers Galaxy S26 Series With Enhanced On-Device AI 27 February 2026
On-Device AI in Mobile Apps: What Should Run on the Phone vs the Cloud (A 2026 Decision Guide) 27 February 2026
On-Device Function Calling in Google AI Edge Gallery 27 February 2026
Arduino, Qualcomm Bring On-Device AI and Robotics Learning to Indian School Systems 27 February 2026
Arduino and Qualcomm Bring On-Device AI Learning to Indian Schools 27 February 2026
Android Phones Are Getting Smarter Without Internet — Here's Why On-Device AI Is the Next Big Shift 27 February 2026
Android Phones Are Getting Smarter Without Internet — On-Device AI as the Next Shift 27 February 2026
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy 24 February 2026
O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture 22 February 2026
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration 22 February 2026
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia 21 February 2026
24 Simultaneous Claude Code Agents on Local Hardware 21 February 2026
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services 18 February 2026
Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach 18 February 2026
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure 18 February 2026
Cloudflare Releases Agents SDK v0.5.0 with Rust-Powered Infire Engine for Edge Inference 18 February 2026
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs 18 February 2026