Tagged "google"
-
Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop
-
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Pluggable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations
-
NVIDIA Adds Day-0 DeepSeek V4 Blackwell Support
-
Elastic KV Cache Memory Breakthrough Enables Efficient Bursty LLM Serving and GPU Sharing
-
Can IBM's RITS Platform and vLLM Reset the Bar for Enterprise AI Access?
-
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
-
Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops
-
Building Real-World On-Device AI with LiteRT and NPU
-
Google's Gemma 4 Finally Makes Local LLM Deployment Compelling for Practitioners
-
Gemma 4 Just Replaced My Whole Local LLM Stack
-
Gemma 4 Just Replaced My Whole Local LLM Stack
-
Google's Gemma 4: The Most Practical Local LLM Despite Not Being The Smartest
-
Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference
-
Running Gemma 4 on an iPhone 13 Pro
-
Self-Hosted LLM Took Personal Knowledge Management System to the Next Level
-
On-Device AI Inference Emerges as New Security Blind Spot for CISOs
-
MiniMax M2.7 Open-Sources Globally as Industry's First Self-Improving Model
-
Running Same Prompts Through Claude and Local LLM Revealed Unexpected Results
-
ASUS Malaysia to Bring UGen300 USB AI Accelerator in Q2 for Portable On-Device AI Inferencing
-
Self-Hosted LLM Elevates Personal Knowledge Management Systems to New Levels
-
On-Device AI: Achieving Powerful AI Capabilities Without Internet Connectivity
-
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications
-
Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device
-
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference
-
The Best Local AI Model for Home Assistant Isn't Always the Biggest One
-
Critical Unsloth Gemma-4 Chat Template Updates for Tool Calling
-
Google's Gemini Nano 4 Offers Faster, Smarter Local Inference Capabilities
-
LiteLLM Integrates with Ollama to Simplify Running 100+ Models Locally
-
Google AI Edge Gallery Showcases Offline Inference with Gemma 4
-
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS
-
Docsie Launches On-Premise AI Platform for Regulated Industries
-
Running AI Natively on Windows 11 Using an eGPU
-
Quansloth Using Google's Turboquant Breaks the VRAM Wall for Local LLMs
-
PyTorch Foundation Welcomes Helion as a Foundation-Hosted Project to Standardize Open, Portable, and Accessible AI Kernel Authoring
-
Your Next Assistant is Your PC: How On-Device AI is Transforming Work, One Workflow at a Time
-
Google Launches Offline AI Dictation App for iOS with Gemma
-
Gemma 4 26B Achieves Impressive Local Performance With Proper Configuration
-
AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs
-
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4
-
Gemma 4 31B Achieves Exceptional Performance on Local Hardware
-
Google Previews Gemini Nano 4 for Android AICore with On-Device Capabilities
-
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
-
Samsung Launches Galaxy Book6 Series with NVIDIA RTX 5070 and On-Device AI
-
NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment
-
Google Launches Gemma 4 For Advanced On-Device AI
-
5 Useful Docker Containers for Agentic Developers
-
AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs
-
NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs
-
Google Gemma 4 Released with GGUF Quantizations
-
Google Launches Gemma 4 Open Models for Local On-Device AI
-
Gemma 4 Makes Local AI Agents Practical
-
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
-
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs
-
Gemini CLI – Open-Source AI Agent for Terminal Integration
-
Google's TurboQuant Shows Memory Constraints Remain Critical for Local LLM Inference
-
Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces
-
TurboQuant KV Cache Compression Achieves 22.8% Faster Decoding at 32K Context
-
Samsung Galaxy Book6 Series Brings Intel Core Ultra Chips for On-Device LLM Inference
-
Prompt Security Challenges Emerge as Critical Concern for Local LLM Deployments
-
HP Launches Copilot+ PCs in India with On-Device AI Capabilities for Local Inference
-
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
-
Acer TravelMate AI Laptops Launch in UAE for Business On-Device Inference
-
TurboQuant Benchmarked in Llama.cpp: Google's Extreme Compression Research Tested in Practice
-
RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra
-
Samsung Galaxy A37 and A57 5G Launch with On-Device AI Capabilities in India
-
Pluggable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations
-
Nota AI and SiMa.ai Partner on Physical AI Technology for Local Deployment
-
Google's TurboQuant: The Unsexy AI Breakthrough Worth Watching
-
Apple Plans Slimmed-Down Gemini Models for Local iPhone AI Features
-
Google TurboQuant: Extreme Compression for Local LLM Deployment
-
Ultra-Large 400B-Class LLM Runs on iPhone in Test
-
Open-Source AI Text-to-Speech Models You Can Run Locally for Natural Voice
-
Korea to Deploy Domestic AI Chips in Smart Cities as NPU Trials Scale Up
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
Quantization Explained: Q4_K_M vs AWQ vs FP16 for Local LLMs
-
Nvidia Pushes Jetson as Edge Hub for Open AI Models
-
The $1,500 Local AI Setup: DeepSeek-R1 on Consumer Hardware
-
Local AI Coding Assistant: Complete VS Code + Ollama + Continue Setup
-
Google Delivers On-Device AI Features in New Chromebook Plus Model
-
Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
Qwen 3.5 Small Expands On-Device AI to Phones and IoT with Offline Support
-
When Running Ollama on Your PC for Local AI, One Thing Matters More Than Most
-
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026
-
Snapdragon Wear Elite Unveiled at MWC 2026, Advancing Wearable AI Inference
-
Samsung Opens Registration for Vision AI QLED and OLED Television Integration
-
Mistral AI Prepares Workflows Integration for Le Chat
-
HP Refreshes Lineup with AI-Focused Workstations
-
Apple Launches MacBook Neo with A18 Pro Chip for Affordable Local AI Inference
-
Windows 11 Notepad Gets On-Device AI Text Generation Without Subscription
-
Self-Hosted Paperless-ngx With Optional Local AI Integration
-
Building PyTorch-Native Support for IBM Spyre Accelerator
-
Turning Your Linux Terminal into a Local AI Assistant
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Windows 11 Notepad to Feature On-Device AI Text Generation Without Subscription
-
Building PyTorch-Native Support for IBM Spyre Accelerator
-
OPPO and MediaTek Highlight On-Device AI Innovations at MWC 2026
-
HyperExcel Seeks 150 Billion Won Series B to Scale LPU and Verda in Korea
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
Unity Showcases Manufacturing AI Workflow at Smart Factory Expo
-
MediaTek Advances Omni Model for Efficient Smartphone Inference
-
Kakao Launches Kanana AI for On-Device Schedule and Recommendation Management
-
Apple Unveils MacBook Pro with M5 Pro and M5 Max Featuring On-Device AI
-
RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale
-
Qualcomm Snapdragon Wear Elite Brings On-Device AI to Smartwatches
-
On-Device AI Laptop Lineups Become Standard Across Major Manufacturers
-
Apple Unveils MacBook Pro With M5 Pro and M5 Max for On-Device AI
-
AMD Launches Copilot+ Desktop Chips to Compete in On-Device AI Market
-
Qualcomm Snapdragon Wear Elite: 2B Parameter NPU for Personal AI Wearables
-
Intel Arc Pro B70 Workstation GPU Confirmed via vLLM AI Release Notes
-
Apple M4 iPad Air Targets AI Users with Double M1 Speed Performance
-
AMD Ryzen AI 400 Series Desktop Processors Launch with Integrated 60 TOPS NPU
-
Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17
-
Running Local AI Models on Mac Studio 128GB: 4B, 20B & 120B Tested
-
Qualcomm Launches Snapdragon Wear Elite for On-Device AI on Wearables
-
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals
-
AMD Expands Ryzen AI 400 Series Portfolio for Consumer and Enterprise AI PC Options
-
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills
-
Google Research Finds Longer Chain-of-Thought Correlates Negatively With Accuracy
-
Apple Intelligence, Galaxy AI, Gemini: Why Your AI-Powered Phone Is Worth Repairing
-
Snapdragon 8 Elite Gen 5 for Galaxy Official: 5 Key Improvements that Push the Boundaries
-
Seco Launches Edge AI System-on-Module at Embedded World 2026
-
Snapdragon 8 Elite Gen 5 Powers Galaxy S26 Series With Enhanced On-Device AI
-
On-Device AI in Mobile Apps: What Should Run on the Phone vs the Cloud (A 2026 Decision Guide)
-
On-Device Function Calling in Google AI Edge Gallery
-
Arduino, Qualcomm Bring On-Device AI and Robotics Learning to Indian School Systems
-
Arduino and Qualcomm Bring On-Device AI Learning to Indian Schools
-
Android Phones Are Getting Smarter Without Internet — Here's Why On-Device AI Is the Next Big Shift
-
Android Phones Are Getting Smarter Without Internet — On-Device AI as the Next Shift
-
Anthropic Has Never Open-Sourced an LLM: Implications for Local Deployment Strategy
-
O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture
-
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration
-
Google Is Exploring Ways to Use Its Financial Might to Take on Nvidia
-
24 Simultaneous Claude Code Agents on Local Hardware
-
Tailscale Releases New Tool to Prevent Sensitive Data Leakage to Cloud AI Services
-
Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
-
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure
-
Cloudflare Releases Agents SDK v0.5.0 with Rust-Powered Infire Engine for Edge Inference
-
AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs