Tagged "real-time-inference"
- What Type of AI Usage? Deployment Patterns and Implementation Considerations
- Waterloo's Live AI-Goose Tracker: Real-Time Edge Vision
- LlaMa.cpp Robot Wars
- 115 TOPS in 0.67L: CHUWI AuBox X Packs On-Device AI Power Into a Palm-Sized Mini PC
- DFlash Doubles Token Generation Speed of Qwen3.5 27B on Mac M5 Max
- Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference
- DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon
- CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI
- Lenovo Korea Launches AI-Powered Industrial Edge Solutions
- A Journey to a Reliable and Enjoyable Locally Hosted Voice Assistant
- AI-Native Store Research
- Arduino, Qualcomm Bring On-Device AI and Robotics Learning to Indian School Systems
- The Path to Ubiquitous AI (17k tokens/sec)