Tagged "vision-language-models"
- Show HN: We built an OCR server that can process 270 dense images/s on a 5090
- CarryAI's Serverless Vision-Language Models Enable On-Device Multimodal AI
- Qwen 3.5 Ultra-Compact Models Enable On-Device AI from Watches to Gaming
- Qwen 3.5 Small Models Released: 0.8B to 9B Parameters Optimized for On-Device Inference
- NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support
- Running Local LLMs and VLMs on Arduino UNO Q with yzma