Running on Zero Featured 99 SAM3 Video Segmentation 🐠 99 Track and label objects in videos using text prompts or clicks
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 29 days ago • 198k • 1.56k
Running on Zero MCP Featured 209 ViTPose Transformers ⚡ 209 Detect and estimate human poses in images and videos
Running on Zero Featured 573 Chat with DeepSeek-VL2-small 🌍 573 Generate responses using images and text input
Running on Zero Featured 111 VLM Object Understanding 🦀 111 Explore object detection, visual grounding, keypoint Detecti