view post Post 9756 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation
Mar 6 Releases tencent/Penguin-VL-2B Text Generation • 2B • Updated 5 days ago • 1.28k • 32 KORMo-VL/KORMo-VL-Diffusion Updated 11 days ago • 10 • 16 Lightricks/LTX-2.3 Image-to-Video • Updated about 15 hours ago • 597k • 636 sarvamai/sarvam-30b Text Generation • 32B • Updated 6 days ago • 35.6k • 165
Feb 27 Releases Qwen/Qwen3.5-122B-A10B-FP8 Image-Text-to-Text • 125B • Updated 18 days ago • 275k • 68 cyankiwi/Qwen3.5-27B-AWQ-BF16-INT4 Image-Text-to-Text • 12B • Updated 19 days ago • 33.2k • 28 Aratako/Irodori-TTS-500M Text-to-Speech • 0.5B • Updated 19 days ago • 49 LiconStudio/VBVR-wan2.2-comfy-bf16 Updated 18 days ago • 7.08k • 22
Mar 6 Releases tencent/Penguin-VL-2B Text Generation • 2B • Updated 5 days ago • 1.28k • 32 KORMo-VL/KORMo-VL-Diffusion Updated 11 days ago • 10 • 16 Lightricks/LTX-2.3 Image-to-Video • Updated about 15 hours ago • 597k • 636 sarvamai/sarvam-30b Text Generation • 32B • Updated 6 days ago • 35.6k • 165
Feb 27 Releases Qwen/Qwen3.5-122B-A10B-FP8 Image-Text-to-Text • 125B • Updated 18 days ago • 275k • 68 cyankiwi/Qwen3.5-27B-AWQ-BF16-INT4 Image-Text-to-Text • 12B • Updated 19 days ago • 33.2k • 28 Aratako/Irodori-TTS-500M Text-to-Speech • 0.5B • Updated 19 days ago • 49 LiconStudio/VBVR-wan2.2-comfy-bf16 Updated 18 days ago • 7.08k • 22
Running on CPU Upgrade 18 Daggr Image To 3d 👀 Convert images into 3D assets with background removal and enhancement
Running on Zero Featured 111 SAM3 Video Segmentation 🐠 Track and label objects in videos using text prompts or clicks