microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 397k • 1.54k
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17 • 141k • 1.6k
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 64
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Paper • 2502.20730 • Published Feb 28 • 38