VisRAG 2.0: Evidence-Guided Multi-Image Reasoning in Visual Retrieval-Augmented Generation Paper • 2510.09733 • Published Oct 10 • 4
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Paper • 2509.18154 • Published Sep 16 • 51
Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning Aug 9 • 12
Running on Zero • Featured • Spark TTS 🌖 • 229 • A text-to-speech model powered by SparkAudio and Mobvoi.