Multimodal Implementations Collection Comprehensive Demo of Multimodal VLMs on the Hub • 26 items • Updated 1 day ago • 13
Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 96
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive models. • 10 items • Updated 16 days ago • 11
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published Jan 15 • 31
YOLO26 Models Collection YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demos and ONNX variants. • 42 items • Updated Jan 19 • 36
CoreML Collection Models for Apple devices. See https://github.com/FluidInference/FluidAudio for usage details • 12 items • Updated 2 days ago • 5
Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 110
Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 43
DictaLM 3.0 Collection Collection Dicta-LM 3.0 is a powerful open-weight collection of sovereign LLMs for Hebrew. • 24 items • Updated Dec 10, 2025 • 18
Article How to make NeuTTS-air generate over 200 seconds of audio in a single second. Nov 21, 2025 • 24