Vision stuff - a CKeibel Collection

CKeibel 's Collections

Code-Embeddings

Speech2Text (ASR)

diffusion models

Text-Classification

Causal LMs, seq2seq models

Embedding models

BERT based tasks (models)

Vision stuff

updated Jun 19, 2024

HuggingFaceM4/idefics-9b-instruct

Text Generation • 9B • Updated Oct 12, 2023 • 1.95k • 107
liuhaotian/llava-v1.5-13b

Image-Text-to-Text • Updated May 9, 2024 • 36.3k • 528
llava-hf/llava-v1.6-34b-hf

Image-Text-to-Text • 35B • Updated Jan 27, 2025 • 9.08k • 93
openai/clip-vit-base-patch32

Zero-Shot Image Classification • Updated Feb 29, 2024 • 21.7M • 934
HuggingFaceM4/idefics-80b-instruct

Text Generation • Updated Oct 12, 2023 • 5.1k • 189
microsoft/Florence-2-base

Image-Text-to-Text • 0.2B • Updated Aug 4, 2025 • 2.08M • 369