Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CKeibel
's Collections
SLMs
PII
Code-Embeddings
Speech2Text (ASR)
Seq2Seq
Reward Models
diffusion models
Text-Classification
Data
PEFT (Papers)
LLMs (Papers)
Causal LMs, seq2seq models
Embedding models
Vision stuff
datasets
NER
BERT based tasks (models)
Multimodal
Multimodal
updated
Apr 15, 2025
Upvote
-
HuggingFaceM4/idefics-80b-instruct
Text Generation
•
Updated
Oct 12, 2023
•
4.58k
•
189
liuhaotian/llava-v1.5-13b
Image-Text-to-Text
•
Updated
May 9, 2024
•
38.5k
•
528
llava-hf/llava-v1.6-34b-hf
Image-Text-to-Text
•
35B
•
Updated
Jan 27, 2025
•
7.97k
•
94
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
8B
•
Updated
Oct 14, 2024
•
108k
•
623
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Dec 10, 2025
•
234k
•
971
google/paligemma-3b-pt-224
Image-Text-to-Text
•
Updated
Sep 21, 2024
•
139k
•
444
jinaai/jina-clip-v1
Feature Extraction
•
0.2B
•
Updated
Apr 8
•
75.5k
•
256
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Jan 12, 2025
•
3.95M
•
502
llamaindex/vdr-2b-multi-v1
Image-Text-to-Text
•
2B
•
Updated
Apr 8
•
940
•
128
Upvote
-
Share collection
View history
Collection guide
Browse collections