Shail Shah
shail-2512
AI & ML interests
None yet
Organizations
LLMs
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 276k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 642k • • 568 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.62k • 74 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 92.9k • 673
Image Generation
3D
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 1.38k • 450 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 51.6k • 929 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 92.6k • 319 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 4.6M • • 2.72k
Reranking Models
ALMs (Audio Language Models)
TTS
Reasoning (LRMs)
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 54.3k • 562 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 480 • 1.7k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 57 • 53 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 179k • • 1.54k
Video Generation
Dataset to fine-tune Embeddings
Embedding Models
MultiModal (Any-to-Any)
ALMs (Audio Language Models)
LLMs
TTS
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 276k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 642k • • 568 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.62k • 74 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 92.9k • 673
Reasoning (LRMs)
Image Generation
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 54.3k • 562 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 480 • 1.7k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 57 • 53 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 179k • • 1.54k
3D
Video Generation
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 1.38k • 450 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 51.6k • 929 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 92.6k • 319 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 4.6M • • 2.72k
Dataset to fine-tune Embeddings
Reranking Models
Embedding Models