Fast inference for Blackwell GPUs
- ig1/Qwen2.5-VL-7B-Instruct-NVFP4 • Image-Text-to-Text • 5B • Updated • 10
- ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic • Image-Text-to-Text • 8B • Updated • 21
- ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic • Image-Text-to-Text • 33B • Updated • 13
- ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic • Image-Text-to-Text • 73B • Updated • 28
models: 13
- ig1/Qwen3-30B-A3B-Instruct-2507-NVFP4 • 17B • Updated • 207
- ig1/Qwen3-30B-A3B-NVFP4 • 17B • Updated • 4
- ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4 • Image-Text-to-Text • 18B • Updated • 2.1k • 2
- ig1/Qwen3-Coder-30B-A3B-Instruct-NVFP4 • Text Generation • 17B • Updated • 548 • 1
- ig1/Qwen2.5-VL-7B-Instruct-NVFP4 • Image-Text-to-Text • 5B • Updated • 10
- ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic • Image-Text-to-Text • 8B • Updated • 21
- ig1/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • Updated • 19.1k • 1
- ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic • Image-Text-to-Text • 33B • Updated • 13
- ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic • Image-Text-to-Text • 73B • Updated • 28
- ig1/r1-1776-AWQ • 671B • Updated • 5
datasets: 0
None public yet