Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
nm-research updated a dataset 1 day ago: inference-optimization/laguna-xs-ultrachat-responses
ChibuUkachi updated a model 1 day ago: inference-optimization/MiniMax-M2.5.w8a8
MeganEFlynn updated a dataset 1 day ago: inference-optimization/laguna-xs-ultrachat-responses
Team members: 15
inference-optimization's models: 307
inference-optimization/Qwen3-8B_6_bits_mode_noise • 7B • Updated Mar 12 • 12
inference-optimization/Qwen3-8B_6_bits_mode_hybrid • 7B • Updated Mar 12 • 8
inference-optimization/Qwen3-8B_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 7
inference-optimization/Qwen3-8B_5.5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Qwen3-8B_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 8
inference-optimization/Qwen3-8B_5_bits_mode_heuristic • 6B • Updated Mar 12 • 8
inference-optimization/Qwen3-8B_5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Qwen3-8B_5_bits_mode_hybrid • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_heuristic • 7B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_noise • 7B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_hybrid • 7B • Updated Mar 12 • 10
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_heuristic • 7B • Updated Mar 12 • 15
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_noise • 7B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_hybrid • 7B • Updated Mar 12 • 6
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_heuristic • 6B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_noise • 6B • Updated Mar 12 • 10
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_hybrid • 6B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 13
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_heuristic • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_noise • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_hybrid • 6B • Updated Mar 12 • 11
inference-optimization/sarvam-105b-FP8-Dynamic • Text Generation • 106B • Updated Mar 9 • 3
inference-optimization/sarvam-30b-FP8-Dynamic • Text Generation • 32B • Updated Mar 9 • 62 • 1
inference-optimization/sarvam-30b-NVFP4 • Text Generation • 19B • Updated Mar 9 • 24 • 1
inference-optimization/sarvam-105b-NVFP4 • 61B • Updated Mar 9 • 4 • 1
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic • 35B • Updated Mar 6 • 11
inference-optimization/Kimi-K2-Instruct-0905-BF16-FP8-BLOCK • Text Generation • 1T • Updated Mar 6 • 6
inference-optimization/gpt-oss-20b-FP8-Dynamic • 21B • Updated Mar 5 • 12 • 1