Update inference-cache-config/trn1/granite.json 1fdf53e verified dacorvo HF Staff committed on Nov 26, 2025
Update inference-cache-config/trn1/mixtral.json 8343560 verified dacorvo HF Staff committed on Oct 21, 2025
Update inference-cache-config/trn1/mixtral.json e64396b verified dacorvo HF Staff committed on Oct 21, 2025
Delete inference-cache-config/mistral.json cdc796b verified dacorvo HF Staff committed on Oct 20, 2025
Delete inference-cache-config/mistral-variants.json 3551ea0 verified dacorvo HF Staff committed on Oct 20, 2025
Update inference-cache-config/llama-variants.json a510ca8 verified dacorvo HF Staff committed on Oct 13, 2025
Update inference-cache-config/qwen-moe.json 51619c0 verified dacorvo HF Staff committed on Sep 24, 2025
Update inference-cache-config/qwen-moe.json 6266253 verified dacorvo HF Staff committed on Sep 17, 2025
Rename inference-cache-config/qwen3-moe.json to inference-cache-config/qwen-moe.json 24ae643 verified dacorvo HF Staff committed on Sep 2, 2025
Update inference-cache-config/qwen3-moe.json 5e828c9 verified dacorvo HF Staff committed on Aug 28, 2025
Add batch size 4 configurations for LLama 1B and 3B models 3b6312a verified dacorvo HF Staff committed on Jun 25, 2025
Rename inference-cache-config/pixart_sigma_xl_512x512.json to inference-cache-config/pixart-sigma-xl-512x512.json 1d662ce verified Jingya HF Staff committed on Jun 22, 2025
Rename inference-cache-config/pixart-xl-2-512x512.json to inference-cache-config/pixart-alpha-xl-512x512.json cb11624 verified Jingya HF Staff committed on Jun 22, 2025
Rename inference-cache-config/pixArt-XL-2-512x512.json to inference-cache-config/pixart-xl-2-512x512.json c7f992d verified Jingya HF Staff committed on Jun 22, 2025
Create stable-diffusion-xl-refiner-1.0.json aa72a1a verified Jingya HF Staff committed on Jun 22, 2025
Rename inference-cache-config/diffusion.json to inference-cache-config/stable-diffusion-v1-5.json 4a034bb verified Jingya HF Staff committed on Jun 22, 2025
Added TinyLlama as requested by Jim burtoft d9640f4 verified dacorvo HF Staff committed on May 12, 2025
Add DeepSeek distilled versions of LLama 8B 509e6bf verified dacorvo HF Staff committed on Jan 29, 2025
Update inference-cache-config/qwen2.5-large.json 84982b8 verified dacorvo HF Staff committed on Jan 28, 2025
Rename inference-cache-config/qwen-2.5-large.json to inference-cache-config/qwen2.5-large.json 2aa52ac verified dacorvo HF Staff committed on Dec 4, 2024
Rename inference-cache-config/qwen2.5 to inference-cache-config/qwen2.5.json b9f1fde verified dacorvo HF Staff committed on Dec 4, 2024