sumitdotml
/

moe-emergence

Text Generation

mixture-of-experts

expert-specialization

Model card Files Files and versions

moe-emergence / no-lb-ablation

5.44 GB

Ctrl+K

Ctrl+K

2 contributors

History: 8 commits

sumitdotml's picture

Upload no-lb-ablation/ckpt-step-500.pt with huggingface_hub

fddc934 verified 3 months ago

best-model.json

818 Bytes
Upload no-lb-ablation/best-model.json with huggingface_hub 3 months ago
best-model.safetensors

1.18 GB
xet

Upload no-lb-ablation/best-model.safetensors with huggingface_hub 3 months ago
ckpt-step-500.pt

3.08 GB
xet

Upload no-lb-ablation/ckpt-step-500.pt with huggingface_hub 3 months ago
config.json

559 Bytes
Upload no-lb-ablation/config.json with huggingface_hub 3 months ago
final-model.json

808 Bytes
Upload no-lb-ablation/final-model.json with huggingface_hub 3 months ago
final-model.safetensors

1.18 GB
xet

Upload no-lb-ablation/final-model.safetensors with huggingface_hub 3 months ago
metrics.jsonl

173 kB
Upload no-lb-ablation/metrics.jsonl with huggingface_hub 3 months ago
run_summary.json

163 Bytes
Upload no-lb-ablation/run_summary.json with huggingface_hub 3 months ago