Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1 Text Generation • 4B • Updated 10 days ago • 27
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1 Text Generation • 4B • Updated 10 days ago • 17
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1 Text Generation • 4B • Updated 11 days ago • 27
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1 Text Generation • 4B • Updated 11 days ago • 20
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1 Text Generation • 4B • Updated 11 days ago • 22
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1 Text Generation • 4B • Updated 11 days ago • 17
Ujan/lts_DeepMath-103K_samples_10000_seq_16384_Qwen3-30B-A3B-Thinking-2507_22_23_24_0.8 Viewer • Updated 5 days ago • 11k • 6
Ujan/lts_pruned_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_17_18_19_0.5 Viewer • Updated 5 days ago • 11k • 10
Ujan/lts_pruned_processed_DeepMath-103K_samples_50000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 6 days ago • 51k • 13
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.8 Viewer • Updated 6 days ago • 11k • 10
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 7 days ago • 11k • 29