Cool way to fine-tune that I wanted to share.
#9 opened 8 months ago by SuperbEmphasis
Model is decent when running with 6 active experts
#8 opened 9 months ago by userzyzz
Another question: How did you train this model?
#7 opened 9 months ago by marcuscedricridia
This is the first Qwen3 A3B model that doesn't immediately start repeating itself
3
#2 opened 9 months ago by SuperbEmphasis
Feedback after some use
7
#1 opened 9 months ago by AlecFoster