Commit History

Add model card with training details
08251c0
verified

thomasjhuang commited on

RLOO checkpoint at optimizer step 250 - Fixed prompt format, temp=0.1, lr=3e-6
e4ad155
verified

thomasjhuang commited on

RLOO checkpoint at optimizer step 250 - Fixed prompt format, temp=0.1, lr=3e-6
4b722d3
verified

thomasjhuang commited on

initial commit
0660638
verified

thomasjhuang commited on