VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-GRPO-16bits-V1
Text Generation
Transformers
Safetensors
VaidikML0508/SharkTank-Offer-V1
English
llama
shark-tank
SFT
RL
GRPO
conversational
text-generation-inference
License: llama3.2
main
Commit History
Update README.md
35cf07b
verified
VaidikML0508
committed on
Apr 22
Trained with Unsloth
6cb13c9
verified
VaidikML0508
committed on
Apr 22
Trained with Unsloth
636db9d
verified
VaidikML0508
committed on
Apr 22
Upload tokenizer
75a4ffa
verified
VaidikML0508
committed on
Apr 22
Upload README.md with huggingface_hub
44007dc
verified
VaidikML0508
committed on
Apr 22
initial commit
edb5b3b
verified
VaidikML0508
committed on
Apr 22