ads
sxcasf
AI & ML interests
None yet
Recent Activity
commented on a paper 6 days ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
upvoted a paper 20 days ago
A Survey of On-Policy Distillation for Large Language Models