Arina Puchkova

rinapch

AI & ML interests

NLP, RL

Recent Activity

upvoted a paper about 1 month ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

upvoted a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

upvoted a collection 7 months ago

New activity in rinapch/distilbert-media-bias over 2 years ago

Adding `safetensors` variant of this model

#1 opened over 2 years ago by