Post
385
This super detailed tutorial by
@Paulescu
is pure gold ๐ช "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv"
LFM2-350M ( @LiquidAI ) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control ๐ค
https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser
LFM2-350M ( @LiquidAI ) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control ๐ค
https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser