Xiaoyu Tan
WIlliam1900
AI & ML interests
None yet
Recent Activity
authored
a paper
11 days ago
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive
Exploration for Agentic Reinforcement Learning